Donato Capitella
d0281bb526
feat: apply upstream llama.cpp patch to fix Gemma-4 inference issues
2026-04-06 10:25:42 +01:00
Donato Capitella
bbd8f02014
build: remove -DGGML_CUDA_DISABLE_FUSION=1 from cmake configuration in rocm7-nightlies Dockerfile (this was for a temporary test)
2026-04-03 15:21:58 +01:00
Donato Capitella
b376d1558b
build: disable GGML CUDA fusion in ROCm build configuration (temporary test)
2026-04-03 15:16:12 +01:00
Donato Capitella
ca84f4cbf3
patch: increasing MAX_REPETITION_THRESHOLD to allow complex agentic workflows
2026-03-25 09:23:19 +00:00
Donato Capitella
5f4698c959
build: Remove amdgpu-unroll-threshold-local CMAKE_HIP_FLAG from ROCm 7 nightlies Dockerfile.
2026-03-03 12:54:45 +00:00
Trevor Starick
be936d6b59
feat: add REPO/BRANCH build args for llama.cpp (#59)
- Introduce ARG REPO and ARG BRANCH to replace the hardcoded git clone with: `git clone -b ${BRANCH} --single-branch --recursive ${REPO}`. This allows overriding the llama.cpp repository and branch at build time via `--build-arg`.
- Update `docs/building.md` to recommend using `--build-arg` instead of updating the file.
2026-02-17 19:29:48 +00:00
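As a rough illustration of the REPO/BRANCH override described in be936d6b59, the sketch below prints the clone line the Dockerfile would run for a given pair of values, plus a matching `docker build` invocation. The helper names `clone_cmd` and `build_cmd`, the example URL, the branch name, and the Dockerfile path are all placeholders assumed for illustration; the project's actual defaults are not shown in this log.

```shell
#!/bin/sh
# Illustrative only: mirrors the clone line quoted in the commit message.
# clone_cmd and build_cmd are hypothetical helpers, not part of the repository.

# Print the clone command the Dockerfile would run for a given REPO and BRANCH.
clone_cmd() {
    repo="$1"
    branch="$2"
    printf 'git clone -b %s --single-branch --recursive %s\n' "$branch" "$repo"
}

# Print a docker build invocation that overrides both build args.
# The Dockerfile path is passed in because this log does not name it in full.
build_cmd() {
    repo="$1"
    branch="$2"
    dockerfile="$3"
    printf 'docker build --build-arg REPO=%s --build-arg BRANCH=%s -f %s .\n' \
        "$repo" "$branch" "$dockerfile"
}

# Example values are placeholders, not real defaults from the project.
clone_cmd "https://example.com/fork/llama.cpp.git" "my-branch"
build_cmd "https://example.com/fork/llama.cpp.git" "my-branch" "path/to/Dockerfile"
```

Passing the values with `--build-arg` keeps the Dockerfile itself unchanged, which is the workflow the commit says `docs/building.md` now recommends.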
Donato Capitella
606bc292b9
attempting other ways to apply LLVM patch to rocm7
2026-02-04 16:59:43 +00:00
Donato Capitella
bd8069fe2f
remove AI slop and use correct envs to pass flags to HIP compiler
2026-02-04 16:21:32 +00:00
Donato Capitella
7ffa22d8de
fix: Add temporary workaround for ROCm 7 performance regression by setting HIP_LLVM_FLAGS.
2026-02-04 14:50:32 +00:00
Donato Capitella
ea03c773c6
adding procps-ng to the toolbox runtime
2026-01-15 09:43:05 +00:00
Donato Capitella
783998589e
clean up of legacy toolboxes, removal of rocwmma, and rename of rocm7-alpha to rocm7-nightlies. Added new benchmarks
2026-01-10 10:31:04 +00:00