Donato Capitella
73be068e85
feat: upgrade ROCm toolboxes to 7.2.2 and update documentation and CI configurations
2026-04-26 16:25:44 +01:00
Donato Capitella
2c2c36d3da
add rocm-7.2.1-pr21344 toolbox (gfx1151 MMQ/MMVQ tile + nwarp tuning)
...
Adds a new toolbox variant based on PR #21344 (pedapudi/llama.cpp@gfx1151-opt)
which tunes MMQ tile sizes (x_max=48, y=64) and warp counts (nwarps=4) for
RDNA3_5 gfx1151, yielding up to +100% prefill throughput at small batch sizes.
Also adds BMI2/FMA/F16C CPU SIMD flags and GGML_CUDA_FA_ALL_QUANTS=ON to match
the benchmark build used in the PR. Wire up CI (build matrix + prune), the
refresh script, and run_benchmarks.sh so results land alongside rocm-7.2.1.
2026-04-15 09:23:58 +01:00
Donato Capitella
1dea385f6a
fix: remove trailing backtick causing syntax error in prune-old-toolboxes workflow
2026-04-09 19:00:44 +01:00
Donato Capitella
4ac481e7d1
chore: upgrade ROCm version from 7.2 to 7.2.1 across configuration and documentation
2026-04-09 18:33:52 +01:00
Donato Capitella
1807e8cff2
Adding ROCm 7.2 backend
2026-01-23 08:07:40 +00:00
Donato Capitella
783998589e
neclean up of legacy toolboxes, removal of rocwmma and renamed rocm7-alpha to rocm-7nightlies. Added new benchmarks
2026-01-10 10:31:04 +00:00
Donato Capitella
6b08c48d91
adjust toolbox pruning script to remove 6.4.4 old ones
2025-11-16 10:06:19 +00:00
Donato Capitella
40a47116a9
Merge remote-tracking branch 'origin/main' into pr-20
2025-11-12 13:19:56 +00:00
Niccolò Belli
abeec3526c
fix: rocm-7alpha pipelines and refresh-toolboxes.sh
2025-11-12 08:35:11 +01:00
Niccolò Belli
8b2cc6a0cc
feat: rocm-7alpha
2025-11-09 15:57:19 +01:00
Dave Reynolds
81a5743bfb
Add basic support for ROCm 7.1 release
2025-11-08 18:39:05 +00:00
Donato Capitella
b02ffb55f7
adding rocm-6.4.4
2025-09-27 18:57:03 +01:00
Donato Capitella
e86bf4c418
Adding bilding steps for FA ALL QUANTS
2025-09-16 11:07:46 +01:00
Donato Capitella
cf63be86bf
auto-run prune job on completion of build and publish job
2025-08-24 21:32:15 +01:00
Donato Capitella
8628db44d3
Adding rocm-6.4.3 toolboxes
2025-08-11 16:13:38 +01:00
Donato Capitella
9163cd7c60
Adding action to prune old toolboxes
2025-08-10 13:30:38 +01:00