Commit Graph

15 Commits

Author SHA1 Message Date
Donato Capitella 2c2c36d3da add rocm-7.2.1-pr21344 toolbox (gfx1151 MMQ/MMVQ tile + nwarp tuning)
Adds a new toolbox variant based on PR #21344 (pedapudi/llama.cpp@gfx1151-opt)
which tunes MMQ tile sizes (x_max=48, y=64) and warp counts (nwarps=4) for
RDNA3_5 gfx1151, yielding up to +100% prefill throughput at small batch sizes.

Also adds BMI2/FMA/F16C CPU SIMD flags and GGML_CUDA_FA_ALL_QUANTS=ON to match
the benchmark build used in the PR. Wire up CI (build matrix + prune), the
refresh script, and run_benchmarks.sh so results land alongside rocm-7.2.1.
2026-04-15 09:23:58 +01:00
Donato Capitella 1dea385f6a fix: remove trailing backtick causing syntax error in prune-old-toolboxes workflow 2026-04-09 19:00:44 +01:00
Donato Capitella 4ac481e7d1 chore: upgrade ROCm version from 7.2 to 7.2.1 across configuration and documentation 2026-04-09 18:33:52 +01:00
Donato Capitella 1807e8cff2 Adding ROCm 7.2 backend 2026-01-23 08:07:40 +00:00
Donato Capitella 783998589e clean up of legacy toolboxes, removal of rocwmma and renamed rocm7-alpha to rocm-7nightlies. Added new benchmarks 2026-01-10 10:31:04 +00:00
Donato Capitella 6b08c48d91 adjust toolbox pruning script to remove 6.4.4 old ones 2025-11-16 10:06:19 +00:00
Donato Capitella 40a47116a9 Merge remote-tracking branch 'origin/main' into pr-20 2025-11-12 13:19:56 +00:00
Niccolò Belli abeec3526c fix: rocm-7alpha pipelines and refresh-toolboxes.sh 2025-11-12 08:35:11 +01:00
Niccolò Belli 8b2cc6a0cc feat: rocm-7alpha 2025-11-09 15:57:19 +01:00
Dave Reynolds 81a5743bfb Add basic support for ROCm 7.1 release 2025-11-08 18:39:05 +00:00
Donato Capitella b02ffb55f7 adding rocm-6.4.4 2025-09-27 18:57:03 +01:00
Donato Capitella e86bf4c418 Adding building steps for FA ALL QUANTS 2025-09-16 11:07:46 +01:00
Donato Capitella cf63be86bf auto-run prune job on completion of build and publish job 2025-08-24 21:32:15 +01:00
Donato Capitella 8628db44d3 Adding rocm-6.4.3 toolboxes 2025-08-11 16:13:38 +01:00
Donato Capitella 9163cd7c60 Adding action to prune old toolboxes 2025-08-10 13:30:38 +01:00