Commit Graph

28 Commits

Author SHA1 Message Date
Donato Capitella 3e3f3674a8 feat: add MTP (Multi-Token Prediction) support via new ROCm 7.2.3 and Vulkan RADV toolboxes 2026-05-14 20:09:03 +01:00
Noah Bliss ed889def9d Support Debian in toolbox/distrobox command check (#88)
* Support Debian in toolbox command check

Debian requires distrobox, like Ubuntu. This is a simple change to add support for it.

* Consolidation of logic

Copilot complained, so I used this as an opportunity to collapse unnecessary evals.
2026-05-12 12:25:59 +01:00
Donato Capitella 2e3dc657d2 chore: update ROCm version to 7.2.3 and remove deprecated pr21344 toolbox 2026-05-11 19:40:30 +01:00
Donato Capitella 73be068e85 feat: upgrade ROCm toolboxes to 7.2.2 and update documentation and CI configurations 2026-04-26 16:25:44 +01:00
Donato Capitella 2c2c36d3da add rocm-7.2.1-pr21344 toolbox (gfx1151 MMQ/MMVQ tile + nwarp tuning)
Adds a new toolbox variant based on PR #21344 (pedapudi/llama.cpp@gfx1151-opt)
which tunes MMQ tile sizes (x_max=48, y=64) and warp counts (nwarps=4) for
RDNA3_5 gfx1151, yielding up to +100% prefill throughput at small batch sizes.

Also adds BMI2/FMA/F16C CPU SIMD flags and GGML_CUDA_FA_ALL_QUANTS=ON to match
the benchmark build used in the PR. Wire up CI (build matrix + prune), the
refresh script, and run_benchmarks.sh so results land alongside rocm-7.2.1.
2026-04-15 09:23:58 +01:00
Donato Capitella 4ac481e7d1 chore: upgrade ROCm version from 7.2 to 7.2.1 across configuration and documentation 2026-04-09 18:33:52 +01:00
Donato Capitella ec245b9b17 feat: Implement OS-aware toolbox command selection (toolbox vs distrobox) in the script and clarify usage in the README. 2026-02-22 19:47:30 +00:00
Donato Capitella 13a0189b6a Simplify image cleanup by removing explicit logic for older tagged image versions. 2026-02-14 11:27:59 +00:00
Donato Capitella 06fc789eba chore: deprecate and remove ROCm 7.1.1 toolbox and all associated references. 2026-02-04 17:56:41 +00:00
Donato Capitella 1807e8cff2 Adding ROCm 7.2 backend 2026-01-23 08:07:40 +00:00
Donato Capitella 783998589e Clean up of legacy toolboxes, removal of rocwmma, and rename of rocm7-alpha to rocm-7nightlies. Added new benchmarks 2026-01-10 10:31:04 +00:00
Donato Capitella 9ba6812003 feat: upgrade ROCm to 7.1.1 and update associated tooling and documentation 2025-12-07 09:30:14 +00:00
Donato Capitella 40a47116a9 Merge remote-tracking branch 'origin/main' into pr-20 2025-11-12 13:19:56 +00:00
Niccolò Belli 32f7667dca fix: typo in refresh-toolboxes.sh 2025-11-12 08:40:55 +01:00
Niccolò Belli abeec3526c fix: rocm-7alpha pipelines and refresh-toolboxes.sh 2025-11-12 08:35:11 +01:00
Donato Capitella e6ee164bf5 update refresh script 2025-10-11 18:35:18 +01:00
Donato Capitella 7e45d40a06 typo 2025-09-27 20:06:32 +01:00
Donato Capitella 11a0f232e3 updated refresh script and add base rocm-6.4.4 without ROCWMMA 2025-09-27 19:19:30 +01:00
Donato Capitella f4506ca711 remove rocm6.4.2 from refresh script 2025-09-16 12:34:38 +01:00
Donato Capitella 7c41016c39 added +x execution flag 2025-09-16 12:23:52 +01:00
Donato Capitella e7eb5062bf Updated refresh script to remove dangling images 2025-09-16 12:23:25 +01:00
Donato Capitella 5589e1e335 added llama-rocm-7rc-rocwmma-fa_all_quants to refresh script 2025-09-16 11:59:07 +01:00
Donato Capitella 7641ae0127 reducing size of rocm-6.4.3 2025-08-16 11:48:42 +01:00
Donato Capitella 551d14b11d Adding rocm-6.4.3 to README and to refresh script. Adding hipBLASLt. 2025-08-12 07:18:35 +01:00
Donato Capitella a9618d881b - Corrected typo in WMMA (was spelt wrong as waam)
- Included rocm-7rc-rocwmma toolbox
- Included updated benchmark results, including rocm 7rc with ROCWMMA and hipBLASLt
2025-08-10 13:21:06 +01:00
Donato Capitella 0d6b2dc731 Adding rocm7-rocwaam toolbox 2025-08-09 19:21:52 +01:00
Donato Capitella 1c10985265 Updated README 2025-08-09 10:31:39 +01:00
Donato Capitella 4dd44db6fe Adding script to auto-refresh toolboxes 2025-08-06 16:13:24 +01:00