Commit Graph

73 Commits

Author SHA1 Message Date
Nicholas Burr 9418384c5f Updated HF_HUB_ENABLE_HF_TRANSFER to HF_XET_HIGH_PERFORMANCE. (#56) 2026-02-09 20:16:21 +00:00
Donato Capitella 632130a2c3 fix: Correct typo in "buy me a coffee" link in README. 2026-02-06 06:53:19 +00:00
Donato Capitella 033585368c fix ToC 2026-02-05 19:46:50 +00:00
Donato Capitella 4a97c47c4f docs: add project context and support sections to README. 2026-02-05 17:53:32 +00:00
dougs f28dee87ef Update README with correction to model download and inference instructions (#54)
Updated instructions for downloading model files to include both parts of the example GGUF
2026-02-05 11:21:47 +00:00
Donato Capitella eb92804284 feat: move and expand host configuration details with updated kernel parameters and explanations, and add a warning note. 2026-02-04 18:39:07 +00:00
Donato Capitella 9c6946e4b5 docs: add Table of Contents to README. 2026-02-04 18:34:21 +00:00
Donato Capitella 616d034bc6 typo 2026-02-04 18:32:24 +00:00
Donato Capitella 4d09c88011 tidy up README 2026-02-04 18:31:39 +00:00
Donato Capitella 3684e49a9d docs: update README to announce the application of a workaround for the ROCm 7 performance regression. 2026-02-04 18:05:10 +00:00
Donato Capitella 06fc789eba chore: deprecate and remove ROCm 7.1.1 toolbox and all associated references. 2026-02-04 17:56:41 +00:00
Donato Capitella 51aab9665d docs: Add a performance regression warning for Llama.cpp with ROCm 7.1+ or nightly builds. 2026-02-03 10:57:08 +00:00
Donato Capitella 62904f60dd update 2026-01-25 09:22:49 +00:00
Donato Capitella f5b3a2dfb9 udpated README 2026-01-25 09:22:05 +00:00
Donato Capitella 9511598be4 added 7.2 2026-01-23 09:19:22 +00:00
Donato Capitella 8da5395366 added script to manage cluster 2026-01-14 17:01:00 +00:00
Donato Capitella 7268e95b0f updates 2026-01-12 11:05:31 +00:00
Arnav Gupta 259bca04de small typo fix in 'interactive' word (#34)
sorry for just 1 letter pull-request, it just bothers me every time I try to find the link to open the benchmark results 🙈
2026-01-11 08:19:16 +00:00
Donato Capitella 783998589e neclean up of legacy toolboxes, removal of rocwmma and renamed rocm7-alpha to rocm-7nightlies. Added new benchmarks 2026-01-10 10:31:04 +00:00
Donato Capitella f0e9bc8865 adding warning 2026-01-08 14:47:14 +00:00
Donato Capitella 758f7e8b50 Update 2025-12-22 16:35:08 +00:00
Donato Capitella 9ba6812003 feat: upgrade ROCm to 7.1.1 and update associated tooling and documentation 2025-12-07 09:30:14 +00:00
Donato Capitella 62b0e5e173 updated recommendation for unified memory to reserve 4Gb to the base os. 2025-11-30 12:40:05 +00:00
Donato Capitella cbfa74b25b udpated Ubuntu instructions 2025-11-26 17:12:36 +00:00
Donato Capitella 5001413bcc more typos 2025-11-18 08:40:49 +00:00
Donato Capitella 6dc6423034 typo 2025-11-18 08:35:18 +00:00
Donato Capitella 140ba2d035 Updated README 2025-11-18 08:33:54 +00:00
Donato Capitella 0c0835e64c disable 15405 PR for rocmwmma-improved 2025-11-16 10:14:41 +00:00
S. Neuhaus 4ec72fa8f4 Fix command syntax for llama-cli usage 2025-10-30 18:11:28 +01:00
Donato Capitella 80e5683162 fixed-typo 2025-09-30 10:27:53 +01:00
Donato Capitella c2624c5cbe moved update notice up 2025-09-28 09:39:14 +01:00
Donato Capitella ba88675b9c Updated benchmarkls with ROCm 6.4.4 2025-09-28 09:38:04 +01:00
Donato Capitella 2663bd346c restore correct video link 2025-09-22 11:45:16 +01:00
Donato Capitella c4d34a21c6 Added Youtube Video 2025-09-22 11:43:28 +01:00
Donato Capitella c87360cf46 ubuntu users guidance update 2025-09-11 11:52:42 +01:00
Donato Capitella 1acda69224 updated benhcmakrs with reference llama 2 model 2025-08-18 22:25:28 +01:00
Donato Capitella b71a37647f Updated benchmakrs, removed old toolboxes and results 2025-08-17 12:32:08 +01:00
Donato Capitella 62e5080102 Updated benchmarks 2025-08-17 08:53:16 +01:00
Donato Capitella 551d14b11d Adding rocm-6.4.3 to README and to refresh script. Adding hipBLASLt. 2025-08-12 07:18:35 +01:00
Donato Capitella 2d55e329dc Fixed typo 2025-08-10 20:59:14 +01:00
Donato Capitella a9618d881b - Corrected typo in WMMA (was spelt wrong as waam)
- Included rocm-7rc-rocwmma toolbox
- Included updated results from benchmarks including rocm 7rc with ROMWMMA and hipBLASLt
2025-08-10 13:21:06 +01:00
Donato Capitella 65214aedfa Remove AI slop 2025-08-09 11:59:52 +01:00
Donato Capitella f194848b26 Better summary results, uncluding flash attention settings. 2025-08-09 11:58:42 +01:00
Donato Capitella 995ad2cd38 Updated benchmarks 2025-08-09 11:50:27 +01:00
Donato Capitella ff0a307389 Updated key benchmark findings 2025-08-09 11:47:51 +01:00
Donato Capitella bc9483b75d Adding new benchmarks 2025-08-09 11:25:44 +01:00
Donato Capitella 1c10985265 Updted README 2025-08-09 10:31:39 +01:00
Donato Capitella 3bea478db5 Poll workflow to trigger toolbox build automatically on llama.cpp master changes 2025-08-08 09:36:36 +01:00
Donato Capitella 3710de5d17 Added link to YouTube video and updated benchmarks 2025-08-06 19:14:42 +01:00
Donato Capitella 4dd44db6fe Adding script to auto-refresh toolboxes 2025-08-06 16:13:24 +01:00