Commit Graph

36 Commits

Author SHA1 Message Date
Donato Capitella ad32126872 Updating retries for run_benchmark 2025-11-17 17:53:53 +00:00
Donato Capitella 12f057612b restoring correct llama-bench flags 2025-11-17 16:00:10 +00:00
Donato Capitella de02a53d96 restored correct benchmark behaviour 2025-11-17 15:55:31 +00:00
Donato Capitella f62c6e47c5 updated benchmark script to cover HBLASLT for all rocm backends 2025-11-17 15:30:19 +00:00
Donato Capitella 1bb4c1f0cc improve logic to check if a benchmark has already been run 2025-11-17 11:01:19 +00:00
Donato Capitella de49d65b3c Ensure ROCBLAS_USE_HIPBLASLT is properly set remotely as well 2025-11-17 09:33:14 +00:00
Donato Capitella 17b1ec2825 run llama-bench INSIDE container (vibe coding is tiring) 2025-11-17 09:26:56 +00:00
Donato Capitella 6d8ac6d6f4 remove user from ssh script 2025-11-17 09:10:52 +00:00
Donato Capitella 1eade84757 fixed remote targets 2025-11-17 09:09:15 +00:00
Donato Capitella 1e184979df remove user from default host 2025-11-17 09:05:13 +00:00
Donato Capitella ecbe5c14c3 Fixed model path resolution 2025-11-17 08:57:42 +00:00
Donato Capitella a50adb0c15 add benchmark script for RPC 2025-11-17 08:27:54 +00:00
Donato Capitella 67fb3a002b Updated benchmarks 2025-11-15 08:36:25 +00:00
Donato Capitella 1d945f2c21 change llama-bench retries to 3 2025-11-12 14:19:47 +00:00
Donato Capitella 79479ec596 adding rocm7alpha to the benchmarks 2025-11-12 14:06:00 +00:00
Donato Capitella f93c88b792 limit llama-bench to 1 try to support longer context 2025-11-12 13:57:24 +00:00
Donato Capitella 11048c22f2 update benchmark script 2025-11-12 13:48:42 +00:00
Donato Capitella 2f2b1b33af added Qwen3-Coder-30B-A3B-Instruct_Q4_K_M to the benchmarks 2025-10-20 20:05:56 +01:00
Donato Capitella 765cc5c733 updated benchs 2025-10-12 07:39:42 +01:00
Donato Capitella ba88675b9c Updated benchmarks with ROCm 6.4.4 2025-09-28 09:38:04 +01:00
Donato Capitella 7dd4490398 updated benchmark script 2025-09-27 20:21:14 +01:00
Donato Capitella 006aaa64e1 Updated benchmarks 2025-09-17 10:41:14 +01:00
Donato Capitella 1acda69224 updated benchmarks with reference llama 2 model 2025-08-18 22:25:28 +01:00
Donato Capitella b71a37647f Updated benchmarks, removed old toolboxes and results 2025-08-17 12:32:08 +01:00
Donato Capitella 62e5080102 Updated benchmarks 2025-08-17 08:53:16 +01:00
Donato Capitella d09179fcab Updating recurring wmma typo 2025-08-10 14:15:03 +01:00
Donato Capitella a9618d881b Corrected typo in WMMA (was spelt wrong as waam); included rocm-7rc-rocwmma toolbox; included updated results from benchmarks including rocm 7rc with ROCWMMA and hipBLASLt 2025-08-10 13:21:06 +01:00
Donato Capitella e49efe221e Changed to light theme and improved parsing of model parameter number. 2025-08-09 15:43:07 +01:00
Donato Capitella f194848b26 Better summary results, including flash attention settings. 2025-08-09 11:58:42 +01:00
Donato Capitella 995ad2cd38 Updated benchmarks 2025-08-09 11:50:27 +01:00
Donato Capitella ff0a307389 Updated key benchmark findings 2025-08-09 11:47:51 +01:00
Donato Capitella bc9483b75d Adding new benchmarks 2025-08-09 11:25:44 +01:00
Donato Capitella 8972ef01ff adding raw benchmark results 2025-08-09 10:44:09 +01:00
Donato Capitella 3710de5d17 Added link to YouTube video and updated benchmarks 2025-08-06 19:14:42 +01:00
Donato Capitella c534a1b1ee Updated benchmark script to skip combinations that have already been benchmarked 2025-08-06 18:16:30 +01:00
Donato Capitella e7e27e6cf3 Benchmark and container updates 2025-08-03 13:05:52 +01:00