Donato Capitella
|
c7f4ffc346
|
updated rpc benchmakrs with long context
|
2025-11-19 07:35:56 +00:00 |
|
Donato Capitella
|
1d88fca07d
|
added long context benchmakrs for RPC
|
2025-11-18 10:43:17 +00:00 |
|
Donato Capitella
|
d19875828c
|
add script to compare performance with/without forcing the hipblaslt path
|
2025-11-18 08:45:25 +00:00 |
|
Donato Capitella
|
ccf29e6b22
|
fixed naming convention
|
2025-11-17 23:09:04 +00:00 |
|
Donato Capitella
|
1d6d48fae1
|
updated benchmarks
|
2025-11-17 23:02:56 +00:00 |
|
Donato Capitella
|
ad32126872
|
Updating retries for run)_benchmark
|
2025-11-17 17:53:53 +00:00 |
|
Donato Capitella
|
12f057612b
|
restoring correct llama-bench flags
|
2025-11-17 16:00:10 +00:00 |
|
Donato Capitella
|
de02a53d96
|
restored correct benchmark behaviour
|
2025-11-17 15:55:31 +00:00 |
|
Donato Capitella
|
f62c6e47c5
|
updated benchmark script to cover HBLASLT for all rocm backends
|
2025-11-17 15:30:19 +00:00 |
|
Donato Capitella
|
1bb4c1f0cc
|
improve logic to check if a benchmakr as already been run
|
2025-11-17 11:01:19 +00:00 |
|
Donato Capitella
|
de49d65b3c
|
Ensure ROCBLAS_USE_HIPBLASLT is properly set remotely as well
|
2025-11-17 09:33:14 +00:00 |
|
Donato Capitella
|
17b1ec2825
|
run llama-bench INSIDe container (vibe coding is tiring)
|
2025-11-17 09:26:56 +00:00 |
|
Donato Capitella
|
6d8ac6d6f4
|
remove user from ssh script
|
2025-11-17 09:10:52 +00:00 |
|
Donato Capitella
|
1eade84757
|
fixed remote targets
|
2025-11-17 09:09:15 +00:00 |
|
Donato Capitella
|
1e184979df
|
remove user from default host
|
2025-11-17 09:05:13 +00:00 |
|
Donato Capitella
|
ecbe5c14c3
|
Fixed model path resolution
|
2025-11-17 08:57:42 +00:00 |
|
Donato Capitella
|
a50adb0c15
|
add benchmakr script for RPC
|
2025-11-17 08:27:54 +00:00 |
|
Donato Capitella
|
67fb3a002b
|
Updated benchmarks
|
2025-11-15 08:36:25 +00:00 |
|
Donato Capitella
|
1d945f2c21
|
change llama-bench retries to 3
|
2025-11-12 14:19:47 +00:00 |
|
Donato Capitella
|
79479ec596
|
adding rocm7alpha to the benchmarks
|
2025-11-12 14:06:00 +00:00 |
|
Donato Capitella
|
f93c88b792
|
limit llama-bench to 1 try to support longer context
|
2025-11-12 13:57:24 +00:00 |
|
Donato Capitella
|
11048c22f2
|
update benchmark script
|
2025-11-12 13:48:42 +00:00 |
|
Donato Capitella
|
2f2b1b33af
|
added Qwen3-Coder-30B-A3B-Instruct_Q4_K_M to the benchmarks
|
2025-10-20 20:05:56 +01:00 |
|
Donato Capitella
|
765cc5c733
|
updated benchs
|
2025-10-12 07:39:42 +01:00 |
|
Donato Capitella
|
ba88675b9c
|
Updated benchmarkls with ROCm 6.4.4
|
2025-09-28 09:38:04 +01:00 |
|
Donato Capitella
|
7dd4490398
|
updated benchmark script
|
2025-09-27 20:21:14 +01:00 |
|
Donato Capitella
|
006aaa64e1
|
Updated benchmarks
|
2025-09-17 10:41:14 +01:00 |
|
Donato Capitella
|
1acda69224
|
updated benhcmakrs with reference llama 2 model
|
2025-08-18 22:25:28 +01:00 |
|
Donato Capitella
|
b71a37647f
|
Updated benchmakrs, removed old toolboxes and results
|
2025-08-17 12:32:08 +01:00 |
|
Donato Capitella
|
62e5080102
|
Updated benchmarks
|
2025-08-17 08:53:16 +01:00 |
|
Donato Capitella
|
d09179fcab
|
Updating recurring wmma typo
|
2025-08-10 14:15:03 +01:00 |
|
Donato Capitella
|
a9618d881b
|
- Corrected typo in WMMA (was spelt wrong as waam)
- Included rocm-7rc-rocwmma toolbox
- Included updated results from benchmarks including rocm 7rc with ROMWMMA and hipBLASLt
|
2025-08-10 13:21:06 +01:00 |
|
Donato Capitella
|
e49efe221e
|
Changed to light theme and improved parsinf of mdoel paramater number.
|
2025-08-09 15:43:07 +01:00 |
|
Donato Capitella
|
f194848b26
|
Better summary results, uncluding flash attention settings.
|
2025-08-09 11:58:42 +01:00 |
|
Donato Capitella
|
995ad2cd38
|
Updated benchmarks
|
2025-08-09 11:50:27 +01:00 |
|
Donato Capitella
|
ff0a307389
|
Updated key benchmark findings
|
2025-08-09 11:47:51 +01:00 |
|
Donato Capitella
|
bc9483b75d
|
Adding new benchmarks
|
2025-08-09 11:25:44 +01:00 |
|
Donato Capitella
|
8972ef01ff
|
adding raw benchmark results
|
2025-08-09 10:44:09 +01:00 |
|
Donato Capitella
|
3710de5d17
|
Added link to YouTube video and updated benchmarks
|
2025-08-06 19:14:42 +01:00 |
|
Donato Capitella
|
c534a1b1ee
|
Updated benchmark scirpt to skip combinations that have already been benchmarked
|
2025-08-06 18:16:30 +01:00 |
|
Donato Capitella
|
e7e27e6cf3
|
Benchmark and container updates
|
2025-08-03 13:05:52 +01:00 |
|