amd-strix-halo-toolboxes

Author	SHA1	Message	Date
Donato Capitella	ad32126872	Updating retries for run)_benchmark	2025-11-17 17:53:53 +00:00
Donato Capitella	12f057612b	restoring correct llama-bench flags	2025-11-17 16:00:10 +00:00
Donato Capitella	de02a53d96	restored correct benchmark behaviour	2025-11-17 15:55:31 +00:00
Donato Capitella	f62c6e47c5	updated benchmark script to cover HBLASLT for all rocm backends	2025-11-17 15:30:19 +00:00
Donato Capitella	1bb4c1f0cc	improve logic to check if a benchmakr as already been run	2025-11-17 11:01:19 +00:00
Donato Capitella	de49d65b3c	Ensure ROCBLAS_USE_HIPBLASLT is properly set remotely as well	2025-11-17 09:33:14 +00:00
Donato Capitella	17b1ec2825	run llama-bench INSIDe container (vibe coding is tiring)	2025-11-17 09:26:56 +00:00
Donato Capitella	6d8ac6d6f4	remove user from ssh script	2025-11-17 09:10:52 +00:00
Donato Capitella	1eade84757	fixed remote targets	2025-11-17 09:09:15 +00:00
Donato Capitella	1e184979df	remove user from default host	2025-11-17 09:05:13 +00:00
Donato Capitella	ecbe5c14c3	Fixed model path resolution	2025-11-17 08:57:42 +00:00
Donato Capitella	a50adb0c15	add benchmakr script for RPC	2025-11-17 08:27:54 +00:00
Donato Capitella	67fb3a002b	Updated benchmarks	2025-11-15 08:36:25 +00:00
Donato Capitella	1d945f2c21	change llama-bench retries to 3	2025-11-12 14:19:47 +00:00
Donato Capitella	79479ec596	adding rocm7alpha to the benchmarks	2025-11-12 14:06:00 +00:00
Donato Capitella	f93c88b792	limit llama-bench to 1 try to support longer context	2025-11-12 13:57:24 +00:00
Donato Capitella	11048c22f2	update benchmark script	2025-11-12 13:48:42 +00:00
Donato Capitella	2f2b1b33af	added Qwen3-Coder-30B-A3B-Instruct_Q4_K_M to the benchmarks	2025-10-20 20:05:56 +01:00
Donato Capitella	765cc5c733	updated benchs	2025-10-12 07:39:42 +01:00
Donato Capitella	ba88675b9c	Updated benchmarkls with ROCm 6.4.4	2025-09-28 09:38:04 +01:00
Donato Capitella	7dd4490398	updated benchmark script	2025-09-27 20:21:14 +01:00
Donato Capitella	006aaa64e1	Updated benchmarks	2025-09-17 10:41:14 +01:00
Donato Capitella	1acda69224	updated benhcmakrs with reference llama 2 model	2025-08-18 22:25:28 +01:00
Donato Capitella	b71a37647f	Updated benchmakrs, removed old toolboxes and results	2025-08-17 12:32:08 +01:00
Donato Capitella	62e5080102	Updated benchmarks	2025-08-17 08:53:16 +01:00
Donato Capitella	d09179fcab	Updating recurring wmma typo	2025-08-10 14:15:03 +01:00
Donato Capitella	a9618d881b	- Corrected typo in WMMA (was spelt wrong as waam) - Included rocm-7rc-rocwmma toolbox - Included updated results from benchmarks including rocm 7rc with ROMWMMA and hipBLASLt	2025-08-10 13:21:06 +01:00
Donato Capitella	e49efe221e	Changed to light theme and improved parsinf of mdoel paramater number.	2025-08-09 15:43:07 +01:00
Donato Capitella	f194848b26	Better summary results, uncluding flash attention settings.	2025-08-09 11:58:42 +01:00
Donato Capitella	995ad2cd38	Updated benchmarks	2025-08-09 11:50:27 +01:00
Donato Capitella	ff0a307389	Updated key benchmark findings	2025-08-09 11:47:51 +01:00
Donato Capitella	bc9483b75d	Adding new benchmarks	2025-08-09 11:25:44 +01:00
Donato Capitella	8972ef01ff	adding raw benchmark results	2025-08-09 10:44:09 +01:00
Donato Capitella	3710de5d17	Added link to YouTube video and updated benchmarks	2025-08-06 19:14:42 +01:00
Donato Capitella	c534a1b1ee	Updated benchmark scirpt to skip combinations that have already been benchmarked	2025-08-06 18:16:30 +01:00
Donato Capitella	e7e27e6cf3	Benchmark and container updates	2025-08-03 13:05:52 +01:00

36 Commits