Commit Graph

249 Commits

Author SHA1 Message Date
Donato Capitella f5b9cd84a2 updated instructions to downgrade kernel 2026-01-11 18:26:26 +00:00
Donato Capitella d6c7456bd0 adding system info to benchmark display 2026-01-11 10:04:05 +00:00
Arnav Gupta 259bca04de small typo fix in 'interactive' word (#34)
sorry for just 1 letter pull-request, it just bothers me every time I try to find the link to open the benchmark results 🙈
2026-01-11 08:19:16 +00:00
Donato Capitella eff3a8ede8 updated downgrade instructions 2026-01-10 11:04:10 +00:00
Donato Capitella 783998589e neclean up of legacy toolboxes, removal of rocwmma and renamed rocm7-alpha to rocm-7nightlies. Added new benchmarks 2026-01-10 10:31:04 +00:00
Donato Capitella f0e9bc8865 adding warning 2026-01-08 14:47:14 +00:00
Donato Capitella 758f7e8b50 Update 2025-12-22 16:35:08 +00:00
Donato Capitella 2c8a1e2eef updated benchmarks 2025-12-21 18:49:08 +00:00
Donato Capitella 9ba6812003 feat: upgrade ROCm to 7.1.1 and update associated tooling and documentation 2025-12-07 09:30:14 +00:00
Donato Capitella 7584a31548 updated Qwen Next Benchmarks 2025-12-05 08:32:58 +00:00
Donato Capitella 62b0e5e173 updated recommendation for unified memory to reserve 4Gb to the base os. 2025-11-30 12:40:05 +00:00
Donato Capitella 7f34f51202 Added Qwen-3-Next benchmarks 2025-11-28 17:50:21 +00:00
Donato Capitella df54882433 remove manual application of RPC performance PR (this is merged into master now) 2025-11-28 14:20:03 +00:00
Donato Capitella cbfa74b25b udpated Ubuntu instructions 2025-11-26 17:12:36 +00:00
Donato Capitella 1b5ced1255 make PR-15405 application explicit in logs 2025-11-25 10:02:32 +00:00
Donato Capitella 5105f6cf10 add flag to remvoe RPC PR (for testing) 2025-11-24 17:16:06 +00:00
Donato Capitella c7f4ffc346 updated rpc benchmakrs with long context 2025-11-19 07:35:56 +00:00
Donato Capitella 1d88fca07d added long context benchmakrs for RPC 2025-11-18 10:43:17 +00:00
Donato Capitella d19875828c add script to compare performance with/without forcing the hipblaslt path 2025-11-18 08:45:25 +00:00
kevinjohncolo f631e45674 Create docker-compose-how-to.md (#28)
* Create docker-compose-how-to.md

Simple how-to on using docker compose instead of toolbox.

* Update docker-compose-how-to.md
2025-11-18 08:43:22 +00:00
Donato Capitella 5001413bcc more typos 2025-11-18 08:40:49 +00:00
Donato Capitella 6dc6423034 typo 2025-11-18 08:35:18 +00:00
Donato Capitella 140ba2d035 Updated README 2025-11-18 08:33:54 +00:00
Donato Capitella ccf29e6b22 fixed naming convention 2025-11-17 23:09:04 +00:00
Donato Capitella 1d6d48fae1 updated benchmarks 2025-11-17 23:02:56 +00:00
Donato Capitella ad32126872 Updating retries for run)_benchmark 2025-11-17 17:53:53 +00:00
Donato Capitella 12f057612b restoring correct llama-bench flags 2025-11-17 16:00:10 +00:00
Donato Capitella de02a53d96 restored correct benchmark behaviour 2025-11-17 15:55:31 +00:00
Donato Capitella f62c6e47c5 updated benchmark script to cover HBLASLT for all rocm backends 2025-11-17 15:30:19 +00:00
Donato Capitella 528923aa66 restore PR_15405 for Vulkan backends 2025-11-17 11:44:11 +00:00
Donato Capitella eae357f9dd disable PR_15405 for vulkan 2025-11-17 11:19:51 +00:00
Donato Capitella 1bb4c1f0cc improve logic to check if a benchmakr as already been run 2025-11-17 11:01:19 +00:00
Donato Capitella de49d65b3c Ensure ROCBLAS_USE_HIPBLASLT is properly set remotely as well 2025-11-17 09:33:14 +00:00
Donato Capitella 17b1ec2825 run llama-bench INSIDe container (vibe coding is tiring) 2025-11-17 09:26:56 +00:00
Donato Capitella 6d8ac6d6f4 remove user from ssh script 2025-11-17 09:10:52 +00:00
Donato Capitella 1eade84757 fixed remote targets 2025-11-17 09:09:15 +00:00
Donato Capitella 1e184979df remove user from default host 2025-11-17 09:05:13 +00:00
Donato Capitella ecbe5c14c3 Fixed model path resolution 2025-11-17 08:57:42 +00:00
Donato Capitella a50adb0c15 add benchmakr script for RPC 2025-11-17 08:27:54 +00:00
Donato Capitella 79a2438861 copy rpc-server binary to runtime container 2025-11-17 08:04:02 +00:00
Donato Capitella 9254f7b9e2 revert styatic library flag 2025-11-16 22:43:14 +00:00
kyuz0 c0e74afbb8 Disable RPC PR for .rocm-7alpha-rocwmma-improved 2025-11-16 10:32:54 +00:00
Donato Capitella 0c0835e64c disable 15405 PR for rocmwmma-improved 2025-11-16 10:14:41 +00:00
Donato Capitella 6b08c48d91 adjust toolbox pruning script to remove 6.4.4 old ones 2025-11-16 10:06:19 +00:00
Donato Capitella 7e583193d0 migrated to fedora 43 from rawhide to fix build issues 2025-11-16 10:04:39 +00:00
Donato Capitella a164b2308b switching from rawhide to 43 2025-11-16 09:44:29 +00:00
Donato Capitella 5253e1143b tryign DBUILD_SHARED_LIBS to check if it fixes HIP backend build issues 2025-11-16 09:37:53 +00:00
Donato Capitella 8cea1363f3 remove dangling 2025-11-16 08:39:39 +00:00
Donato Capitella bf0d083975 Dropping PR 15405 (llama.cpp RPC experimental improvement) due to compile issue 2025-11-16 08:25:25 +00:00
Donato Capitella 9de07b1d25 Enable RPC builds and merge PR 15405 across Dockerfiles 2025-11-16 07:54:49 +00:00