Commit Graph

49 Commits

Author SHA1 Message Date
Donato Capitella 9016c0f8f8 update benchs 2026-04-15 16:54:34 +01:00
Donato Capitella 9707a15df7 feat: add benchmark results for rocm-7_2_1-pr21344 and update results metadata 2026-04-15 11:39:10 +01:00
Donato Capitella 14fae26ad0 add minimax m2.7 benchmarks 2026-04-15 08:09:12 +01:00
Donato Capitella d74db71362 archvied old multi-node benchmarks 2026-04-11 11:20:30 +01:00
Donato Capitella 7aa6e6dea9 update benchmarks 2026-04-11 11:18:45 +01:00
Donato Capitella c129a04a1c refactor: remove hblt0 benchmark support and associated comparison scripts 2026-04-10 11:23:06 +01:00
Donato Capitella a7ace8dba7 updted benchmarks 2026-03-30 08:37:15 +01:00
esc247 eb03432a50 added router mode section and example models.ini file for use with router mode (#67) 2026-03-04 12:09:11 +00:00
Trevor Starick be936d6b59 feat: add REPO/BRANCH build args for llama.cpp (#59)
- Introduce ARG REPO and ARG BRANCH to replace the hardcoded git clone with: `git clone -b ${BRANCH} --single-branch --recursive ${REPO}` . This allows overriding the llama.cpp repository and branch at build time via `--build-arg`.

- Update `docs/building.md` to recommend using `--build-arg` instead of updating the file
2026-02-17 19:29:48 +00:00
Donato Capitella 8ff812fbb5 updated benchmarks 2026-02-09 13:30:26 +00:00
Donato Capitella 2d09b9e6db updated benchmarks 2026-02-05 19:03:13 +00:00
Donato Capitella d97efb0cb9 updated gpt-oss benchmakrs to test rocm7 performance patch 2026-02-04 17:46:43 +00:00
Donato Capitella d674531182 added rocm-7.2 benchmarks 2026-01-23 15:11:13 +00:00
Donato Capitella 6d70dfc73b updated with dual-server benchmarks 2026-01-12 13:19:23 +00:00
Donato Capitella 7268e95b0f updates 2026-01-12 11:05:31 +00:00
Donato Capitella f5b9cd84a2 updated instructions to downgrade kernel 2026-01-11 18:26:26 +00:00
Donato Capitella d6c7456bd0 adding system info to benchmark display 2026-01-11 10:04:05 +00:00
Donato Capitella eff3a8ede8 updated downgrade instructions 2026-01-10 11:04:10 +00:00
Donato Capitella 783998589e neclean up of legacy toolboxes, removal of rocwmma and renamed rocm7-alpha to rocm-7nightlies. Added new benchmarks 2026-01-10 10:31:04 +00:00
Donato Capitella f0e9bc8865 adding warning 2026-01-08 14:47:14 +00:00
Donato Capitella 2c8a1e2eef updated benchmarks 2025-12-21 18:49:08 +00:00
Donato Capitella 7584a31548 updated Qwen Next Benchmarks 2025-12-05 08:32:58 +00:00
Donato Capitella 7f34f51202 Added Qwen-3-Next benchmarks 2025-11-28 17:50:21 +00:00
Donato Capitella c7f4ffc346 updated rpc benchmakrs with long context 2025-11-19 07:35:56 +00:00
kevinjohncolo f631e45674 Create docker-compose-how-to.md (#28)
* Create docker-compose-how-to.md

Simple how-to on using docker compose instead of toolbox.

* Update docker-compose-how-to.md
2025-11-18 08:43:22 +00:00
Donato Capitella ccf29e6b22 fixed naming convention 2025-11-17 23:09:04 +00:00
Donato Capitella 1d6d48fae1 updated benchmarks 2025-11-17 23:02:56 +00:00
Donato Capitella 67fb3a002b Updated benchmarks 2025-11-15 08:36:25 +00:00
Donato Capitella 2f2b1b33af added Qwen3-Coder-30B-A3B-Instruct_Q4_K_M to the benchmarks 2025-10-20 20:05:56 +01:00
Donato Capitella 765cc5c733 updated benchs 2025-10-12 07:39:42 +01:00
Donato Capitella ba88675b9c Updated benchmarkls with ROCm 6.4.4 2025-09-28 09:38:04 +01:00
Donato Capitella 006aaa64e1 Updated benchmarks 2025-09-17 10:41:14 +01:00
Donato Capitella 1acda69224 updated benhcmakrs with reference llama 2 model 2025-08-18 22:25:28 +01:00
Donato Capitella b71a37647f Updated benchmakrs, removed old toolboxes and results 2025-08-17 12:32:08 +01:00
Donato Capitella 62e5080102 Updated benchmarks 2025-08-17 08:53:16 +01:00
Donato Capitella a9618d881b - Corrected typo in WMMA (was spelt wrong as waam)
- Included rocm-7rc-rocwmma toolbox
- Included updated results from benchmarks including rocm 7rc with ROMWMMA and hipBLASLt
2025-08-10 13:21:06 +01:00
Donato Capitella 0d6b2dc731 Addin rocm7-rocwaam toolbox 2025-08-09 19:21:52 +01:00
Donato Capitella 3e4a3e7f9e Updated slider 2025-08-09 15:58:29 +01:00
Donato Capitella e49efe221e Changed to light theme and improved parsinf of mdoel paramater number. 2025-08-09 15:43:07 +01:00
Donato Capitella 995ad2cd38 Updated benchmarks 2025-08-09 11:50:27 +01:00
Donato Capitella bc9483b75d Adding new benchmarks 2025-08-09 11:25:44 +01:00
Donato Capitella 0dd1f8d047 moved to docs folder for github pages support 2025-08-09 10:42:33 +01:00
Donato Capitella feb6b069de sorted table rows 2025-08-06 18:45:14 +01:00
Donato Capitella 40b6e6d665 Updated benchmark results with gtp-oss and glm-4.5-air models 2025-08-06 18:43:35 +01:00
Donato Capitella 2c90eac378 Add build-and-push workflow 2025-08-06 08:56:58 +01:00
Donato Capitella 63ceb6ee57 Fixed gguf-vram-estimator.py path 2025-08-03 14:02:03 +01:00
Donato Capitella 645a318257 Removed typo 2025-08-03 14:01:06 +01:00
Donato Capitella 6c66edf0b7 Updates to Dockerfile buil docs 2025-08-03 13:56:16 +01:00
Donato Capitella e7e27e6cf3 Benchmark and container updates 2025-08-03 13:05:52 +01:00