Donato Capitella
1421e87060
feat: add Plausible analytics script to documentation index page
2026-04-21 09:47:16 +01:00
Donato Capitella
9016c0f8f8
update benchs
2026-04-15 16:54:34 +01:00
Donato Capitella
9707a15df7
feat: add benchmark results for rocm-7_2_1-pr21344 and update results metadata
2026-04-15 11:39:10 +01:00
Donato Capitella
14fae26ad0
add minimax m2.7 benchmarks
2026-04-15 08:09:12 +01:00
Donato Capitella
d74db71362
archvied old multi-node benchmarks
2026-04-11 11:20:30 +01:00
Donato Capitella
7aa6e6dea9
update benchmarks
2026-04-11 11:18:45 +01:00
Donato Capitella
c129a04a1c
refactor: remove hblt0 benchmark support and associated comparison scripts
2026-04-10 11:23:06 +01:00
Donato Capitella
a7ace8dba7
updted benchmarks
2026-03-30 08:37:15 +01:00
esc247
eb03432a50
added router mode section and example models.ini file for use with router mode ( #67 )
2026-03-04 12:09:11 +00:00
Trevor Starick
be936d6b59
feat: add REPO/BRANCH build args for llama.cpp ( #59 )
...
- Introduce ARG REPO and ARG BRANCH to replace the hardcoded git clone with: `git clone -b ${BRANCH} --single-branch --recursive ${REPO}` . This allows overriding the llama.cpp repository and branch at build time via `--build-arg`.
- Update `docs/building.md` to recommend using `--build-arg` instead of updating the file
2026-02-17 19:29:48 +00:00
Donato Capitella
8ff812fbb5
updated benchmarks
2026-02-09 13:30:26 +00:00
Donato Capitella
2d09b9e6db
updated benchmarks
2026-02-05 19:03:13 +00:00
Donato Capitella
d97efb0cb9
updated gpt-oss benchmakrs to test rocm7 performance patch
2026-02-04 17:46:43 +00:00
Donato Capitella
d674531182
added rocm-7.2 benchmarks
2026-01-23 15:11:13 +00:00
Donato Capitella
6d70dfc73b
updated with dual-server benchmarks
2026-01-12 13:19:23 +00:00
Donato Capitella
7268e95b0f
updates
2026-01-12 11:05:31 +00:00
Donato Capitella
f5b9cd84a2
updated instructions to downgrade kernel
2026-01-11 18:26:26 +00:00
Donato Capitella
d6c7456bd0
adding system info to benchmark display
2026-01-11 10:04:05 +00:00
Donato Capitella
eff3a8ede8
updated downgrade instructions
2026-01-10 11:04:10 +00:00
Donato Capitella
783998589e
neclean up of legacy toolboxes, removal of rocwmma and renamed rocm7-alpha to rocm-7nightlies. Added new benchmarks
2026-01-10 10:31:04 +00:00
Donato Capitella
f0e9bc8865
adding warning
2026-01-08 14:47:14 +00:00
Donato Capitella
2c8a1e2eef
updated benchmarks
2025-12-21 18:49:08 +00:00
Donato Capitella
7584a31548
updated Qwen Next Benchmarks
2025-12-05 08:32:58 +00:00
Donato Capitella
7f34f51202
Added Qwen-3-Next benchmarks
2025-11-28 17:50:21 +00:00
Donato Capitella
c7f4ffc346
updated rpc benchmakrs with long context
2025-11-19 07:35:56 +00:00
kevinjohncolo
f631e45674
Create docker-compose-how-to.md ( #28 )
...
* Create docker-compose-how-to.md
Simple how-to on using docker compose instead of toolbox.
* Update docker-compose-how-to.md
2025-11-18 08:43:22 +00:00
Donato Capitella
ccf29e6b22
fixed naming convention
2025-11-17 23:09:04 +00:00
Donato Capitella
1d6d48fae1
updated benchmarks
2025-11-17 23:02:56 +00:00
Donato Capitella
67fb3a002b
Updated benchmarks
2025-11-15 08:36:25 +00:00
Donato Capitella
2f2b1b33af
added Qwen3-Coder-30B-A3B-Instruct_Q4_K_M to the benchmarks
2025-10-20 20:05:56 +01:00
Donato Capitella
765cc5c733
updated benchs
2025-10-12 07:39:42 +01:00
Donato Capitella
ba88675b9c
Updated benchmarkls with ROCm 6.4.4
2025-09-28 09:38:04 +01:00
Donato Capitella
006aaa64e1
Updated benchmarks
2025-09-17 10:41:14 +01:00
Donato Capitella
1acda69224
updated benhcmakrs with reference llama 2 model
2025-08-18 22:25:28 +01:00
Donato Capitella
b71a37647f
Updated benchmakrs, removed old toolboxes and results
2025-08-17 12:32:08 +01:00
Donato Capitella
62e5080102
Updated benchmarks
2025-08-17 08:53:16 +01:00
Donato Capitella
a9618d881b
- Corrected typo in WMMA (was spelt wrong as waam)
...
- Included rocm-7rc-rocwmma toolbox
- Included updated results from benchmarks including rocm 7rc with ROMWMMA and hipBLASLt
2025-08-10 13:21:06 +01:00
Donato Capitella
0d6b2dc731
Addin rocm7-rocwaam toolbox
2025-08-09 19:21:52 +01:00
Donato Capitella
3e4a3e7f9e
Updated slider
2025-08-09 15:58:29 +01:00
Donato Capitella
e49efe221e
Changed to light theme and improved parsinf of mdoel paramater number.
2025-08-09 15:43:07 +01:00
Donato Capitella
995ad2cd38
Updated benchmarks
2025-08-09 11:50:27 +01:00
Donato Capitella
bc9483b75d
Adding new benchmarks
2025-08-09 11:25:44 +01:00
Donato Capitella
0dd1f8d047
moved to docs folder for github pages support
2025-08-09 10:42:33 +01:00
Donato Capitella
feb6b069de
sorted table rows
2025-08-06 18:45:14 +01:00
Donato Capitella
40b6e6d665
Updated benchmark results with gtp-oss and glm-4.5-air models
2025-08-06 18:43:35 +01:00
Donato Capitella
2c90eac378
Add build-and-push workflow
2025-08-06 08:56:58 +01:00
Donato Capitella
63ceb6ee57
Fixed gguf-vram-estimator.py path
2025-08-03 14:02:03 +01:00
Donato Capitella
645a318257
Removed typo
2025-08-03 14:01:06 +01:00
Donato Capitella
6c66edf0b7
Updates to Dockerfile buil docs
2025-08-03 13:56:16 +01:00
Donato Capitella
e7e27e6cf3
Benchmark and container updates
2025-08-03 13:05:52 +01:00