Donato Capitella 864a02f526 Tidy up README
2025-07-28 21:05:09 +01:00

amd-strix-halo-toolboxes

This repository provides Fedora Rawhide-based containers for development work on AMD Strix Halo GPUs (gfx1151):

  • Dockerfile.rocm — builds llama.cpp with ROCm (HIP) support
  • Dockerfile.vulkan — builds llama.cpp with Vulkan compute support

Both containers have up-to-date ROCm/Vulkan libs from Fedora Rawhide.

Prerequisites

Pull and Run Pre-built Containers

Pull pre-built images:

podman pull docker.io/kyuz0/amd-strix-halo-toolboxes:rocm
podman pull docker.io/kyuz0/amd-strix-halo-toolboxes:vulkan

Create toolboxes:

toolbox create llama-rocm \
  --image docker.io/kyuz0/amd-strix-halo-toolboxes:rocm \
  -- \
    --device /dev/kfd \
    --device /dev/dri \
    --group-add video \
    --security-opt seccomp=unconfined

toolbox create llama-vulkan \
  --image docker.io/kyuz0/amd-strix-halo-toolboxes:vulkan \
  -- \
    --device /dev/dri \
    --group-add video \
    --security-opt seccomp=unconfined

The -- separator passes the remaining flags through to Podman/Docker so the container can access the GPU device nodes.
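Before creating the toolboxes, it can help to confirm that the device nodes named in the flags above exist on the host. The following is a minimal sketch, not part of this repository; the helper name check_nodes is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of this repository).
# Prints "ok <path>" or "missing <path>" for each argument and
# returns non-zero if any of the paths does not exist.
check_nodes() {
    local rc=0 node
    for node in "$@"; do
        if [ -e "$node" ]; then
            echo "ok $node"
        else
            echo "missing $node"
            rc=1
        fi
    done
    return "$rc"
}

# The ROCm toolbox is given /dev/kfd and /dev/dri; the Vulkan one only /dev/dri.
check_nodes /dev/kfd /dev/dri \
    || echo "GPU device nodes missing; check the amdgpu driver on the host" >&2
```

If /dev/kfd is reported missing, the Vulkan toolbox may still work, since it is only passed /dev/dri.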

Enter and test:

toolbox enter llama-rocm
llama-cli --list-devices

toolbox enter llama-vulkan
vulkaninfo | head -n 10
llama-cli --help
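The same checks can be run without opening an interactive shell, which is convenient in scripts. This sketch assumes the toolbox names created above and uses toolbox run; the wrapper name run_in_toolbox is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical wrapper (not part of this repository).
# Runs a command inside a named toolbox non-interactively.
# Returns 127 if the toolbox CLI is not on PATH.
run_in_toolbox() {
    local name=$1
    shift
    if ! command -v toolbox >/dev/null 2>&1; then
        echo "toolbox not found on PATH" >&2
        return 127
    fi
    toolbox run --container "$name" "$@"
}

# Example usage, assuming the toolboxes were created as shown above:
#   run_in_toolbox llama-rocm llama-cli --list-devices
#   run_in_toolbox llama-vulkan llama-cli --help
```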

(Optional) Building the Images

podman build -t llama-rocm -f Dockerfile.rocm .
podman build -t llama-vulkan -f Dockerfile.vulkan .

Host Configuration

  • Machine: HP Z2 Mini G1a
  • Memory: 128 GB RAM (512 MB dedicated to the GPU in BIOS)
  • Host OS: Fedora 42, kernel 6.15.6-200.fc42.x86_64
  • Kernel boot parameters:
    amd_iommu=off amdgpu.gttsize=131072 ttm.pages_limit=335544321
    
    • amd_iommu=off disables IOMMU for lower latency.
    • amdgpu.gttsize=131072 sets the GTT size in MiB, enabling unified GPU/system memory (up to 128 GB).
    • ttm.pages_limit=335544321 allows large pinned allocations.
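  • Apply by adding the parameters to GRUB_CMDLINE_LINUX in /etc/default/grub, then regenerating the GRUB config and rebooting:
    sudo grub2-mkconfig -o /boot/grub2/grub.cfg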
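After rebooting, the running kernel's command line can be checked against the parameters listed above. The sketch below is illustrative only: has_param and pages_to_gib are hypothetical helpers, and the page-count conversion assumes 4 KiB pages.

```shell
#!/usr/bin/env bash
# Hypothetical helpers (not part of this repository).

# Return 0 if the kernel command line in $1 contains the exact parameter $2.
has_param() {
    case " $1 " in
        *" $2 "*) return 0 ;;
        *) return 1 ;;
    esac
}

# Convert a TTM page count to whole GiB, assuming 4 KiB pages.
pages_to_gib() {
    echo $(( $1 * 4096 / 1024 / 1024 / 1024 ))
}

# Read the command line of the running kernel (empty if unavailable).
cmdline=$(cat /proc/cmdline 2>/dev/null || true)
for p in amd_iommu=off amdgpu.gttsize=131072 ttm.pages_limit=335544321; do
    if has_param "$cmdline" "$p"; then
        echo "set:     $p"
    else
        echo "not set: $p"
    fi
done

# Size implied by the pages limit above, in GiB.
echo "ttm.pages_limit=335544321 is about $(pages_to_gib 335544321) GiB"
```

Under the 4 KiB page assumption, 335544321 pages works out to roughly 1280 GiB, well above the 128 GB of installed RAM, while 33554432 pages would correspond to exactly 128 GiB.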
    

Both containers use Fedora Rawhide packages for up-to-date ROCm and Vulkan support.
