amd-strix-halo-toolboxes/docs/docker-compose-how-to.md at 614b00af3edb0b0ec92a3c351a8afc31c5a45465

kevinjohncolo f631e45674 Create docker-compose-how-to.md (#28 )

* Create docker-compose-how-to.md

Simple how-to on using docker compose instead of toolbox.

* Update docker-compose-how-to.md

2025-11-18 08:43:22 +00:00

How to use docker-compose instead of toolbox

1. Vulkan(AMDVLK)
2. ROCm-6.4.4-ROCWMMA

1. Vulkan(AMDVLK)

Pick the Dockerfile for your backend from the repo. For example:
https://github.com/kyuz0/amd-strix-halo-toolboxes/blob/main/toolboxes/Dockerfile.vulkan-amdvlk
In that Dockerfile, change the shell command to:

# shell
CMD ["/bin/bash", "-c", "llama-server --host $HOST --port $PORT -c $CONTEXT_LENGTH --temp $TEMPERATURE --jinja --no-mmap -ngl $NGL -fa $FA -m $MODEL_PATH"]
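
The CMD above expands every setting from the container's environment, so an unset variable silently becomes an empty argument to llama-server. A minimal sketch of a pre-flight check follows; the function name `missing_vars` is hypothetical and not part of the repo, and only the variable names come from the CMD line.

```shell
# Hypothetical helper (not from the repo): print the name of every
# variable in the argument list that is unset or empty.
missing_vars() {
  for name in "$@"; do
    # Look up the value of the variable whose name is stored in $name.
    eval "value=\${$name:-}"
    [ -n "$value" ] || printf '%s\n' "$name"
  done
}
```

Called as `missing_vars HOST PORT CONTEXT_LENGTH TEMPERATURE NGL FA MODEL_PATH`, it prints nothing when all seven are set, which makes it easy to run from a shell inside the container before blaming the model or the GPU.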

Build the image from the directory that contains the Dockerfile:

docker build -f Dockerfile.vulkan-amdvlk -t vulkan-amdvlk:1.0 .

Download your model files to a directory on the host. The compose file mounts this directory into the container. I use:

/mnt/models
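
Before mounting the directory, it can be worth confirming that a download really is a GGUF file and not a truncated transfer or a saved error page. GGUF files begin with the four ASCII bytes `GGUF`, so a rough check is possible with `head`. This is only a sketch: the function name `is_gguf` is hypothetical, and it looks at the header only, not at the integrity of the whole file.

```shell
# Hypothetical helper (not from the repo): succeed only if the file
# exists and starts with the GGUF magic bytes.
is_gguf() {
  [ -f "$1" ] && [ "$(head -c 4 "$1")" = "GGUF" ]
}
```

For example, `is_gguf /mnt/models/your-model.gguf && echo ok`, where the path is a placeholder for one of your own files.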

Create a docker-compose.yml from this template. Change the ports and paths as needed.

services:
  gpt-oss-120b:
    container_name: gpt-oss-120b
    image: vulkan-amdvlk:1.0
    ports:
      - "8069:8069"
    volumes:
      - /mnt/models:/mnt/models
    devices:
      - "/dev/dri:/dev/dri"
    privileged: true
    restart: unless-stopped
    environment:
      - HOST=0.0.0.0
      - PORT=8069
      - CONTEXT_LENGTH=120000
      - TEMPERATURE=0.0
      - MODEL_PATH=/mnt/models/gpt-oss-120b-UD-Q4_K_XL/gpt-oss-120b-UD-Q4_K_XL-00001-of-00002.gguf
      - NGL=999
      - FA=on

Start the service in the background as usual.

docker compose up -d
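
Loading a model of this size takes a while, so the port may refuse requests for some time after the container starts. The sketch below polls llama-server's `/health` endpoint until it answers with success. The function names, the retry count, and the 5-second delay are assumptions made for illustration; the default port 8069 is taken from the template above.

```shell
# Build the health-check URL; defaults match the compose template.
health_url() {
  printf 'http://%s:%s/health\n' "${1:-localhost}" "${2:-8069}"
}

# Hypothetical helper: poll until the server reports ready.
# Arguments: host, port, number of attempts (default 60).
wait_for_server() {
  url=$(health_url "$1" "$2")
  attempts=${3:-60}
  while [ "$attempts" -gt 0 ]; do
    # -f makes curl fail on HTTP errors such as 503 while the model loads.
    if curl -fsS "$url" >/dev/null 2>&1; then
      echo "server ready at $url"
      return 0
    fi
    attempts=$((attempts - 1))
    sleep 5
  done
  echo "server did not become ready: $url" >&2
  return 1
}
```

Run `wait_for_server localhost 8069` after `docker compose up -d`. If it times out, `docker compose logs` usually shows why.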

2. ROCm-6.4.4-ROCWMMA

Pick the Dockerfile for your backend from the repo. For example:
https://github.com/kyuz0/amd-strix-halo-toolboxes/blob/main/toolboxes/Dockerfile.rocm-6.4.4-rocwmma
In that Dockerfile, change the shell command to:

# shell
CMD ["/bin/bash", "-c", "llama-server --host $HOST --port $PORT -c $CONTEXT_LENGTH --temp $TEMPERATURE --jinja --no-mmap -ngl $NGL -fa $FA -m $MODEL_PATH"]

Build the image from the directory that contains the Dockerfile:

docker build -f Dockerfile.rocm-6.4.4-rocwmma -t rocm-6.4.4-rocwmma:1.0 .

Download your model files to a directory on the host. The compose file mounts this directory into the container. I use:

/mnt/models

Create a docker-compose.yml from this template. Change the ports and paths as needed.

services:
  gpt-oss-120b:
    container_name: gpt-oss-120b
    image: rocm-6.4.4-rocwmma:1.0
    ports:
      - "8069:8069"
    volumes:
      - /mnt/models:/mnt/models
    devices:
      - "/dev/dri:/dev/dri"
      - "/dev/kfd:/dev/kfd"
    privileged: true
    restart: unless-stopped
    environment:
      - HOST=0.0.0.0
      - PORT=8069
      - CONTEXT_LENGTH=120000
      - TEMPERATURE=0.0
      - MODEL_PATH=/mnt/models/gpt-oss-120b-UD-Q4_K_XL/gpt-oss-120b-UD-Q4_K_XL-00001-of-00002.gguf
      - NGL=999
      - FA=on
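
The ROCm template passes both /dev/dri and /dev/kfd through to the container, so both must exist on the host before the service starts. A small sketch of that check follows; the function name `missing_paths` is hypothetical.

```shell
# Hypothetical helper (not from the repo): print every path in the
# argument list that does not exist on this machine.
missing_paths() {
  for path in "$@"; do
    [ -e "$path" ] || printf '%s\n' "$path"
  done
}
```

Running `missing_paths /dev/dri /dev/kfd` on the host should print nothing. Any path it does print points to a driver problem on the host rather than anything in the compose file.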

Start the service in the background as usual.

docker compose up -d
