Fix command syntax for llama-cli usage

This commit is contained in:
S. Neuhaus
2025-10-30 18:11:28 +01:00
committed by GitHub
parent 2f2b1b33af
commit 4ec72fa8f4
+1 -1
View File
@@ -145,7 +145,7 @@ Once inside, the following commands show how to run local LLMs:
* `llama-cli --list-devices` * `llama-cli --list-devices`
*Lists available GPU devices for Llama.cpp.* *Lists available GPU devices for Llama.cpp.*
* `llama-cli --no-mmap -ngl 999 -fa -m <model>` * `llama-cli --no-mmap -ngl 999 -fa 1 -m <model>`
*Runs inference on the specified model, with all layers on GPU and flash attention enabled (replace \*\* with your model path).* *Runs inference on the specified model, with all layers on GPU and flash attention enabled (replace \*\* with your model path).*
## 2.3 Downloading GGUF Models from HuggingFace ## 2.3 Downloading GGUF Models from HuggingFace