Merge pull request #17 from neuhaus/patch-1

Fix command syntax for llama-cli usage
2025-11-10 19:24:07 +00:00
parent 6d121bc88a 4ec72fa8f4
commit 19c67b6665
1 changed files with 1 additions and 1 deletions
@@ -145,7 +145,7 @@ Once inside, the following commands show how to run local LLMs:

 * `llama-cli --list-devices`
  *Lists available GPU devices for Llama.cpp.*
-* `llama-cli --no-mmap -ngl 999 -fa -m <model>`
+* `llama-cli --no-mmap -ngl 999 -fa 1 -m <model>`
  *Runs inference on the specified model, with all layers on GPU and flash attention enabled (replace \*\* with your model path).*

 ## 2.3 Downloading GGUF Models from HuggingFace