Fix command syntax for llama-cli usage
This commit is contained in:
@@ -145,7 +145,7 @@ Once inside, the following commands show how to run local LLMs:
|
|||||||
|
|
||||||
* `llama-cli --list-devices`
|
* `llama-cli --list-devices`
|
||||||
*Lists available GPU devices for Llama.cpp.*
|
*Lists available GPU devices for Llama.cpp.*
|
||||||
* `llama-cli --no-mmap -ngl 999 -fa -m <model>`
|
* `llama-cli --no-mmap -ngl 999 -fa 1 -m <model>`
|
||||||
*Runs inference on the specified model, with all layers on GPU and flash attention enabled (replace \*\* with your model path).*
|
*Runs inference on the specified model, with all layers on GPU and flash attention enabled (replace \*\* with your model path).*
|
||||||
|
|
||||||
## 2.3 Downloading GGUF Models from HuggingFace
|
## 2.3 Downloading GGUF Models from HuggingFace
|
||||||
|
|||||||
Reference in New Issue
Block a user