From 2d55e329dc12ae18b4bbe5a62391e9715dadda29 Mon Sep 17 00:00:00 2001 From: Donato Capitella Date: Sun, 10 Aug 2025 20:59:14 +0100 Subject: [PATCH] Fixed typo --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 128a355..a53ccc1 100644 --- a/README.md +++ b/README.md @@ -130,7 +130,7 @@ Once inside, the following commands show how to run local LLMs: * `llama-cli --list-devices` *Lists available GPU devices for Llama.cpp.* -* `llama-cli --no-mmap --ngl 999 -fa -m ` +* `llama-cli --no-mmap -ngl 999 -fa -m ` *Runs inference on the specified model, with all layers on GPU and flash attention enabled (replace \*\* with your model path).* ## 2.3 Downloading GGUF Models from HuggingFace