From 2d55e329dc12ae18b4bbe5a62391e9715dadda29 Mon Sep 17 00:00:00 2001
From: Donato Capitella <donato.capitella@withsecure.com>
Date: Sun, 10 Aug 2025 20:59:14 +0100
Subject: [PATCH] Fix typo in llama-cli flag in README (--ngl -> -ngl)

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/README.md b/README.md
index 128a355..a53ccc1 100644
--- a/README.md
+++ b/README.md
@@ -130,7 +130,7 @@ Once inside, the following commands show how to run local LLMs:
 
 * `llama-cli --list-devices`
   *Lists available GPU devices for Llama.cpp.*
-* `llama-cli --no-mmap --ngl 999 -fa -m <model>`
+* `llama-cli --no-mmap -ngl 999 -fa -m <model>`
   *Runs inference on the specified model, with all layers on GPU and flash attention enabled (replace \*\* with your model path).*
 
 ## 2.3 Downloading GGUF Models from HuggingFace