llama.cpp

by ggml · vb9994

Last updated Jul 27, 2026

LLM inference in C/C++

Install with winget

$ winget install --id ggml.llamacpp --exact --version b9994

Run in Command Prompt, PowerShell, or Windows Terminal. winget prompts you to accept any agreements.

Built by Pckgr

Installers · vb9994

Architecture	Type	Scope	Install	Download
x64	ZIP archive	-		Direct

Copy a command tailored to that specific architecture, type, and scope - useful when winget would otherwise pick a different default.

Security

14 known CVEs via NVD

high · 7.8 · Patched in winget · CVE-2026-27940 · affects before b8146 · Mar 12, 2026
llama.cpp is an inference of several LLM models in C/C++. Prior to b8146, the gguf_init_from_file_impl() in gguf.cpp is vulnerable to an Integer overflow, leading to an undersized heap allocation. Using the subsequent fread() writes 528+ bytes of attacker-controlled data past th...
Vendor advisory
high · 8.8 · Patched in winget · CVE-2026-21869 · Jan 7, 2026
llama.cpp is an inference of several LLM models in C/C++. In commits 55d4206c8 and prior, the n_discard parameter is parsed directly from JSON input in the llama.cpp server's completion endpoints without validation to ensure it's non-negative. When a negative value is supplied a...
Vendor advisory
high · 8.6 · Patched in winget · CVE-2025-52566 · affects before b5721 · Jun 23, 2025
llama.cpp is an inference of several LLM models in C/C++. Prior to version b5721, there is a signed vs. unsigned integer overflow in llama.cpp's tokenizer implementation (llama_vocab::tokenize) (src/llama-vocab.cpp:3036) resulting in unintended behavior in tokens copying size co...
Patch · Vendor advisory
high · 8.8 · Patched in winget · CVE-2025-49847 · affects before b5662 · Jun 17, 2025
llama.cpp is an inference of several LLM models in C/C++. Prior to version b5662, an attacker‐supplied GGUF model vocabulary can trigger a buffer overflow in llama.cpp’s vocabulary‐loading code. Specifically, the helper _try_copy in llama.cpp/src/vocab.cpp: llama_vocab::impl::to...
Patch · Vendor advisory
critical · 10.0 · Patched in winget · CVE-2024-42479 · affects before b3561 · Aug 12, 2024
llama.cpp provides LLM inference in C/C++. The unsafe `data` pointer member in the `rpc_tensor` structure can cause arbitrary address writing. This vulnerability is fixed in b3561.
Patch · Vendor advisory
medium · 5.3 · Patched in winget · CVE-2024-42478 · affects before b3561 · Aug 12, 2024
llama.cpp provides LLM inference in C/C++. The unsafe `data` pointer member in the `rpc_tensor` structure can cause arbitrary address reading. This vulnerability is fixed in b3561.
Patch · Vendor advisory
medium · 5.3 · Patched in winget · CVE-2024-42477 · affects before b3561 · Aug 12, 2024
llama.cpp provides LLM inference in C/C++. The unsafe `type` member in the `rpc_tensor` structure can cause `global-buffer-overflow`. This vulnerability may lead to memory data leakage. The vulnerability is fixed in b3561.
Patch · Vendor advisory
medium · 5.4 · Patched in winget · CVE-2024-41130 · affects before b3427 · Jul 22, 2024
llama.cpp provides LLM inference in C/C++. Prior to b3427, llama.cpp contains a null pointer dereference in gguf_init_from_file. This vulnerability is fixed in b3427.
Patch

Showing 8 of 14. Source: NVD, updated 6h ago. Patch status is best-effort: NVD's fix version is compared against the latest version in winget, but the two version formats don't always align. Confirm with the vendor advisory before treating any specific build as safe.
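The version comparison described above can be sketched in a few lines. This is an illustrative sketch only, not the site's actual logic: it assumes build tags have the form `b<digits>` (optionally prefixed with `v`, as in `vb9994`), and the helper names `build_number` and `is_patched` are hypothetical.

```python
import re

# Assumption: build tags look like "b9994" or "vb9994".
_BUILD_TAG = re.compile(r"^v?b(\d+)$")


def build_number(tag: str) -> int:
    """Return the numeric part of a build tag such as 'b9994'."""
    match = _BUILD_TAG.match(tag.strip())
    if match is None:
        # Fix versions given in another format (for example a commit hash)
        # cannot be compared this way and need a manual check.
        raise ValueError(f"not a build tag: {tag!r}")
    return int(match.group(1))


def is_patched(installed: str, fixed_in: str) -> bool:
    """True when the installed build is at or after the build carrying the fix."""
    return build_number(installed) >= build_number(fixed_in)


print(is_patched("b9994", "b8146"))  # True
print(is_patched("b3427", "b3561"))  # False
```

A fix version expressed as a commit hash, such as the `55d4206c8` cited for CVE-2026-21869, raises `ValueError` here. That is the kind of format mismatch that makes the patch status best-effort.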

Related apps

Ollama (Portable) · Ollama
Ollama.Ollama.Portable · v0.20.2
Start building with open models.

LM Studio · LM Studio
ElementLabs.LMStudio · v0.4.20+1
Discover, download, and run local LLMs

AIChat · sigoden
sigoden.AIChat · v0.30.0
All-in-one LLM CLI tool featuring Shell Assistant, Chat-REPL, RAG, AI tools & agents, with access to OpenAI, Claude, Gemini, Ollama, Groq, and more.

Kelivo · Psyche
Psyche.Kelivo · v1.1.17+61
A Flutter LLM Chat Client. Support Android & iOS & Harmony Next.

HyperChat · Dadigua
BigSweetPotatoStudio.HyperChat · v1.8.4
A Chat client that strives for openness, utilizing APIs from various LLMs to achieve the best Chat experience, as well as implementing productivity tools through the MCP protocol.

Ollama · Ollama
Ollama.Ollama · v0.32.5
Get up and running with large language models locally.

Frequently asked questions

How do I install llama.cpp on Windows?

Open Windows Terminal, PowerShell, or Command Prompt and run: winget install --id ggml.llamacpp --exact --version b9994. winget downloads the installer from ggml and runs it. Requires Windows 10 (1809+) or Windows 11.

How do I install llama.cpp silently for unattended deployment?

Add --silent and accept the agreements upfront: winget install --id ggml.llamacpp --exact --version b9994 --silent --accept-package-agreements --accept-source-agreements. This is the variant Intune, Configuration Manager, and other deployment tools should use.
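For scripted deployments, the unattended command can be assembled as an argument list instead of a hand-typed string. The sketch below is a minimal illustration: the function name `silent_install_args` is hypothetical, and the version is passed through `--version` as in the basic install command. The script only prints the command; it does not run it.

```python
def silent_install_args(package_id: str, version: str) -> list[str]:
    """Build the winget argument list for an unattended, version-pinned install."""
    return [
        "winget", "install",
        "--id", package_id,
        "--exact",
        "--version", version,
        "--silent",
        "--accept-package-agreements",
        "--accept-source-agreements",
    ]


args = silent_install_args("ggml.llamacpp", "b9994")
print(" ".join(args))

# To execute on a Windows machine where winget is available (assumption):
# import subprocess
# subprocess.run(args, check=True)
```

Keeping the arguments as a list avoids quoting problems when the command is handed to `subprocess.run`.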

How do I uninstall llama.cpp via winget?

Run: winget uninstall --id ggml.llamacpp --exact. Add --silent for unattended uninstalls. winget will use the registered uninstaller from llama.cpp's Apps & Features entry.

Is llama.cpp free?

Yes. llama.cpp is distributed under the MIT License. Refer to the publisher (https://github.com/ggml-org/llama.cpp) for the full license terms. Wingetly itself does not charge for installation.

Does llama.cpp work on Windows 10?

Yes, as long as your Windows 10 build supports winget (1809 or newer). winget ships with App Installer on Windows 10/11 and pulls llama.cpp directly from the publisher.

How do I keep llama.cpp up to date?

Run winget upgrade --id ggml.llamacpp --exact, or winget upgrade --all to update everything winget tracks. We index 10 versions of llama.cpp from microsoft/winget-pkgs.

Recent versions

b9994 (latest)
b9982
b9969
b9957
b9947
b9935
b9910
b9893
b9878
b9873