How to open .GGUF files on Windows

A .GGUF file is a model file for local LLM runtimes, not a document. To open one on Windows, load it into a GGUF-capable tool such as llama.cpp or another desktop runtime that supports GGUF models.

Step-by-step instructions

Install or download a GGUF-capable local LLM tool (for example llama.cpp, or another desktop runtime that supports GGUF models).
Run the tool and provide the path to your .gguf model file (for llama.cpp, pass the path on the command line with the -m option).
If the tool reports an unsupported model or format, verify that the tool version supports GGUF and that the model is actually a GGUF file.
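
The steps above can be sketched as a small Python helper that builds the command line for a llama.cpp-style tool. This is a minimal sketch, not an official launcher: the executable name llama-cli.exe and both paths are assumptions (older llama.cpp builds used a different binary name), while -m, -p and -n are the llama.cpp options for the model path, the prompt and the number of tokens to generate.

```python
import subprocess
from pathlib import Path


def build_llama_command(exe_path, model_path, prompt, max_tokens=64):
    """Return the argument list for a llama.cpp-style command-line run."""
    model = Path(model_path)
    # Catch the most common mistake early: pointing at a non-GGUF file.
    if model.suffix.lower() != ".gguf":
        raise ValueError(f"expected a .gguf file, got: {model.name}")
    return [str(exe_path), "-m", str(model), "-p", prompt, "-n", str(max_tokens)]


def run_llama(cmd):
    """Run the command and raise if the tool exits with an error."""
    subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Hypothetical locations; replace them with your own paths.
    exe = r"C:\llama.cpp\llama-cli.exe"
    model = r"C:\models\example.gguf"
    cmd = build_llama_command(exe, model, "Hello")
    print(" ".join(cmd))
    # Only launch the tool when both files are really present.
    if Path(exe).exists() and Path(model).exists():
        run_llama(cmd)
```

Passing the arguments as a list rather than one string avoids quoting problems when a Windows path contains spaces.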

Common issues

Nothing can “open” the file (it is not a document)

GGUF is a binary model container, so double-clicking it in a file manager usually does nothing useful or prompts you to choose an app.

Use a GGUF-capable local LLM tool (for example llama.cpp or another compatible runtime) rather than a document viewer or editor.
Load the model from within the tool (often via a command-line argument pointing to the .gguf path).

Tool says the model/format is unsupported

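Not every GGML-based program supports every model file format or every GGUF version. Builds that predate GGUF do not recognize it, and a newer build can still reject a model whose architecture or quantization type it does not implement.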

Update to a recent version/build of your GGUF-capable tool that explicitly supports GGUF (for example, a current llama.cpp build).
Confirm the file extension matches the actual content and that the file is a genuine GGUF model (not another format renamed).
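
One way to confirm that a file is really GGUF is to look at its first bytes. A GGUF file begins with the four ASCII bytes "GGUF" followed by a 32-bit version number. The sketch below assumes the usual little-endian layout; the function name is hypothetical, and a passing check only shows that the header looks right, not that the whole file is valid.

```python
import struct
import sys

GGUF_MAGIC = b"GGUF"  # first four bytes of every GGUF file


def read_gguf_version(path):
    """Return the GGUF version number, or None if the magic bytes are missing."""
    with open(path, "rb") as f:
        head = f.read(8)
    if len(head) < 8 or head[:4] != GGUF_MAGIC:
        return None
    # Assumption: little-endian file. A big-endian file would show a
    # byte-swapped (very large) version number here.
    (version,) = struct.unpack("<I", head[4:8])
    return version


if __name__ == "__main__":
    if len(sys.argv) > 1:
        version = read_gguf_version(sys.argv[1])
        if version is None:
            print("Not a GGUF file (magic bytes missing)")
        else:
            print(f"GGUF header found, version {version}")
```

If the function returns None for a file named something.gguf, the file is most likely another format that was renamed, or a download that was truncated or corrupted.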

Model loads but inference is extremely slow or fails due to resources

GGUF files can represent large models; even quantized models may require significant RAM/VRAM and compute, depending on model size and your hardware.

Try a smaller GGUF model variant (fewer parameters) or a more heavily quantized GGUF intended for your hardware.
Close other applications to free memory and ensure the runtime is configured for your system (for example, CPU-only vs. available acceleration, as supported by your chosen tool).
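
A rough size estimate helps when choosing between variants: weight storage is approximately the parameter count times the bits per weight, divided by 8. The sketch below applies that arithmetic. The function names and the 1.2 overhead factor (headroom for context and runtime buffers) are assumptions for illustration, not figures from any particular tool, and the effective bits per weight of a real quantized file is often somewhat higher than its nominal figure.

```python
def estimate_weight_bytes(n_params, bits_per_weight):
    """Approximate bytes needed for the weights alone."""
    return int(n_params * bits_per_weight / 8)


def likely_fits(model_bytes, free_bytes, overhead=1.2):
    """Rule-of-thumb check: model size plus assumed headroom vs. free memory."""
    return model_bytes * overhead <= free_bytes


if __name__ == "__main__":
    # Example: a 7-billion-parameter model at 4 bits per weight.
    size = estimate_weight_bytes(7_000_000_000, 4)
    print(size)  # 3500000000 bytes, about 3.5 GB
    # Hypothetical machine with 8 GiB of free RAM.
    print(likely_fits(size, 8 * 1024**3))
```

The same model at 16 bits per weight would need about 14 GB for the weights, which is why a more heavily quantized variant is often the practical choice on a desktop machine.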

Security note

A .gguf file is a binary blob intended to be parsed by ML runtimes; it is not supposed to contain active scripts, but it can still trigger vulnerabilities in the software that parses it if the file is malformed or maliciously crafted.
