.GGUF file extension
To open .GGUF files on Windows, install or download a GGUF-capable local LLM tool (for example llama.cpp, or another desktop runtime that supports GGUF models).
To open a .gguf file, you typically load it into a GGUF-capable local LLM runtime—for example llama.cpp, LM Studio, Ollama with a compatible model, or another desktop tool that explicitly supports GGUF. Most general-purpose apps (text editors, media players, office suites) cannot open GGUF because it is a binary model format, not a document.
Last updated: June 12, 2026
Open on your device
Choose your operating system for a dedicated step-by-step opening guide.
How to open .GGUF files
Use these platform-specific instructions to open .GGUF files safely.
Windows
- Install or download a GGUF-capable local LLM tool (for example llama.cpp, or another desktop runtime that supports GGUF models).
- Run the tool and provide the path to your .gguf model file (for llama.cpp, this is typically done by passing the model file as a command-line argument).
- If the tool reports an unsupported model or format, verify that the tool version supports GGUF and that the model is actually a GGUF file.
Mac
- Install or download a GGUF-capable local LLM tool (for example llama.cpp, or another desktop runtime that supports GGUF models).
- Start the tool and load the .gguf model file (commonly by specifying the file path in a command or configuration).
- If loading fails, try a newer build of the tool that explicitly supports GGUF.
Linux
- Install or build a GGUF-capable local LLM tool (for example llama.cpp, or another runtime that supports GGUF models).
- Run the program and pass your .gguf file as the model input (commonly via a CLI flag or a positional argument, depending on the tool).
- If you get a format/metadata error, ensure the model file is valid GGUF and that your runtime supports the particular model variant.
iOS
- GGUF files are not typically opened directly on iOS; transfer the .gguf to a desktop system and load it with a GGUF-capable local LLM runtime (llama.cpp is one common option).
Android
- GGUF files are not typically opened directly on Android; transfer the .gguf to a desktop system and load it with a GGUF-capable local LLM runtime (llama.cpp is one common option).
Security notes
- A .gguf file is a binary blob intended to be parsed by ML runtimes; it is not supposed to contain active scripts, but it can still trigger vulnerabilities in the software that parses it if the file is malformed or maliciously crafted.
- Prefer loading GGUF models only in well-maintained, up-to-date GGUF-capable software (for example current llama.cpp builds or other actively maintained local LLM tools) because the main risk is in the parser and surrounding runtime.
- Treat GGUF files as untrusted inputs when obtained from unknown sources; they may be very large and can cause resource exhaustion (RAM/disk) when downloaded or loaded.
If you did not expect this file
This extension is usually plain data, text, or structured content—not a program by itself. The practical risk is social engineering (a scam attachment or misleading filename). For trusted senders you rarely need heavy-handed antivirus wording; use these tools when you want an extra check on unexpected downloads.
Avast offers free and premium antivirus software that protects against viruses, malware, ransomware, and phishing. Scan files before opening them to ensure safety.
NortonNorton 360 delivers comprehensive antivirus protection, VPN, and identity theft monitoring. Scan files for threats before opening to keep your device secure.
We may earn a commission when you use affiliate links. This supports our free file extension guides.
Can't open this file?
These are the most common causes and fixes when .GGUF files fail to open.
Common reasons
- Nothing can “open” the file (it is not a document)
- Tool says the model/format is unsupported
- Model loads but inference is extremely slow or fails due to resources
Fix steps
- Use a GGUF-capable local LLM tool (for example llama.cpp or another compatible runtime) rather than a document viewer or editor.
- Load the model from within the tool (often via a command-line argument pointing to the .gguf path).
OS-specific troubleshooting
What is a .GGUF file?
GGUF is a binary format for storing model tensors together with metadata in a single file, intended for inference with GGML and executors built on GGML. Documentation for llama.cpp describes GGUF as the format used to store and distribute quantized language models, optimized for quick loading/saving.
Background
GGUF is associated with the GGML/llama.cpp ecosystem and is primarily used to package and distribute language models for local inference. In practical terms, a GGUF file is usually “the model” you point an inference tool at when you want to run a compatible LLM on your machine.
A key idea in GGUF is bundling both the tensor data (model weights) and metadata into one binary file, which is meant to improve portability and startup time when loading models for inference. This makes it convenient for distributing quantized variants of large models, which are frequently shared as standalone .gguf files.
Because GGUF is a machine-learning model container rather than an interchange format for training pipelines, typical user workflows revolve around choosing a GGUF-capable local inference tool (llama.cpp is a common open-source option; many desktop apps also load GGUF), choosing a GGUF model file, and then running inference (chat/completions) against it. The Hugging Face documentation also describes GGUF as a model file format used for inference with GGML and highlights its use for quick loading/saving in distribution contexts.
Further reading
Authoritative resources for more details on the .GGUF format.
Common .GGUF issues
Nothing can “open” the file (it is not a document)
GGUF is a binary model container, so double-clicking it in a file manager usually does nothing useful or prompts you to choose an app.
- Use a GGUF-capable local LLM tool (for example llama.cpp or another compatible runtime) rather than a document viewer or editor.
- Load the model from within the tool (often via a command-line argument pointing to the .gguf path).
Tool says the model/format is unsupported
Not every GGML-based program supports every model file format or every GGUF variation; older builds may not recognize GGUF.
- Update to a recent version/build of your GGUF-capable tool that explicitly supports GGUF (for example, a current llama.cpp build).
- Confirm the file extension matches the actual content and that the file is a genuine GGUF model (not another format renamed).
Model loads but inference is extremely slow or fails due to resources
GGUF files can represent large models; even quantized models may require significant RAM/VRAM and compute, depending on model size and your hardware.
- Try a smaller GGUF model variant (fewer parameters) or a more heavily quantized GGUF intended for your hardware.
- Close other applications to free memory and ensure the runtime is configured for your system (for example, CPU-only vs. available acceleration, as supported by your chosen tool).
FAQ
Is GGUF the same thing as a “quantized model”?
GGUF is a file format used to store models (tensors and metadata). In the llama.cpp ecosystem it is commonly used to distribute quantized language models, but GGUF itself is the container format, not a specific quantization method.
Can I open a .gguf file in a text editor?
Not meaningfully. GGUF is a binary format; a text editor will show unreadable bytes. Use a GGUF-capable local LLM tool instead.
What should I use GGUF files for?
They are primarily for running inference with GGUF-capable local LLM tools and runtimes (llama.cpp is a widespread open-source example) and for distributing models in a way that loads quickly and includes metadata.
Does GGUF have an official MIME type?
No official IANA-registered media type for GGUF is indicated by the IANA Media Types registry; if you see an application/x-* style type in the wild, treat it as unofficial.
Similar file extensions
Compare related formats in the same category to find the right tool faster.