Stop guessing whether your GPU can run Llama 4 or DeepSeek V3. This tool calculates the exact VRAM requirements for modern Large Language Models based on GGUF quantization standards.
- Engineering-Grade Math: Accurately calculates KV Cache size using GQA (Grouped Query Attention) ratios, so no more false OOM predictions (the arithmetic is sketched after this list).
- Smart Optimization: If a model doesn't fit, the tool calculates the exact Context Window reduction needed to make it run on your hardware.
- Model Discovery: Input your GPU specs and get a list of every compatible model (Green/Yellow/Red status).
- Latest Models (Nov 2025): Native support for Llama 4 (Scout/Maverick), DeepSeek V2.5, Qwen 2.5, Mistral Nemo, and Microsoft Phi-4.
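
For the curious, the KV-cache math has roughly the shape below. This is a minimal sketch, assuming an FP16 cache (2 bytes per element); the function names and the example config are illustrative, not the tool's actual code:

```js
// Minimal sketch of the KV-cache math, assuming an FP16 (2 bytes/element) cache.
function kvCacheBytes(arch, ctxLen, bytesPerEl = 2) {
  const headDim = arch.hiddenSize / arch.heads; // per-head dimension
  // 2x for the K and V tensors; GQA shrinks the cache by kvHeads / heads
  return 2 * arch.layers * arch.kvHeads * headDim * ctxLen * bytesPerEl;
}

// If the model doesn't fit, solve the same formula for the context length:
function maxContext(freeBytes, arch, bytesPerEl = 2) {
  const headDim = arch.hiddenSize / arch.heads;
  const bytesPerToken = 2 * arch.layers * arch.kvHeads * headDim * bytesPerEl;
  return Math.floor(freeBytes / bytesPerToken);
}

// Illustrative 8B-class config: 32 layers, 4096 hidden, 32 heads, 8 KV heads.
const arch = { layers: 32, hiddenSize: 4096, heads: 32, kvHeads: 8 };
console.log(kvCacheBytes(arch, 8192)); // 1073741824 bytes = exactly 1 GiB
```

Note how the GQA ratio matters: with full multi-head attention (32 KV heads instead of 8), the same cache would be 4 GiB, which is how naive calculators end up predicting OOM for models that actually fit.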
Unlike basic calculators that simply multiply the parameter count by 2 bytes, this tool accounts for:
- GGUF Overhead: Uses real-world BPW (Bits Per Weight) measurements (e.g., 4.85 bpw for Q4_K_M).
- Context Window: Calculates the memory footprint of the context using the model's specific architecture (Layers, Hidden Size, Heads, KV Heads).
- System Overhead: Reserves a VRAM buffer for the OS and display output to prevent crashes (these three pieces combine as in the sketch below).
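
Put together, the fit check amounts to something like the following. The helper names, the flat 1.5 GB reserve, and the 90% "comfortable fit" threshold are assumptions for illustration; the tool's real values may differ:

```js
// Minimal sketch of the full estimate. The 1.5 GiB reserve and the 90%
// "comfortable fit" threshold are illustrative assumptions.
const GIB = 1024 ** 3;

function estimateVramBytes(paramsBillion, bpw, kvBytes, reserveBytes = 1.5 * GIB) {
  // Weights: parameter count x measured bits-per-weight, converted to bytes
  const weightBytes = paramsBillion * 1e9 * (bpw / 8);
  return weightBytes + kvBytes + reserveBytes;
}

function fitStatus(requiredBytes, vramBytes) {
  if (requiredBytes <= 0.9 * vramBytes) return "Green"; // fits comfortably
  if (requiredBytes <= vramBytes) return "Yellow";      // tight fit
  return "Red";                                         // does not fit
}

// e.g. an 8B model at Q4_K_M (~4.85 bpw) with the 1 GiB KV cache from above:
const required = estimateVramBytes(8, 4.85, 1 * GIB); // ~7.5e9 bytes
console.log(fitStatus(required, 8 * GIB)); // "Green" on an 8 GB card, barely
```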
This is a client-side tool (HTML/JS). You can run it locally or use the live version at gpuforllm.com.
- Found a bug? Open an Issue.
- New model released? Open an Issue with the `config.json` details.
(Note: While the code is open source under MIT, the "GPUforLLM" branding, logo, and site content are the property of the creator.)
This project is licensed under the MIT License - see the LICENSE file for details. However, please link back to gpuforllm.com if you use this logic in your own projects.