Local LLM Compatibility Dataset
90 models by params, quantization, and memory load — which run locally on Apple Silicon and NVIDIA GPUs. Free under CC BY 4.0. Updated 2026-06-15.
This is ModelFit's open compatibility dataset: every model the site tracks, with the parameter size, quantization, minimum RAM, and estimated memory load used to decide what runs locally. Reuse it freely with attribution (CC BY 4.0) — credit ModelFit (modelfit.io). Machine-readable version: /api/dataset/, or get the CSV + JSON on GitHub.
| Model | Family | Params | Quant | Min RAM | Est. load | Local | ollama |
|---|---|---|---|---|---|---|---|
| Claude Fable 5 | Claude | 0B | API | 0 GB | — | Cloud | — |
| Claude 3.5 Sonnet | Claude | 120B | API | 0 GB | — | Cloud | — |
| Claude 3.7 Sonnet | Claude | 140B | API | 0 GB | — | Cloud | — |
| Claude 4 Sonnet | Claude | 160B | API | 0 GB | — | Cloud | — |
| Claude 3 Opus | Claude | 220B | API | 0 GB | — | Cloud | — |
| Claude 4 Opus | Claude | 240B | API | 0 GB | — | Cloud | — |
| Claude Opus 4.7 | Claude | 260B | API | 0 GB | — | Cloud | — |
| Claude Opus 4.8 | Claude | 270B | API | 0 GB | — | Cloud | — |
| DeepSeek-R1 Distill Qwen 7B | DeepSeek | 7B | Q4_K_M | 10 GB | ~5.5 GB | Yes | deepseek-r1:7b |
| DeepSeek-R1 Distill Qwen 14B | DeepSeek | 14B | Q4_K_M | 22 GB | ~11 GB | Yes | deepseek-r1:14b |
| DeepSeek-R1 Distill Llama 70B | DeepSeek | 70B | Q4_K_M | 48 GB | ~42 GB | Yes | deepseek-r1:70b |
| DeepSeek V4 Flash | DeepSeek | 284B | API | 0 GB | — | Cloud | — |
| DeepSeek-R1 671B | DeepSeek | 671B | Q4_K_M | 400 GB | ~380 GB | Yes | deepseek-r1:671b-q4_K_M |
| DeepSeek-V3 | DeepSeek | 671B | API | 0 GB | — | Cloud | — |
| DeepSeek-V3-0324 | DeepSeek | 671B | API | 0 GB | — | Cloud | — |
| DeepSeek-R1 | DeepSeek | 671B | API | 0 GB | — | Cloud | — |
| DeepSeek V4 Pro | DeepSeek | 1600B | API | 0 GB | — | Cloud | — |
| Gemma 3 1B Instruct | Gemma | 1B | Q4_K_M | 3 GB | ~1 GB | Yes | gemma3:1b |
| Gemma 2 2B Instruct | Gemma | 2B | Q4_K_M | 5 GB | ~1.8 GB | Yes | gemma2:2b-instruct-q4_K_M |
| Gemma 4 E2B | Gemma | 2.3B | Q4_K_M | 4 GB | ~2.3 GB | Yes | gemma4:e2b |
| Gemma 3 4B Instruct | Gemma | 4B | Q4_K_M | 8 GB | ~3.5 GB | Yes | gemma3:4b |
| Gemma 4 E4B | Gemma | 4.5B | Q4_K_M | 8 GB | ~4 GB | Yes | gemma4:e4b |
| Gemma 2 9B Instruct | Gemma | 9B | Q4_K_M | 14 GB | ~7 GB | Yes | gemma2:9b-instruct-q4_K_M |
| Gemma 3 12B Instruct | Gemma | 12B | Q4_K_M | 18 GB | ~9.5 GB | Yes | gemma3:12b |
| Gemma 4 12B | Gemma | 12B | Q4_K_M | 16 GB | ~8 GB | Yes | gemma4:12b |
| Gemma 4 26B-A4B | Gemma | 26B | Q4_K_M | 24 GB | ~16 GB | Yes | gemma4:26b |
| Gemma 2 27B Instruct | Gemma | 27B | Q4_K_M | 28 GB | ~21 GB | Yes | gemma2:27b-instruct-q4_K_M |
| Gemma 3 27B Instruct | Gemma | 27B | Q4_K_M | 28 GB | ~21 GB | Yes | gemma3:27b |
| Gemma 4 31B | Gemma | 31B | Q4_K_M | 32 GB | ~20 GB | Yes | gemma4:31b |
| Gemini 2.5 Flash | 80B | API | 0 GB | — | Cloud | — | |
| Gemini 2.5 Pro | 175B | API | 0 GB | — | Cloud | — | |
| Gemini 3.1 Pro | 300B | API | 0 GB | — | Cloud | — | |
| Granite 4.1 3B Instruct | Granite | 3B | Q4_K_M | 6 GB | ~2 GB | Yes | granite4.1:3b |
| Granite 4.1 8B Instruct | Granite | 8B | Q4_K_M | 12 GB | ~5.5 GB | Yes | granite4.1:8b |
| Kimi K2 Instruct | Kimi | 1000B | API | 0 GB | — | Cloud | — |
| Kimi K2.6 | Kimi | 1000B | API | 0 GB | — | Cloud | — |
| Kimi K2.7-Code | Kimi | 1000B | API | 0 GB | — | Cloud | — |
| LFM2.5 8B-A1B | LFM2 | 8.3B | Q4_K_M | 10 GB | ~5.5 GB | Yes | lfm2.5:8b-a1b-q4_K_M |
| LFM2 24B-A2B Instruct | LFM2 | 24B | Q4_K_M | 16 GB | ~14 GB | Yes | lfm2:24b-a2b |
| Llama 3.2 1B Instruct | Llama | 1B | Q4_K_M | 3 GB | ~1 GB | Yes | llama3.2:1b-instruct-q4_K_M |
| Llama 3.2 3B Instruct | Llama | 3B | Q4_K_M | 6 GB | ~2.5 GB | Yes | llama3.2:3b-instruct-q4_K_M |
| Llama 3.1 8B Instruct | Llama | 8B | Q4_K_M | 12 GB | ~6.5 GB | Yes | llama3.1:8b-instruct-q4_K_M |
| Llama 3.1 8B Instruct (Q5) | Llama | 8B | Q5_K_M | 14 GB | ~8 GB | Yes | llama3.1:8b-instruct-q5_K_M |
| Llama 3.1 70B Instruct | Llama | 70B | Q4_K_M | 48 GB | ~42 GB | Yes | llama3.1:70b-instruct-q4_K_M |
| Llama 3.3 70B Instruct | Llama | 70B | Q4_K_M | 48 GB | ~42 GB | Yes | llama3.3:70b-instruct-q4_K_M |
| Llama 4 Scout | Llama | 109B | Q4_K_M | 80 GB | ~67 GB | Yes | llama4:scout |
| Llama 4 Maverick | Llama | 400B | Q4_K_M | 256 GB | ~245 GB | Yes | llama4:maverick |
| Llama 3.1 405B Instruct | Llama | 405B | Q4_K_M | 256 GB | ~243 GB | Yes | llama3.1:405b-instruct-q4_K_M |
| MiniMax M3 | MiniMax | 428B | API | 0 GB | — | Cloud | — |
| Mistral 7B Instruct | Mistral | 7B | Q4_K_M | 10 GB | ~5.5 GB | Yes | mistral:7b-instruct-q4_K_M |
| Mistral Nemo 12B | Mistral | 12B | Q4_K_M | 18 GB | ~9.5 GB | Yes | mistral-nemo:12b |
| Mistral Small 22B | Mistral | 22B | Q4_K_M | 26 GB | ~17 GB | Yes | mistral-small:22b |
| Mistral Small 3.1 | Mistral | 24B | Q4_K_M | 24 GB | ~15 GB | Yes | mistral-small3.1:24b |
| Mixtral 8x7B Instruct | Mistral | 46.7B | Q4_K_M | 36 GB | ~30 GB | Yes | mixtral:8x7b |
| Mistral Medium 3.5 | Mistral | 128B | API | 0 GB | — | Cloud | — |
| NVIDIA Nemotron 3 Ultra | Nemotron | 550B | API | 0 GB | — | Cloud | — |
| GPT-4o mini | OpenAI | 40B | API | 0 GB | — | Cloud | — |
| GPT-4o | OpenAI | 200B | API | 0 GB | — | Cloud | — |
| GPT-5.5 | OpenAI | 400B | API | 0 GB | — | Cloud | — |
| Phi-3 Mini 3.8B | Phi | 3.8B | Q4_K_M | 7 GB | ~3.2 GB | Yes | phi3:mini |
| Phi-4 Mini 3.8B | Phi | 3.8B | Q4_K_M | 7 GB | ~3.2 GB | Yes | phi4-mini:3.8b |
| Phi-3 Medium 14B | Phi | 14B | Q4_K_M | 20 GB | ~11 GB | Yes | phi3:medium |
| Phi-4 14B | Phi | 14B | Q4_K_M | 22 GB | ~11.5 GB | Yes | phi4:14b-q4_K_M |
| Qwen3.7-Plus | Qwen | 0B | API | 0 GB | — | Cloud | — |
| Qwen2.5 0.5B Instruct | Qwen | 0.5B | Q4_K_M | 2 GB | ~0.8 GB | Yes | qwen2.5:0.5b-instruct-q4_K_M |
| Qwen3.5 0.8B Instruct | Qwen | 0.8B | Q4_K_M | 2 GB | ~0.8 GB | Yes | qwen3.5:0.8b |
| Qwen2.5 1.5B Instruct | Qwen | 1.5B | Q4_K_M | 4 GB | ~1.5 GB | Yes | qwen2.5:1.5b-instruct-q4_K_M |
| Qwen3.5 2B Instruct | Qwen | 2B | Q4_K_M | 4 GB | ~1.8 GB | Yes | qwen3.5:2b |
| Qwen2.5 3B Instruct | Qwen | 3B | Q4_K_M | 6 GB | ~2.5 GB | Yes | qwen2.5:3b-instruct-q4_K_M |
| Qwen3.5 4B Instruct | Qwen | 4B | Q4_K_M | 8 GB | ~3.5 GB | Yes | qwen3.5:4b |
| Qwen2.5 7B Instruct | Qwen | 7B | Q4_K_M | 10 GB | ~5.5 GB | Yes | qwen2.5:7b-instruct-q4_K_M |
| Qwen2.5 Coder 7B | Qwen | 7B | Q4_K_M | 10 GB | ~5.5 GB | Yes | qwen2.5-coder:7b |
| Qwen3 8B | Qwen | 8B | Q4_K_M | 12 GB | ~6.5 GB | Yes | qwen3:8b-q4_K_M |
| Qwen3.5 9B Instruct | Qwen | 9B | Q4_K_M | 14 GB | ~7 GB | Yes | qwen3.5:9b |
| Qwen2.5 14B Instruct | Qwen | 14B | Q4_K_M | 20 GB | ~11 GB | Yes | qwen2.5:14b-instruct-q4_K_M |
| Qwen2.5 Coder 14B | Qwen | 14B | Q4_K_M | 22 GB | ~11 GB | Yes | qwen2.5-coder:14b |
| Qwen3 14B | Qwen | 14B | Q4_K_M | 20 GB | ~11 GB | Yes | qwen3:14b-q4_K_M |
| Qwen3.5 27B Instruct | Qwen | 27B | Q4_K_M | 20 GB | ~16 GB | Yes | qwen3.5:27b |
| Qwen3.6 27B | Qwen | 27B | Q4_K_M | 24 GB | ~18 GB | Yes | qwen3.6:27b |
| Qwen3 30B | Qwen | 30B | Q4_K_M | 28 GB | ~22 GB | Yes | qwen3:30b |
| Qwen3.5 35B-A3B Instruct | Qwen | 35B | Q4_K_M | 24 GB | ~20 GB | Yes | qwen3.5:35b-a3b |
| Qwen3.6 35B-A3B | Qwen | 35B | Q4_K_M | 24 GB | ~22 GB | Yes | qwen3.6:35b-a3b |
| Qwen3.5 122B-A10B Instruct | Qwen | 122B | Q4_K_M | 96 GB | ~72 GB | Yes | qwen3.5:122b-a10b |
| Qwen3 235B A22B | Qwen | 235B | Q4_K_M | 192 GB | ~130 GB | Yes | qwen3:235b-a22b-q4_K_M |
| SmolLM2 360M | SmolLM | 0.36B | Q4_K_M | 1 GB | ~0.5 GB | Yes | smollm2:360m |
| Grok 4.3 | xAI | 314B | API | 0 GB | — | Cloud | — |
| GLM-5.2 | Zhipu | 0B | API | 0 GB | — | Cloud | — |
| GLM-5 | Zhipu | 32B | API | 0 GB | — | Cloud | — |
| GLM-4 Plus | Zhipu | 80B | API | 0 GB | — | Cloud | — |
| GLM-5.1 | Zhipu | 744B | API | 0 GB | — | Cloud | — |
Estimated load = approximate memory at Q4_K_M; estimates, not measured. See the hardware stats for RAM-tier guidance.