Best Local LLMs for Goose
Block's open-source autonomous developer agent (MCP-driven)
Goose (by Block) is an open-source autonomous developer agent (CLI and desktop) that executes, edits, and tests code using any LLM. It runs fully local via Ollama and is built around MCP extensions, so it leans almost entirely on the model's tool-calling ability. Block states it plainly: models without tool calling can only do chat completion, and every extension must be disabled. Choose accordingly.

Best pick
Qwen2.5 Coder 14B
Qwen2.5 is Goose's explicitly recommended local family; 14B is large enough to call tools consistently.
What Goose needs
Reliable native JSON tool/function calling plus enough context (8K+) — without it, Goose can only chat, because every extension and action is a tool call.
Goose Local LLM Tier List
Qwen2.5 is Goose's explicitly recommended local family; 14B is large enough to call tools consistently.
Large Qwen MoE — strong native tool calling with headroom for many MCP tools.
Recommended Llama family; 70B is the most reliable local tier for multi-tool agent loops.
Large Qwen3 MoE with native tool calling; a popular Goose-local choice.
Dense large Qwen; reliable tool calls.
Recommended Qwen coder; works but smaller, so slightly less consistent on the 11-tool default set.
Goose-recommended family with native function calling; solid mid-size agent model.
Tool calling present, but ~9B struggles to stay consistent across the full extension set.
Usable for few-tool setups; flaky once many tools are enabled.
Recommended family but small — reported tool-calling failures on M-series in community threads.
Flagged as lacking native tool calling in Goose discussions.
Gemma family reported with no native tool calling for Goose; chat only.
Native R1 does not support tool calling; the 14B distill is too small to compensate.
Tiers weigh tool-calling reliability, context window, and coding quality for Goose specifically — a model can rank higher for one tool than another. RAM figures are for Q4 quantization. Sources are listed below.
Local setup notes
Configure Goose with the Ollama provider. Raise OLLAMA_CONTEXT_LENGTH to 8K+ so the model can hold Goose's MCP tool definitions (the 4K default is too small and silently drops extensions and .goosehints). Stick to native-tool-calling families.
Goose official site ↗The weekly local-AI refresh
New open-weight models, real Apple Silicon benchmarks, and the one model worth running on your Mac this week. Free, one email a week, unsubscribe anytime.
Frequently Asked Questions
What is the best local model for Goose?+
Why does Goose ignore my extensions or .goosehints with local models?+
Can I use Gemma, Phi-4, or DeepSeek-R1 with Goose?+
Sources
Other AI Coding Tools
Claude Code
CloudAnthropic's terminal coding agent — pointed at a local model
OpenCode
Open-source terminal coding agent (Ollama / OpenAI-compatible)
OpenClaw
Self-hosted agentic assistant with first-class Ollama support
Aider
Terminal AI pair-programmer with its own edit-format leaderboard
Cline
Autonomous coding agent for VS Code (formerly Claude Dev)
Roo Code
Autonomous VS Code agent with modes (a Cline fork)
Continue.dev
Open-source chat, edit & autocomplete for VS Code / JetBrains
Open Claude Code
Open-source Claude Code CLI reimplementation, run on local models
Codex
CloudOpenAI's coding agent — ranked across the GPT-5 model lineup
Cursor
CloudThe AI code editor — a curated cross-vendor frontier lineup