SmolLM: The Tiniest Local AI Model

Hugging Face's SmolLM proves that AI can run on nearly anything. SmolLM2 360M loads in roughly half a gigabyte and needs only 1GB of RAM, the lowest requirement in our entire database. It is also the fastest model we track by speed score. Quality is limited at this size, so think of SmolLM as a specialist: text classification, simple extraction, short completions, and basic chat on devices that cannot fit anything larger. For older iPhones, entry-level Macs, and embedded projects, it is the minimum viable AI.

Hugging Face: 1 local model
DEVELOPER: Hugging Face
MODELS: 1
SIZE RANGE: 0.36B
RAM RANGE: 1 GB
Key Features
Tiny 360M-parameter model from Hugging Face
Loads in about 0.5GB, needs just 1GB of RAM
Highest speed score of any model in our database
Runs on old Macs, iPhones, and embedded or IoT devices
Handles classification, extraction, and short text tasks
One-line setup: `ollama run smollm2:360m`

All SmolLM Models

| Model | Size | Quant | VRAM | Min RAM | Best For | Quality | Ollama |
|---|---|---|---|---|---|---|---|
| SmolLM2 360M | 0.36B | Q4_K_M | 0.5 GB | 1 GB | Chat, Embedded | 38 | `smollm2:360m` |

Device Compatibility

Which SmolLM models can run on each device class, based on minimum RAM requirements.

| Model | iPhone | Air | Pro | Studio | Mini |
|---|---|---|---|---|---|
| SmolLM2 360M (0.36B) | Excellent | Excellent | Excellent | Excellent | Excellent |

RAM Requirements

SmolLM2 360M: about 0.5 GB loaded, 1 GB minimum RAM

Frequently Asked Questions

What tasks suit a sub-1B model like SmolLM?
Short, narrow tasks: text classification, keyword and field extraction, autocomplete-style completions, and simple one-turn Q&A. Multi-step reasoning, long conversations, and coding are beyond a 360M model. Step up to a 2-4B model for those.
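
A minimal sketch of how a narrow classification task might be framed for a model this small, assuming a hypothetical three-label support-ticket setup. The label names, `build_prompt`, and `parse_label` are illustrative and not part of SmolLM or Ollama. The idea is to keep the prompt short and map the reply onto a known label set, since free-form output from a very small model may not match a label exactly.

```python
# Hypothetical label set; replace with the categories your task needs.
LABELS = ["billing", "technical", "other"]


def build_prompt(text, labels=LABELS):
    """Return a short, single-turn classification prompt."""
    options = ", ".join(labels)
    return (
        f"Classify the message into one of: {options}.\n"
        f"Message: {text}\n"
        "Answer with the label only.\n"
        "Label:"
    )


def parse_label(reply, labels=LABELS, fallback="other"):
    """Map a free-form model reply onto a known label, or fall back."""
    cleaned = reply.strip().lower()
    for label in labels:
        if label in cleaned:
            return label
    return fallback
```

The string returned by `build_prompt` would be sent to the model, and `parse_label` applied to whatever comes back, so an unexpected reply degrades to the fallback label instead of breaking the caller.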
Which devices can run SmolLM?
Almost all of them. With a 0.5GB load and a 1GB RAM floor, SmolLM2 360M runs on every Apple Silicon Mac, older Intel Macs, iPhones, and small single-board computers used in IoT projects.
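
As a rough illustration of the arithmetic behind that answer, the check below compares a device's total RAM against the 1 GB floor and the 0.5 GB load figure quoted on this page. The function names and the example RAM sizes are hypothetical, and real headroom also depends on what the operating system and other apps are already using.

```python
MIN_RAM_GB = 1.0  # minimum RAM listed on this page for SmolLM2 360M
LOAD_GB = 0.5     # approximate loaded size listed on this page


def can_run(device_ram_gb, min_ram_gb=MIN_RAM_GB):
    """True if the device meets the listed minimum RAM."""
    return device_ram_gb >= min_ram_gb


def headroom_gb(device_ram_gb, load_gb=LOAD_GB):
    """RAM left over after the model is loaded, ignoring OS and app usage."""
    return device_ram_gb - load_gb


# Hypothetical devices, keyed by description, with total RAM in GB.
devices = {"1 GB single-board computer": 1.0, "8 GB laptop": 8.0}
for name, ram in devices.items():
    print(f"{name}: can_run={can_run(ram)}, headroom={headroom_gb(ram)} GB")
```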
Is SmolLM useful or just a demo?
It is genuinely useful for specific jobs: classification, extraction, and basic chat on devices that cannot run anything larger. It also works as a zero-cost test bed before wiring a bigger model into the same pipeline.
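
One way to use it as a test bed, sketched under assumptions: write the pipeline against a plain text-in, text-out callable, so the model behind it can be swapped later without touching the pipeline code. `extract_fields` and `fake_model` are hypothetical names; `fake_model` stands in for whatever function actually calls the model.

```python
from typing import Callable, Dict, List


def extract_fields(texts: List[str], generate: Callable[[str], str]) -> List[Dict[str, str]]:
    """Run a one-line extraction prompt over each text with any text-in, text-out model."""
    results = []
    for text in texts:
        prompt = f"Extract the city mentioned in this sentence.\nSentence: {text}\nCity:"
        reply = generate(prompt)
        # Keep only the first line of the reply; small models may keep generating.
        lines = reply.strip().splitlines()
        results.append({"text": text, "city": lines[0].strip() if lines else ""})
    return results


def fake_model(prompt: str) -> str:
    """Stand-in for a real model call, used only so the sketch runs offline."""
    return " Paris\nextra text"
```

Calling `extract_fields(["I flew to Paris."], fake_model)` exercises the pipeline with no model at all. Passing a function that calls SmolLM, and later one that calls a larger model, leaves the rest unchanged.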
What Ollama command runs SmolLM?
Run `ollama run smollm2:360m`. The download is small and the model starts almost instantly, even on machines with very little free memory.
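
Beyond the interactive command, a running Ollama instance also serves a local HTTP API. The sketch below assumes Ollama is running on its default port (11434) and that `smollm2:360m` has already been downloaded. `build_request` and `generate` are illustrative helper names, not part of Ollama.

```python
import json
import urllib.request

# Default local endpoint for Ollama's generate API (assumes a default install).
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt, model="smollm2:360m"):
    """Build a non-streaming POST request for Ollama's /api/generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt, model="smollm2:360m", timeout=60):
    """Send the prompt to the local Ollama server and return the generated text."""
    with urllib.request.urlopen(build_request(prompt, model), timeout=timeout) as resp:
        return json.loads(resp.read())["response"]
```

With the server running, `print(generate("Reply with one word: what color is the sky?"))` would print the model's reply. Setting `"stream": False` asks for a single JSON object rather than a stream of partial responses.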
