← Back to all models
L

Llama 3.2 1B

by Meta
Efficient Free & Open Source 🏆 Ranked #84 of 85
35.8
Overall Score
out of 100
About

Meta's smallest Llama model, designed for on-device and embedded deployments. Llama 3.2 1B runs entirely on CPU and low-power devices with a 128K context window despite its tiny footprint.

Key Metrics
Context Window
128K
tokens
Avg Response
80
milliseconds
Input Cost
$0.01
per million tokens
Output Cost
$0.01
per million tokens
Arena ELO
1070
Chatbot Arena rating
MT-Bench
6.5
out of 10
Benchmark Scores
MMLU
49.3%
HumanEval
38.0%
MATH
25.0%
GPQA
15.0%
MT-Bench
65.0/10
Capability Profile
Strengths & Limitations
Strengths
✓ Ultra-tiny ✓ Runs on CPU ✓ Open source ✓ Edge deployment ✓ Very fast
Limitations
⚠ Very limited capability ⚠ Not for complex tasks ⚠ Basic knowledge base
Ideal Use Cases
On-device AI IoT Mobile Offline assistants Simple classification
Model Details
Provider Meta
Released 2024-09-25
Type Free & Open Source
Multimodal No
Tier Efficient
Global rank #84 / 85
Pricing (USD)
Input tokens $0.01/M
Output tokens $0.01/M
Per 1,000 tokens ≈ $0.0000 input / $0.0000 output
All Benchmarks
MMLU 49.3%
HumanEval 38.0%
MATH 25.0%
GPQA 15.0%
MT-Bench 6.5/10
Arena ELO 1070
Compare this model View Rankings

You might also consider