← Back to all models
G

Gemma 3 1B

by Google DeepMind
Efficient Free & Open Source 🏆 Ranked #85 of 85
35.5
Overall Score
out of 100
About

Google DeepMind's smallest Gemma 3 model at 1B parameters, designed for on-device inference with a 32K context window. Suitable for edge applications where memory is the primary constraint.

Key Metrics
Context Window
32K
tokens
Avg Response
90
milliseconds
Input Cost
$0.01
per million tokens
Output Cost
$0.01
per million tokens
Arena ELO
1050
Chatbot Arena rating
MT-Bench
6.2
out of 10
Benchmark Scores
MMLU
44.0%
HumanEval
40.0%
MATH
32.0%
GPQA
18.0%
MT-Bench
62.0/10
Capability Profile
Strengths & Limitations
Strengths
✓ Ultra-compact ✓ Open source ✓ On-device inference ✓ Low memory ✓ Fast
Limitations
⚠ Very limited capability ⚠ Not for complex tasks ⚠ Below larger Gemma models
Ideal Use Cases
On-device AI Embedded systems Mobile apps Simple assistants Edge deployment
Model Details
Provider Google DeepMind
Released 2025-03-12
Type Free & Open Source
Multimodal No
Tier Efficient
Global rank #85 / 85
Pricing (USD)
Input tokens $0.01/M
Output tokens $0.01/M
Per 1,000 tokens ≈ $0.0000 input / $0.0000 output
All Benchmarks
MMLU 44.0%
HumanEval 40.0%
MATH 32.0%
GPQA 18.0%
MT-Bench 6.2/10
Arena ELO 1050
Compare this model View Rankings

You might also consider