← Back to all models
G

Gemini 2.0 Flash

by Google DeepMind
Efficient Proprietary Multimodal 🏆 Ranked #16 of 22
75.7
Overall Score
out of 100
About

Google DeepMind's next-generation fast model offering impressive performance at a fraction of the cost. Gemini 2.0 Flash brings multimodal capabilities and a massive context window to real-time applications.

Key Metrics
Context Window
1.0M
tokens
Avg Response
520
milliseconds
Input Cost
$0.1
per million tokens
Output Cost
$0.4
per million tokens
Arena ELO
1252
Chatbot Arena rating
MT-Bench
8.8
out of 10
Benchmark Scores
MMLU
85.0%
HumanEval
87.4%
MATH
73.0%
GPQA
51.0%
MT-Bench
88.0/10
Capability Profile
Strengths & Limitations
Strengths
✓ Extremely fast ✓ Low cost ✓ Large context window ✓ Multimodal ✓ Agentic capabilities
Limitations
⚠ Less depth than Pro model ⚠ Newer with less community testing
Ideal Use Cases
Real-time applications High-volume tasks Chatbots Quick analysis Agentic workflows
Model Details
Provider Google DeepMind
Released 2025-01-21
Open source No
Multimodal Yes
Tier Efficient
Global rank #16 / 22
Pricing (USD)
Input tokens $0.1/M
Output tokens $0.4/M
Per 1,000 tokens ≈ $0.0001 input / $0.0004 output
All Benchmarks
MMLU 85.0%
HumanEval 87.4%
MATH 73.0%
GPQA 51.0%
MT-Bench 8.8/10
Arena ELO 1252
Compare this model View leaderboard

You might also consider