← Back to all models
C

Claude 3.7 Sonnet

by Anthropic
Flagship Paid API Multimodal 🏆 Ranked #8 of 85
88.6
Overall Score
out of 100
About

Anthropic's breakthrough model introducing extended thinking — the ability to reason step-by-step before responding. Claude 3.7 Sonnet achieves best-in-class MATH scores and strong coding, making it Anthropic's strongest release at the Sonnet price point.

Key Metrics
Context Window
200K
tokens
Avg Response
850
milliseconds
Input Cost
$3.0
per million tokens
Output Cost
$15.0
per million tokens
Arena ELO
1355
Chatbot Arena rating
MT-Bench
9.1
out of 10
Benchmark Scores
MMLU
90.7%
HumanEval
93.0%
MATH
96.2%
GPQA
70.0%
MT-Bench
91.0/10
Capability Profile
Strengths & Limitations
Strengths
✓ Extended thinking mode ✓ Best-in-class MATH ✓ Strong coding ✓ 200K context ✓ Safety-focused
Limitations
⚠ Slower in thinking mode ⚠ Higher cost ⚠ Verbose on simple tasks
Ideal Use Cases
Advanced mathematics Complex coding Research Agentic workflows Scientific reasoning
Model Details
Provider Anthropic
Released 2025-02-24
Type Paid API
Multimodal Yes
Tier Flagship
Global rank #8 / 85
Pricing (USD)
Input tokens $3.0/M
Output tokens $15.0/M
Per 1,000 tokens ≈ $0.0030 input / $0.0150 output
All Benchmarks
MMLU 90.7%
HumanEval 93.0%
MATH 96.2%
GPQA 70.0%
MT-Bench 9.1/10
Arena ELO 1355
Compare this model View Rankings

You might also consider