DeepSeek R1 Distill 70B

by DeepSeek
Flagship · Free & Open Source · 🏆 Ranked #28 of 85
Overall Score 78.5 out of 100
About

A Llama-3.3-70B model distilled from the full DeepSeek R1 reasoning model, inheriting chain-of-thought reasoning capabilities at a fraction of the compute cost. One of the strongest open-source reasoning models available.

Key Metrics
Context Window 128K tokens
Avg Response 1800 milliseconds
Input Cost $0.59 per million tokens
Output Cost $0.99 per million tokens
Arena ELO 1250 (Chatbot Arena rating)
MT-Bench 8.9 out of 10
Benchmark Scores
MMLU 84.0%
HumanEval 86.0%
MATH 90.0%
GPQA 55.0%
MT-Bench 8.9/10
Strengths & Limitations
Strengths
✓ Strong reasoning
✓ Open source
✓ Distilled from R1
✓ Maths excellence
✓ Chain-of-thought
Limitations
⚠ Slower due to reasoning
⚠ Requires 40GB+ VRAM
⚠ Verbose outputs
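
For context on the VRAM figure, here is a rough sketch of the memory needed for the weights alone. It assumes a round 70 billion parameters (taken from the "70B" in the model name) and the usual bytes-per-parameter for each precision. The function name and the precision labels are illustrative, and the estimate ignores the KV cache, activations, and runtime overhead, so real requirements will be higher.

```python
# Assumption: parameter count rounded to 70e9 from the "70B" in the model name.
PARAMS = 70e9

# Bytes used to store one parameter at common precisions.
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(params: float, precision: str) -> float:
    """Return the approximate size of the weights in GB (1 GB = 1e9 bytes)."""
    return params * BYTES_PER_PARAM[precision] / 1e9


if __name__ == "__main__":
    for precision in BYTES_PER_PARAM:
        print(f"{precision}: ~{weight_memory_gb(PARAMS, precision):.0f} GB for weights only")
```

Under these assumptions the weights come to about 35 GB at 4-bit precision, which suggests the 40GB+ figure refers to a quantized deployment with some headroom rather than full 16-bit weights.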
Ideal Use Cases
Mathematical reasoning
Scientific problems
Research
Complex analysis
Step-by-step solutions
Model Details
Provider DeepSeek
Released 2025-01-20
Type Free & Open Source
Multimodal No
Tier Flagship
Global rank #28 / 85
Pricing (USD)
Input tokens $0.59/M
Output tokens $0.99/M
Per 1,000 tokens ≈ $0.0006 input / $0.0010 output
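
The per-token rates above can be turned into a per-request estimate. The following is a minimal sketch that uses only the listed prices. The function name and the example token counts are hypothetical, and it assumes reasoning tokens are billed as output tokens, which this page does not state.

```python
# Listed prices in USD per million tokens.
INPUT_USD_PER_M = 0.59
OUTPUT_USD_PER_M = 0.99


def request_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: a 2,000-token prompt and an 8,000-token answer,
    # reflecting the verbose chain-of-thought output noted under Limitations.
    cost = request_cost_usd(2_000, 8_000)
    print(f"${cost:.4f}")
```

Because outputs tend to be long for a reasoning model, the output rate usually dominates the total.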
All Benchmarks
MMLU 84.0%
HumanEval 86.0%
MATH 90.0%
GPQA 55.0%
MT-Bench 8.9/10
Arena ELO 1250