← Back to all models
L

Llama 3.1 8B

by Meta
Efficient Free & Open Source 🏆 Ranked #75 of 85
60.5
Overall Score
out of 100
About

Meta's lightweight 8B model from the Llama 3.1 family. The most accessible large language model for consumer hardware — runs on a laptop GPU with 8GB VRAM whilst punching well above its weight class.

Key Metrics
Context Window
128K
tokens
Avg Response
400
milliseconds
Input Cost
$0.05
per million tokens
Output Cost
$0.1
per million tokens
Arena ELO
1170
Chatbot Arena rating
MT-Bench
8.2
out of 10
Benchmark Scores
MMLU
73.0%
HumanEval
72.6%
MATH
51.9%
GPQA
32.8%
MT-Bench
82.0/10
Capability Profile
Strengths & Limitations
Strengths
✓ Runs on consumer hardware ✓ Very fast ✓ Open source ✓ Low memory footprint ✓ Fine-tuneable
Limitations
⚠ Less capable than larger models ⚠ Limited complex reasoning ⚠ Smaller knowledge base
Ideal Use Cases
Edge deployment Personal AI assistants Rapid prototyping Chatbots Lightweight fine-tuning
Model Details
Provider Meta
Released 2024-07-23
Type Free & Open Source
Multimodal No
Tier Efficient
Global rank #75 / 85
Pricing (USD)
Input tokens $0.05/M
Output tokens $0.1/M
Per 1,000 tokens ≈ $0.0001 input / $0.0001 output
All Benchmarks
MMLU 73.0%
HumanEval 72.6%
MATH 51.9%
GPQA 32.8%
MT-Bench 8.2/10
Arena ELO 1170
Compare this model View Rankings

You might also consider