← Back to all models
G

Grok 4.1 Fast

by xAI
Efficient Paid API Multimodal 🏆 Ranked #37 of 85
76.9
Overall Score
out of 100
About

xAI's fast vision-language model with a 2M token context window, combining visual reasoning with near real-time response speeds. Built for high-throughput production workloads.

Key Metrics
Context Window
2.0M
tokens
Avg Response
500
milliseconds
Input Cost
$3.0
per million tokens
Output Cost
$15.0
per million tokens
Arena ELO
1270
Chatbot Arena rating
MT-Bench
8.8
out of 10
Benchmark Scores
MMLU
84.0%
HumanEval
84.0%
MATH
78.0%
GPQA
55.0%
MT-Bench
88.0/10
Capability Profile
Strengths & Limitations
Strengths
✓ 2M context window ✓ Vision capable ✓ Fast response ✓ Real-time web access ✓ High throughput
Limitations
⚠ Less depth than full Grok ⚠ Newer with limited public benchmarks
Ideal Use Cases
Real-time analysis Vision tasks High-volume chatbots Web search integration Production workloads
Model Details
Provider xAI
Released 2025-05-01
Type Paid API
Multimodal Yes
Tier Efficient
Global rank #37 / 85
Pricing (USD)
Input tokens $3.0/M
Output tokens $15.0/M
Per 1,000 tokens ≈ $0.0030 input / $0.0150 output
All Benchmarks
MMLU 84.0%
HumanEval 84.0%
MATH 78.0%
GPQA 55.0%
MT-Bench 8.8/10
Arena ELO 1270
Compare this model View Rankings

You might also consider