o3
OpenAI's most powerful reasoning model, using extended chain-of-thought to tackle the hardest problems in mathematics, science, and coding. o3 sets new standards on GPQA and competitive maths at the cost of higher latency and price.
You might also consider
Anthropic's most powerful and intelligent model, built for the most demanding tasks where quality outweighs cost. Claude Opus 4 leads on complex multi-step reasoning, graduate-level science, and nuanced long-form writing.
The latest and most capable Sonnet model to date. Claude Sonnet 4.6 brings further gains in mathematical reasoning and instruction following, making it Anthropic's most well-rounded model at the Sonnet price point.
DeepSeek's open-source reasoning model trained with reinforcement learning to rival OpenAI's o1. DeepSeek R1 achieves exceptional scores on mathematics and scientific reasoning benchmarks, making advanced chain-of-thought reasoning accessible to everyone.