All Models
Browse all 22 models grouped by provider.
OpenAI
4 models
OpenAI's most powerful reasoning model, using extended chain-of-thought to tackle the hardest problems in mathematics, science, and coding. o3 sets new standards on GPQA and competitive maths at the cost of higher latency and price.
OpenAI's coding-focused flagship model with a 1 million token context window and top-tier performance on software engineering tasks. GPT-4.1 was specifically optimised for instruction following and agentic coding workflows.
OpenAI's flagship multimodal model combining text, vision, and audio capabilities. GPT-4o delivers state-of-the-art performance across reasoning, coding, and creative tasks whilst offering faster response times than its predecessors.
OpenAI's lightweight, cost-efficient model that punches well above its weight class. GPT-4o mini makes advanced AI capabilities accessible for high-volume, cost-sensitive applications without sacrificing too much quality.
Anthropic
6 models
Anthropic's most powerful and intelligent model, built for the most demanding tasks where quality outweighs cost. Claude Opus 4 leads on complex multi-step reasoning, graduate-level science, and nuanced long-form writing.
The latest and most capable Sonnet model to date. Claude Sonnet 4.6 brings further gains in mathematical reasoning and instruction following, making it Anthropic's most well-rounded model at the Sonnet price point.
A refined iteration of Claude Sonnet 4 with improved performance on graduate-level reasoning and coding benchmarks. Claude Sonnet 4.5 delivers notably stronger results on GPQA and competitive maths whilst maintaining the same pricing as its predecessor.
Anthropic's fourth-generation Sonnet model, offering a significant leap in reasoning depth and coding accuracy over the 3.x series. Claude Sonnet 4 introduces refined tool use and improved adherence to complex multi-step instructions.
Anthropic's most intelligent model, excelling at complex reasoning and coding tasks. Claude 3.5 Sonnet sets new benchmarks for intelligence whilst maintaining the safety and harmlessness Anthropic is known for.
Anthropic's fastest and most compact model, designed for near-instant responsiveness in demanding applications. Claude 3 Haiku delivers excellent value for tasks requiring speed at scale whilst maintaining Anthropic's commitment to safety.
DeepSeek
2 models
DeepSeek's open-source reasoning model trained with reinforcement learning to rival OpenAI's o1. DeepSeek R1 achieves exceptional scores on mathematics and scientific reasoning benchmarks, making advanced chain-of-thought reasoning accessible to everyone.
DeepSeek's breakthrough open-source model that shocked the AI industry with frontier-level performance at a fraction of the training cost. DeepSeek V3 demonstrates that cutting-edge AI is no longer exclusive to the largest technology companies.
Google DeepMind
3 models
Google DeepMind's most advanced model, with standout performance in mathematics, science, and long-context reasoning. Gemini 2.5 Pro features a 1 million token context window and an experimental 2 million token mode, alongside strong multimodal capabilities.
Google DeepMind's next-generation fast model offering impressive performance at a fraction of the cost. Gemini 2.0 Flash brings multimodal capabilities and a massive context window to real-time applications.
Google DeepMind's highly capable multimodal model with a groundbreaking 1 million token context window. Gemini 1.5 Pro excels at long-document analysis, video understanding, and complex cross-modal tasks.
Meta
3 models
Meta's flagship fourth-generation model using a Mixture-of-Experts architecture for efficient high-quality inference. Llama 4 Maverick delivers frontier-class performance as a fully open-source model with a 1 million token context window.
Meta's largest open-source model, competing directly with proprietary frontier models. Llama 3.1 405B can be self-hosted and fine-tuned, offering unmatched flexibility for organisations with data privacy requirements.
Meta's efficient Llama 4 model optimised for speed and cost. Despite being the lighter of the two Llama 4 releases, Scout achieves strong benchmark results and features an extraordinary 10 million token context window, the largest of any model listed here.