Latest LLM
Explore the latest Large Language Models available for AI-powered development
GPT-5.2-Codex
Agentic Coding, Large-scale Refactoring, Cybersecurity Defense, Codex CLI/IDE integration, Long-horizon tasks.
GPT-5.2
Unified Reasoning, 400k Context, Agentic Workflows, Native Multimodality, High-Reliability.
Mistral Large 3
A state-of-the-art, open-weight, general-purpose model.
Ministral 3 14B
A powerful model offering best-in-class text generation capabilities.
Ministral 3 8B
A powerful and efficient model offering best-in-class performance for its size.
Ministral 3 3B
A tiny and efficient model offering best-in-class performance on edge devices.
Claude Opus 4.5
Anthropic's most powerful model (4.5 series), excelling in complex tasks, reasoning, and creativity.
Gemini 3 Pro
New generation of Google Gemini, natively multimodal with increased reasoning capabilities.
GPT 5.1
Iterative improvement of GPT-5, offering better reliability and more nuanced responses.
Kimi K2 Thinking
Advanced Moonshot AI model with reinforced 'Thinking' capabilities.
Claude Sonnet 4.5
Performance/cost balance of the 4.5 series, ideal for enterprise and scaling.
Magistral Medium 1.2
Our frontier-class multimodal reasoning model.
Claude Haiku 4.5
Fast, efficient Claude 4.5 series model optimized for low latency and cost.
Mistral Medium 3.1
Our frontier-class multimodal model released for general availability.
GPT-5
OpenAI's next major leap, promising general intelligence closer to human level.
Claude Opus 4.1
Intermediate update to the Claude 4 Opus series.
Codestral
Our cutting-edge language model for coding tasks.
Voxtral Mini
A mini version of our first audio input model.
Voxtral Small
Our first model with audio input capabilities for general use.
GPT oss 20b
Performant mid-sized open-source model (20B), optimized for self-hosting.
GPT oss 120b
Large open-source model (120B) rivaling proprietary models in reasoning.
Devstral Medium 1.0
An enterprise grade text model, that excels at development tasks.
Devstral Small 1.1
An update to our open source model that specializes in development.
Grok 4
Fourth iteration of xAI, deeply integrated with real-time data.
Voxtral Mini Transcribe
An efficient audio input model, fine-tuned for transcription tasks.
Mistral Small 3.2
An update to our previous small model, optimized for efficiency.
Claude 4 Sonnet
The 'workhorse' of series 4, efficient and versatile.
Codestral Embed
Our state-of-the-art semantic model for extracting embeddings from code.
Claude 4 Opus
Flagship model of generation 4, pushing boundaries of context and understanding.
Gemini 2.5 Flash
Ultra-fast and economical model from Google, ideal for high-frequency applications.
Llama 4 Maverick
Flagship, Native Multimodal, MoE (400B), Advanced Reasoning, SOTA, High-Performance.
Mistral Medium 3
Our frontier-class multimodal model released for general availability.
Mistral OCR
Our OCR service powering our Document AI capabilities.
OpenAI o4-mini
Small, performant reasoning model, economical successor.
GPT-4.5 nano
Very compact model for embedded or mobile applications.
OpenAI GPT-4.1
Advanced generalist model, text and code, successor to GPT-4/Turbo.
Llama 4 Scout
Efficiency, Speed, Optimized MoE, Low-Latency, High-Reasoning/Low-Cost, Agile.
GPT-4.1 mini
Miniaturized and optimized version of the GPT-4.1 branch.
Grok-2
Second generation xAI Grok model, oriented towards conversation, code, and real-time web access via X (Twitter).
Mistral Moderation
Our moderation service that enables our safety capabilities.
Claude 3.5 Haiku
Fast and light version of Claude 3.5, optimized for latency and cost.
GitHub Copilot (chat & IDE)
Development assistant based on OpenAI models (including GPT-4o), optimized by GitHub for code, reviews, and context generation.
OpenAI o3
Reasoning model optimized for complex tasks, logic, and problem solving (advanced reasoning).
Perplexity pplx-llama-3.1-sonar-large-online
Large Perplexity model based on Llama 3.1, specialized in online search and cited answers.
Perplexity pplx-llama-3.1-sonar-small-online
Lighter and faster version of Perplexity sonar-online, adapted for real-time usage.
Claude 3.5 Sonnet
Intermediate Claude 3.5 model, great speed/quality balance, strong in reasoning and code.
OpenAI GPT-4o
Real-time multimodal model (text, image, audio) optimized for interactivity and cost.