Latest LLM Models
A curated catalog of recent language models for coding, reasoning, and multimodal development workflows
Mistral Medium 3.5
State-of-the-art general-purpose model focused on long-horizon instruction following, reasoning, and coding at a lower cost tier.
GPT-5.5
OpenAI's latest flagship model for agentic work, advanced coding, research, and professional workflows.
Kimi K2.6
Moonshot AI's latest frontier model with strong long-context reasoning, agentic coding, and multimodal capabilities.
Claude Opus 4.7
Anthropic's most capable generally available model for complex reasoning, agentic coding, long-running tasks, and stronger vision.
GPT-5.4 mini
Smaller GPT-5.4 variant for fast, efficient coding and professional tasks with lower cost and latency.
GPT-5.4 nano
The lightest GPT-5.4 variant, designed for high-volume, low-latency tasks while keeping the GPT-5.4 family behavior.
Mistral Small 4
Compact multimodal model built for general chat, coding, agentic tasks, and complex reasoning.
Grok 4.20 Multi-agent
xAI's multi-agent Grok model for collaborative reasoning, tool use, and advanced long-context workflows.
Grok 4.20
Latest mainstream Grok release for chat, reasoning, and tool-assisted workflows in xAI's platform.
GPT-5.4
OpenAI's frontier model for complex professional work, combining strong reasoning, coding, and large-context tool use.
Gemini 3.1 Pro
Google's flagship Gemini model for complex multimodal tasks, creative work, coding, and advanced reasoning.
Claude Sonnet 4.6
Anthropic's balanced model focused on strong reasoning, coding, and speed for day-to-day production workloads.
Kimi K2.5
Earlier Kimi release focused on improved reasoning and multimodal capabilities before the K2.6 update.
Gemini 3 Flash
Google's fast Gemini model optimized for responsive multimodal workloads and agentic coding at scale.
Mistral Large 3
Mistral AI's flagship open-weight multimodal and multilingual model with strong agentic capabilities and a 256k context window.
Grok 4.1 Fast
Faster Grok variant for lower-latency inference, large-context workloads, and enterprise API usage.
Claude Haiku 4.5
Anthropic's fastest Claude 4.5 model, built for high-throughput tasks where latency and cost matter most.
Llama 4 Maverick
Meta's open-weight multimodal model for general AI workloads, coding, and efficient large-scale deployment.
Llama 4 Scout
Meta's long-context open-weight multimodal model optimized for deep analysis across massive documents and codebases.