PromptsLabPromptsLab
Coming Soon

Prompt Engineering Course

Get Early AccessLearn More

Models & APIs

Frontier models, reasoning models, and API providers.

Frontier Models (2025–2026)

ModelContextPriceKey Feature
GPT-5.2400K400KGeneral intelligence, 100% AIME 2025
Claude Opus 4.61M (beta)1M (beta)Coding, agentic tasks, extended thinking
Gemini 3 Pro1M1M#1 LMArena (~1500 Elo), multimodal
Grok 4.12M2M#2 LMArena (1483 Elo), low hallucination
Mistral Large 3256K256KBest open-weight (675B MoE/41B active), Apache 2.0
DeepSeek-V3.2128K128KBest value (671B MoE/37B active), MIT license
Llama 4 Maverick1M1MBeats GPT-4o (400B MoE/17B active), open-weight

Reasoning Models

ModelKey Feature
OpenAI o3 / o3-pro87.7% GPQA Diamond. Native tool use.
OpenAI o4-miniBest AIME at its cost class with visual reasoning.
DeepSeek-R1 / R1-0528Open-weight, RL-trained. 87.5% on AIME 2025. MIT license.
QwQ (Qwen with Questions)32B reasoning model. Apache 2.0. Comparable to R1.
Gemini 2.5 Pro/Flash (Thinking)Built-in reasoning with configurable thinking budget.
Claude Extended ThinkingHybrid mode with visible chain-of-thought and tool use.
Phi-4 Reasoning / Plus14B reasoning models rivaling much larger models. Open-weight.
GPT-OSS-120BOpenAI's open-weight with CoT. Near-parity with o4-mini. Apache 2.0.

Notable Open-Source Models

ModelKey Feature
Qwen3-235B-A22BAlibaba
Gemma 3Google
OLMo 2/3Allen AI
SmolLM3-3BHugging Face
Kimi K2Moonshot AI
Llama 4 ScoutMeta

Code-Specialized Models

ModelKey Feature
Qwen3-Coder (480B-A35B)69.6% SWE-bench — milestone for open-source coding. 256K context. Apache 2.0.
Devstral 2 (123B)72.2% SWE-bench Verified. 7x more cost-efficient than Claude Sonnet.
Codestral 25.01Mistral's code model. 80+ languages. Fill-in-the-Middle support.
DeepSeek-Coder-V2236B MoE / 21B active. 338 programming languages.
Qwen 2.5-Coder7B/32B. 92 programming languages. 88.4% HumanEval. Apache 2.0.

Foundational Models (Historical Reference)

ModelContextSignificance
GLM-130BTsinghuaOpen bilingual English/Chinese LLM (2023)
Falcon 180BTIILarge open generative model (2023)
Mixtral 8x7BMistral AIPioneered MoE architecture for open models (2023)
GPT-NeoX-20BEleutherAIEarly open autoregressive LLM
GPT-J-6BEleutherAIEarly open causal language model