Versions side by side — rows that differ across these models are highlighted.
| Current flagship; 256K context, optimized for multi-step agentic workflows, complex RAG with tool use, and structured outputs. $2.50/$10.00 per 1M. |
| Current flagship; cost-efficient ($1.25/$2.50 per 1M), 1M context, strong agentic tool-calling and native real-time X/web access. |
| Current closed-weights flagship; Intelligence Index v4.0 score 56.6 (top Chinese model); lowest reported hallucination rate (22.9%). $2.50/$7.50 per 1M tokens. |