Versions side by side — rows that differ across these models are highlighted.
| gemini-3.1-pro |
| grok-4.3 |
| qwen3.7-max |
| Note | Current flagship; 2M-token context window (largest production context of any model family), strong coding/reasoning/multimodal. GA February 2026. | Current flagship; cost-efficient ($1.25/$2.50 per 1M), 1M context, strong agentic tool-calling and native real-time X/web access. | Current closed-weights flagship; Intelligence Index v4.0 score 56.6 (top Chinese model); lowest reported hallucination rate (22.9%). $2.50/$7.50 per 1M tokens. |
|---|