GPT-5.2 “Release Imminent”: Microsoft CEO Says “Next-Generation” Agentic AI Model Will Be Unveiled Friday
Hua Er Jie Jian Wen·2025-12-11 06:07

| Benchmark | Description | GPT-5.2 | Gemini 3 Pro | Gemini 2.5 Pro | Claude Sonnet 4.5 |
| --- | --- | --- | --- | --- | --- |
| Humanity's Last Exam | Academic reasoning | 67.4% | 37.5% | 21.6% | 13.7% |
| ARC-AGI-2 | Visual reasoning puzzles | 62.2% | 31.1% | 4.9% | 13.6% |
| GPQA Diamond | Scientific knowledge | 95.8% | 91.9% | 86.4% | 83.4% |
| AIME 2025 (No tools) | Mathematics | 100% | 95.0% | 88.0% | 87.0% |
| AIME 2025 (With code) | | 100% | 100% | - | 100% |
| MathArena Apex | Challenging Math Con ...