Google's Gemini 3 Launches Overnight Worldwide, Trouncing GPT-5.1; Altman Offers Rare Congratulations

Core Insights
- Google has launched its new flagship AI model, Gemini 3 Pro, touted as the "strongest reasoning + multimodal + vibe coding" AI to date. It outperforms competitors such as OpenAI's GPT-5.1 in benchmark tests [1][3][9]

Performance Highlights
- Gemini 3 Pro improves significantly on its predecessor, Gemini 2.5 Pro, and beats GPT-5.1 on a range of benchmarks, including:
  - Humanity's Last Exam (HLE): 45.8% (highest score) with search and code execution; 37.5% without tools [4][5]
  - GPQA Diamond: 91.9% [4][17]
  - AIME 2025 (mathematics): 95.0% [4][18]
  - Vending-Bench 2: mean net worth of $5,478.16 [4][18]

Multimodal Capabilities
- The model scores 81.0% on MMMU-Pro and 87.6% on Video-MMMU, showing that it can process and reason across different types of data [19][22]
- Gemini 3 can interpret complex scientific concepts and generate high-fidelity visualization code for them [22][24]

Vibe Coding
- Gemini 3 Pro has advanced vibe coding capabilities: developers can create interactive applications from simple prompts, which shortens the development process [14][31]
- The model scored 1487 Elo in the WebDev Arena, indicating strong performance on web development tasks [31][32]

Deep Think Mode
- The new Gemini 3 Deep Think mode posts even stronger results on the hardest benchmarks, including 41% on HLE (without tools) and 93.8% on GPQA Diamond [25][28]
- This mode improves the model's ability to tackle complex problems and shows its potential for advanced reasoning [25][28]

Developer Integration
- Gemini 3 is integrated into several platforms, including Google AI Studio and Google Antigravity, so developers can use it to build sophisticated applications [36][42]
- The model was trained on Google's TPUs, reinforcing Google's competitive position in the AI landscape [54]
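
To make the developer integration described above a little more concrete, here is a minimal Python sketch of sending a single text prompt to the model over the public Gemini REST API (`generateContent`). It is an illustration under stated assumptions, not something taken from the article: the model id `gemini-3-pro-preview`, the helper names `build_request` and `generate`, and the sample prompt are all hypothetical choices, and the current model name should be checked in Google AI Studio.

```python
import json
import urllib.request

# Assumption: illustrative model id; confirm the current name in Google AI Studio.
MODEL_ID = "gemini-3-pro-preview"
API_ROOT = "https://generativelanguage.googleapis.com/v1beta"


def build_request(prompt: str, api_key: str, model: str = MODEL_ID) -> urllib.request.Request:
    """Build (but do not send) a generateContent request for one text prompt."""
    body = {"contents": [{"parts": [{"text": prompt}]}]}
    return urllib.request.Request(
        url=f"{API_ROOT}/models/{model}:generateContent",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json", "x-goog-api-key": api_key},
        method="POST",
    )


def generate(prompt: str, api_key: str) -> str:
    """Send the request and return the text of the first candidate."""
    with urllib.request.urlopen(build_request(prompt, api_key)) as resp:
        payload = json.load(resp)
    return payload["candidates"][0]["content"]["parts"][0]["text"]
```

With a valid API key, a call such as `generate("Build a single-page HTML tip calculator.", api_key)` would return the model's reply as a string. Building the request separately from sending it keeps the payload easy to inspect without touching the network.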