谷歌全线开挂！Gemini 3 Deep Think夺多项推理SOTA，Gemini亚洲新团队也官宣了

Core Insights - Gemini 3's Deep Think mode has officially launched, enhancing reasoning capabilities to tackle complex, multi-step, and innovative problems, including difficult scientific and mathematical questions [2] Group 1: Performance Metrics - In the ARC-AGI benchmark, which tests core capabilities of general intelligence, Gemini 3 Deep Think ranked first with an accuracy of 87.5%, outperforming models like GPT-5 and Claude Opus 4.5 [4] - In the ARC-AGI-2 test, which involves higher-order reasoning tasks, Gemini 3 Deep Think achieved a 45.1% accuracy, 14% higher than the non-Deep Think version of Gemini 3 Pro, which scored 31.1% [6] - Gemini 3 Deep Think also excelled in the HLE and GPQA Diamond tests, indicating significant improvements in abstract reasoning and scientific knowledge inference [8] Group 2: User Feedback and Reception - Users have praised the Deep Think mode for its performance, noting that it successfully solved complex issues that other models struggled with, such as a stack underflow bug [14] - The mode's creative scene reasoning capabilities have been highlighted as unprecedented, receiving high praise from users [16] - However, some users expressed concerns about the practical effectiveness of Gemini 3 and called for optimization of AGI-related features [17] Group 3: Team and Development - Google DeepMind announced the establishment of a new Gemini research team in Singapore, led by Yi Tay, focusing on advanced reasoning and improvements to Gemini models [21] - The team aims to recruit top global talent and collaborate with notable figures in the AI field, enhancing the capabilities of Gemini and its Deep Think mode [27] - The Gemini team was formed during Google's AI restructuring, merging Google Brain and DeepMind to create a comprehensive team for developing competitive foundational models [30] Group 4: New Product Launch - Google recently launched Google Workspace Studio, integrating AI capabilities to automate various office tasks, enhancing productivity for users [31][32] - This new product leverages the advanced reasoning and multi-modal understanding of Gemini 3, allowing users to create AI agents for complex tasks without coding [32]