"Gemini Frenzy" Sweeps the Globe Again! Google (GOOGL.US) Deep Think's "Hardcore Upgrade" Targets Large-Scale Scientific Research and Engineering

Core Insights
- Google has significantly upgraded the Deep Think mode of its Gemini 3 AI model to tackle complex challenges in modern scientific research and engineering, sparking a new wave of "Gemini AI frenzy" globally [1][7]
- The new Deep Think mode is now available to Google AI Ultra subscribers, and for the first time Google is also offering it through the Gemini API to select researchers, engineers, and large enterprises [1][8]

Performance Metrics
- The updated Deep Think model scored 48.4% on Humanity's Last Exam (HLE), setting a new industry standard for contemporary AI models [5][7]
- It scored 84.6% on the ARC-AGI-2 reasoning benchmark, a result verified by the ARC Prize Foundation, and reached an Elo rating of 3455 on the Codeforces competitive programming platform [5][6]
- The model also demonstrated gold-medal-level performance on the written sections of the 2025 International Physics Olympiad and International Chemistry Olympiad, and scored 50.5% on the CMT-Benchmark [4][6]

Application and Functionality
- Deep Think is designed for practical use, helping researchers interpret complex data and engineers model intricate physical systems through code [2][7]
- The model's capabilities extend beyond mathematics and programming to interdisciplinary research problems that require a combination of physical intuition, structured chemical inference, mathematical formalization, and coding solutions [4][6]
- The upgrade emphasizes structured reasoning and scalable inference, allowing the model to explore multiple hypothesis spaces iteratively and refine its answers continuously through a "generate-validate-revise" loop [8]

Market Positioning
- The updated Deep Think model places Google in direct competition with other AI products such as OpenAI's ChatGPT and Anthropic's Claude, and marks a shift from abstract reasoning to practical applications in large-scale research and engineering workflows [7][8]
- By defining Deep Think as a specialized reasoning mode for scientific, research, and engineering challenges, Google aims to attract developers and institutions with strong performance metrics and clear application scenarios [7][8]
