谷歌(GOOGL.US)突然发布Gemini 3.1 Pro：核心推理性能直接翻倍

Core Insights - Google has launched its latest foundational model, Gemini 3.1 Pro, which significantly enhances its AI capabilities compared to the previous version, Gemini 3 Pro [1][2]. Performance Metrics - The new model, Gemini 3.1 Pro, has doubled the inference performance compared to Gemini 3 Pro, achieving a score of 77.1% in the ARC-AGI-2 evaluation, up from 31.1% [2]. - In various benchmark tests, Gemini 3.1 Pro shows strong performance, closely approaching the capabilities of Opus 4.6 in coding tasks, with a SWE-Bench verification score of 80.6% compared to Opus 4.6's 80.8% [2][3]. Benchmark Comparisons - In the "Humanity's Last Exam" benchmark, Gemini 3.1 Pro scored 44.4%, outperforming Gemini 3 Pro's 37.5% [3]. - For academic reasoning tasks, Gemini 3.1 Pro achieved 51.4%, while Gemini 3 Pro scored 45.8% [3]. - The model also excelled in the GPQA Diamond Scientific knowledge test with a score of 94.3%, compared to Gemini 3 Pro's 91.9% [3]. User Access and Tools - The new model is available for consumer users through the Gemini application and NotebookLM, with higher usage limits for Google AI Pro and Ultra subscribers [4]. - Enterprise clients can access the model via Vertex AI and Gemini Enterprise for testing [6].