Google Reclaims the Throne: Gemini 3.1 Pro Is Here! Yao Shunyu: Even Better Ones Are Coming

Core Insights
- Google has launched Gemini 3.1 Pro, an upgraded version of its AI model, to tackle complex challenges in science, research, and engineering [1][4][15]
- The new model shows significant improvements in reasoning, achieving a verified score of 77.1% on the ARC-AGI-2 benchmark, more than double the score of its predecessor, Gemini 3 Pro [5][6]

Performance Metrics
- Gemini 3.1 Pro outperforms other models on several benchmarks, including:
  - 44.4% on Humanity's Last Exam for academic reasoning [6]
  - 94.3% on GPQA Diamond for scientific knowledge [8]
  - 68.5% on Terminal-Bench 2.0 for coding tasks [6]
  - 80.6% on SWE-Bench Verified for agentic coding [8]
- On multilingual understanding, the model reached 92.6% on the MMMLU test [8]

Applications and Features
- Gemini 3.1 Pro can visualize complex topics, organize scattered data, and turn creative projects into reality [12][20]
- Notable applications include:
  1. Generating animated SVG images from text prompts [21]
  2. Integrating complex systems, such as a real-time aviation dashboard [22]
  3. Creating interactive designs, like a 3D simulation of a flock of birds [23]
  4. Transforming literary themes into practical code for modern web design [24]

Deployment and Pricing
- The model is being integrated into various consumer and developer products, with a phased rollout starting now [15][26]
- Access channels and pricing:
  - Developer access through Google AI Studio and the Gemini API, with costs based on token usage [17]
  - Enterprise access via Vertex AI and Gemini Enterprise [17]
  - Consumer access through the Gemini app and NotebookLM [17]

Future Plans
- Google plans to further improve Gemini 3.1 Pro for autonomous workflows and will soon open it for broader public use [26]
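
The article notes that developer access through the Gemini API is billed by token usage but gives no rates. As a rough illustration of how token-based billing is typically computed, here is a minimal Python sketch. The `TokenRates` class, the `estimate_cost` function, and the dollar figures in `PLACEHOLDER_RATES` are all hypothetical placeholders and are not Google's published prices; the sketch also assumes separate per-million-token rates for input and output, which the article does not confirm.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenRates:
    """Hypothetical prices in USD per one million tokens (not real Gemini rates)."""
    input_per_million: float
    output_per_million: float


def estimate_cost(input_tokens: int, output_tokens: int, rates: TokenRates) -> float:
    """Estimate the cost of one request under simple per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * rates.input_per_million
        + output_tokens * rates.output_per_million
    )
    return total / 1_000_000


# Placeholder values chosen only to make the arithmetic easy to follow.
PLACEHOLDER_RATES = TokenRates(input_per_million=1.0, output_per_million=4.0)

if __name__ == "__main__":
    cost = estimate_cost(1_000_000, 500_000, PLACEHOLDER_RATES)
    print(f"Estimated cost: ${cost:.2f}")
```

To use this with real figures, replace the placeholder rates with the values from the provider's current pricing page and pass in the token counts reported for each request.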