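Google's Gemini 3.1 Pro Takes the Crown: Hand-Builds a Windows 11 Operating System in One Go, Creates a SimCity-Style App, and Delivers Stunning SVG Results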

Core Insights
- Google has officially launched its new flagship model, Gemini 3.1 Pro, which outperforms Gemini 3 Pro, Claude Opus 4.6, and GPT-5.2 in 12 benchmark tests, taking the top position [1][29].

Benchmark Performance
- On the "Humanity's Last Exam" academic reasoning test, Gemini 3.1 Pro scored 44.4%, surpassing Gemini 3 Pro's 37.5% [2].
- On the ARC-AGI-2 abstract reasoning puzzles, the model reached 77.1%, double Gemini 3 Pro's score [2][29].
- On the GPQA Diamond scientific knowledge test, Gemini 3.1 Pro scored 94.3%, indicating strong factual accuracy [2].

Model Enhancements
- The improvements in Gemini 3.1 Pro focus on complex task processing, including advanced reasoning, multimodal understanding, and project generation [10][29].
- The model is significantly better at generating detailed SVG animations and interactive designs, which demonstrates its strength in creative programming and interactive design [21][23].

User Access and Pricing
- Starting today, Google AI Pro and Ultra subscribers can access Gemini 3.1 Pro through various platforms, while free users are limited to two questions [10].
- API pricing for Gemini 3.1 Pro is tiered: input costs $2.00 per million tokens for prompts under 200,000 tokens and $4.00 per million tokens for prompts that exceed this limit [10][11].

Industry Trends
- The AI industry is shifting from comparisons of general capability to the execution of complex real-world tasks, with companies focusing on reasoning, engineering, and multimodal understanding [33].
- Google's rapid recent releases, including Gemini 3 Deep Think and Gemini 3.1 Pro, underline the importance of models that can address complex real-world problems [33].
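
To make the tiered input pricing described under User Access and Pricing concrete, here is a minimal Python sketch of the arithmetic. It is an illustration under stated assumptions, not Google's billing logic: the function name `input_cost_usd` and the constants are hypothetical, a prompt of exactly 200,000 tokens is assumed to fall in the lower tier, and a prompt over the limit is assumed to be billed entirely at the higher rate. Output-token pricing is not covered because the figures above only give input prices.

```python
from decimal import Decimal

# Figures taken from the article; the names are hypothetical.
TIER_THRESHOLD_TOKENS = 200_000
LOW_RATE_PER_MILLION = Decimal("2.00")   # USD per million input tokens
HIGH_RATE_PER_MILLION = Decimal("4.00")  # USD per million input tokens


def input_cost_usd(prompt_tokens: int) -> Decimal:
    """Estimate the input cost of one prompt under the tiered pricing.

    Assumptions (not confirmed by the article):
    - exactly 200,000 tokens is billed at the lower rate;
    - once the limit is exceeded, the whole prompt uses the higher rate.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    if prompt_tokens <= TIER_THRESHOLD_TOKENS:
        rate = LOW_RATE_PER_MILLION
    else:
        rate = HIGH_RATE_PER_MILLION
    # Scale the per-million rate down to the actual token count.
    return rate * prompt_tokens / Decimal(1_000_000)
```

Under these assumptions, a 100,000-token prompt costs $0.20 of input and a 300,000-token prompt costs $1.20. `Decimal` is used rather than floats so that currency amounts do not pick up binary rounding error.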