OpenAI深夜双王炸，GPT-5.1 Pro紧急发布，降维打击Gemini 3

Core Insights - OpenAI has launched GPT-5.1 Pro and GPT-5.1-Codex-Max, enhancing emotional and intellectual capabilities in AI models [2][8] - The new models are designed for high-intensity development tasks, capable of working autonomously for over 24 hours and processing millions of tokens [5][23] - GPT-5.1-Codex-Max features a new compression mechanism, allowing it to handle longer contexts and complex tasks more efficiently [6][22] Group 1: Model Features - GPT-5.1 Pro emphasizes both emotional and intellectual strengths, pushing these advantages to a higher level [2] - GPT-5.1-Codex-Max is specifically trained for software, engineering, mathematics, and research tasks, resulting in improved performance and reduced token usage [4][10] - The model achieved a score of 77.9% on the SWE-bench Verified evaluation, outperforming previous models [12][13] Group 2: Performance and Efficiency - GPT-5.1-Codex-Max reduces token usage by approximately 30% during medium reasoning tasks, leading to lower operational costs for developers [14] - It can autonomously manage tasks over extended periods, maintaining coherence and efficiency through its compression mechanism [22][23] - The model has shown significant improvements in programming efficiency, with a reported 70% increase in Pull Request submissions among OpenAI engineers [25] Group 3: User Experience and Comparisons - Early testers of GPT-5.1 Pro have noted its superior clarity and insight compared to GPT-5.0, making complex topics more understandable [34] - While GPT-5.1 Pro excels in reasoning and deep thinking tasks, it is slower than competitors like Gemini 3, which may be more suitable for everyday tasks [35][40] - The interface limitations of GPT-5.1 Pro restrict its integration into IDEs and other toolchains, similar to its predecessor [40]