Core Insights - OpenAI has released GPT-5.2-Codex, the most advanced coding model designed for complex software engineering tasks, enhancing instruction-following and long-context understanding capabilities [1][3] Group 1: Model Enhancements - GPT-5.2-Codex improves upon GPT-5.2 with better instruction adherence and long-term context comprehension, particularly excelling in large code changes like refactoring and migration [3] - The model shows significant improvements in token efficiency for coding tasks, especially at medium and high reasoning levels, becoming a primary tool for the Codex team [3] - Enhanced capabilities in network security have been noted, with GPT-5.2-Codex outperforming all previous OpenAI models in this area [6][7] Group 2: Performance Metrics - GPT-5.2-Codex achieved state-of-the-art performance in SWE-Bench Pro and Terminal-Bench 2.0 benchmarks, which assess AI agents in real terminal environments [8][10] - The model can efficiently handle large codebases and maintain context over long sessions, enabling reliable completion of complex tasks like large-scale refactoring and feature building [8] Group 3: Security Applications - A security researcher utilized GPT-5.1-Codex-Max and Codex CLI to discover a vulnerability in React, demonstrating the model's application in real-world vulnerability research [6][21] - The process involved using Codex to set up a local testing environment and analyze potential attack surfaces, leading to the discovery of previously unknown vulnerabilities [22][25] Group 4: Deployment and Access - GPT-5.2-Codex is currently available to paid ChatGPT users and will be accessible to API users in the coming weeks, with OpenAI piloting access for invited users and professionals focused on defensive cybersecurity [7] - OpenAI is planning to ensure that each new model meets high cybersecurity capability standards, emphasizing responsible deployment alongside enhanced security measures [18][25]
OpenAI最强代码模型GPT-5.2-Codex上线
机器之心·2025-12-19 00:21