OpenAI Releases New Model to Take On Anthropic: Claude Code Just Took Off, Only to Be Left in the Dust by GPT-5-Codex?
36Kr · 2025-09-16 10:09

Core Insights
- OpenAI has launched a new model, GPT-5-Codex, which is a fine-tuned variant of GPT-5 designed specifically for AI-assisted programming tools, demonstrating improved performance in coding tasks and dynamic thinking time [1][3][6]

Model Features
- GPT-5-Codex features enhanced code review capabilities, allowing it to identify potential critical errors before product release, thus helping developers mitigate risks [3][4]
- The model can dynamically adjust its thinking time based on task complexity, enabling it to work independently for extended periods, completing large refactoring tasks and iterating until successful delivery [6][14]
- It has become the default setting for Codex cloud tasks and code reviews, automatically auditing pull requests (PRs) in GitHub repositories [4][7]

Performance Metrics
- In benchmark tests, GPT-5-Codex outperformed GPT-5 in SWE-bench Verified tasks, which measure coding capabilities and code refactoring performance [8]
- The model significantly reduces token usage for low-load tasks by 93.7% compared to GPT-5, while doubling the reasoning, editing, testing, and iteration time for high-complexity tasks [10][18]

Market Context
- The AI coding tools market is becoming increasingly competitive, with significant investments flowing into companies like Anysphere, which recently raised $900 million, and Anthropic, which secured $13 billion in funding [20][21][22]
- The rapid growth of AI coding tools is prompting discussions about the future of programming jobs, with some suggesting a shift towards architecture design rather than traditional coding [19][20]

User Feedback
- Users have reported that GPT-5-Codex can autonomously run tasks for extended periods and effectively switch between local and web development environments, enhancing productivity [15][16]
- There are concerns about the potential impact on entry-level programming jobs, as AI tools like GPT-5-Codex can operate continuously and at a lower cost than hiring junior developers [18][19]
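
To put the 93.7% token reduction reported under Performance Metrics into concrete terms, the short sketch below works through the arithmetic. The baseline of 10,000 tokens and the function name `tokens_after_reduction` are hypothetical and chosen only for illustration; the article gives no absolute token counts.

```python
def tokens_after_reduction(baseline_tokens: int, reduction_pct: float) -> float:
    """Return the tokens left after cutting baseline_tokens by reduction_pct percent."""
    return baseline_tokens * (1 - reduction_pct / 100)


# Hypothetical baseline: assume GPT-5 spends 10,000 tokens on a simple request.
baseline = 10_000

# Apply the 93.7% reduction reported for low-load tasks.
remaining = tokens_after_reduction(baseline, 93.7)

print(round(remaining))  # 630
```

Under that assumed baseline, a request that cost 10,000 tokens with GPT-5 would cost roughly 630 with GPT-5-Codex, which is about one sixteenth of the original usage.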