GPT-5.2 Is Here: The First "Expert-Level" AI Gets Its Revenge, and Overworked Office Workers Are Finally Saved
36Ke · 2025-12-11 23:58

Core Insights
- OpenAI has launched GPT-5.2, positioned as its most powerful general-purpose AI model and designed for complex knowledge work [1][4].

Model Overview
- Three versions of GPT-5.2 have been released: GPT-5.2 Instant, GPT-5.2 Thinking, and GPT-5.2 Pro [2].
- GPT-5.2 improves significantly on its predecessor, GPT-5.1, in general intelligence, long-text comprehension, tool use, and visual capabilities [6].

Performance Metrics
- Reported benchmark results for GPT-5.2:
  - SWE-Bench Pro: 55.6% accuracy, 4.8 percentage points higher than GPT-5.1 [7].
  - ARC-AGI-2: 52.9% accuracy, outperforming all competitors [7].
  - GDPval: 70.9% of tasks completed successfully, surpassing human industry experts [11][27].
- On investment banking tasks, the score rose from 59.1% to 68.4%, a gain of 9.3 percentage points [33].

Context and Knowledge Updates
- GPT-5.2 has a context window of 400,000 tokens and a maximum output length of 128,000 tokens, allowing it to process very long texts [19].
- The knowledge cutoff has been moved to August 31, 2025 [19].

Cost Implications
- GPT-5.2 is priced 40% higher than GPT-5.1, reflecting the enhanced capabilities and computational costs of the new model [19][20].

Competitive Landscape
- The release comes amid competition with Google's Gemini 3, although OpenAI executives have stated that the launch was not a direct response to it [21].
- GPT-5.2 is marketed as the best model for professional knowledge work, capable of outperforming human experts in various tasks [25][29].
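
The 9.3 figure quoted for investment banking tasks is the difference between two percentage scores (59.1% and 68.4%), which is not the same as a 9.3% relative improvement. The short sketch below works through both readings using only the two scores from the summary; the function names are illustrative and not taken from any OpenAI material.

```python
def point_change(old_pct: float, new_pct: float) -> float:
    """Absolute change between two percentage scores, in percentage points."""
    return round(new_pct - old_pct, 1)


def relative_change(old_pct: float, new_pct: float) -> float:
    """Change expressed as a percent of the old score."""
    return round((new_pct - old_pct) / old_pct * 100, 1)


# Investment banking scores quoted in the summary: GPT-5.1 vs. GPT-5.2.
old_score, new_score = 59.1, 68.4

print(point_change(old_score, new_score))     # 9.3 (percentage points)
print(relative_change(old_score, new_score))  # 15.7 (percent, relative to 59.1)
```

Read as a relative improvement, the same jump is roughly 15.7%, so the headline "9.3" is best understood as percentage points.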