Just Now: Claude Opus 4.5, the New King of Agents and Coding, Makes a Stunning Debut with Prices Cut by Two-Thirds
机器之心·2025-11-24 23:49

Core Viewpoint - Anthropic has officially released its latest model, Claude Opus 4.5, which it presents as one of the most advanced AI models available today, with significant improvements in programming, agent capabilities, and everyday tasks such as working with spreadsheets and presentations [1][2].
Pricing and Accessibility - Claude Opus 4.5 is available through the Claude app, the API, and major cloud platforms. Pricing is set at $5 per million input tokens and $25 per million output tokens, a two-thirds reduction compared with the previous version, Opus 4.1 [5][6].
Performance Metrics - In benchmark tests, Claude Opus 4.5 achieved state-of-the-art (SOTA) results, surpassing competitors such as GPT-5.1-Codex-Max and Gemini 3 Pro on a range of software engineering tasks [2][12]. - The model scored higher than every human candidate on a challenging take-home exam designed to assess technical skills under time pressure [11].
Enhanced Capabilities - Claude Opus 4.5 improves across multiple domains, including visual reasoning, mathematical reasoning, and problem-solving, and reaches SOTA levels in agentic coding and tool use [11][12][20]. - Its performance on complex coding problems is 10.6% higher than that of the earlier Sonnet 4.5 model [14].
Developer Tools and Features - The Claude developer platform has been updated to support longer-running agents, and the desktop applications now allow multiple concurrent sessions [7][8]. - New features include an "effort" parameter in the API, which lets developers trade off speed, cost, and model capability and can significantly reduce token usage while maintaining performance [30][34].
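To make the pricing figures above concrete, the following is a minimal sketch of the arithmetic, assuming only the quoted rates of $5 per million input tokens and $25 per million output tokens. The helper names `estimate_cost_usd` and `implied_previous_price` are hypothetical and are not part of any Anthropic SDK. The "previous" prices are derived purely from the stated two-thirds reduction and are not quoted in the article.

```python
from decimal import Decimal

# Prices quoted in the article, in USD per one million tokens.
INPUT_PRICE_PER_MTOK = Decimal("5")
OUTPUT_PRICE_PER_MTOK = Decimal("25")
TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_MTOK
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_MTOK
    )
    return total / TOKENS_PER_MTOK


def implied_previous_price(new_price: Decimal) -> Decimal:
    """Price before a two-thirds cut: new = old * (1 - 2/3), so old = new * 3."""
    return new_price * 3


if __name__ == "__main__":
    # Example workload: 200,000 input tokens and 50,000 output tokens.
    print(estimate_cost_usd(200_000, 50_000))             # 2.25
    print(implied_previous_price(INPUT_PRICE_PER_MTOK))   # 15
    print(implied_previous_price(OUTPUT_PRICE_PER_MTOK))  # 75
```

Because output tokens cost five times as much as input tokens at these rates, a workload dominated by long generations is affected far more by output length than by prompt length.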
Safety and Alignment - Claude Opus 4.5 is noted for robust alignment and safety, and is described as one of the models most resistant to prompt injection attacks, in which malicious instructions mislead a model into harmful behavior [36][39]. - The model also shows substantial progress in reducing concerning behaviors, which makes it more reliable across applications [36][39].