Workflow
Claude 4系列模型(Claude Opus 4
icon
Search documents
“全球最强编程模型”来了!Anthropic发布Claude 4,连干七小时性能稳定
硬AI· 2025-05-23 15:03
Core Viewpoint - Anthropic's release of the Claude 4 series models marks a new era in AI capabilities, particularly in programming, potentially reshaping the software development industry landscape [4][17]. Group 1: Model Capabilities - Claude Opus 4 is touted as the "best programming model globally," capable of maintaining stable performance over long tasks requiring focus and effort, verified by Rakuten's 7-hour continuous operation [3][8]. - Claude Sonnet 4 shows a significant accuracy improvement, achieving 72.7% in the SWE-bench test compared to Sonnet 3.7's 62.3% [5][6]. - Both models utilize a hybrid design, allowing for immediate responses and deeper reasoning, enhancing their utility in complex coding and problem-solving scenarios [5][9]. Group 2: Extended Functionality - The new models introduce "extended thinking and tool usage," enabling Claude to utilize web searches and other tools during reasoning, improving response accuracy [11]. - Opus 4 significantly enhances memory capabilities, allowing it to create and maintain "memory files" when granted local file access, improving long-term task awareness and coherence [11][12]. Group 3: Product Launch and Integration - Claude Code has officially launched, receiving positive feedback during testing, and integrates seamlessly with platforms like GitHub Actions, VS Code, and JetBrains [12][13]. - The pricing structure remains consistent with previous models, with Opus 4 charging $15 and $75 per million tokens for input and output, respectively, and Sonnet 4 charging $3 and $15 [6]. Group 4: Competitive Landscape - The release of Claude 4 series intensifies competition among AI giants, with recent announcements from Microsoft, Google, and OpenAI highlighting the race for leading AI models [15]. - Investors are encouraged to reassess the competitive landscape, particularly Anthropic's position relative to OpenAI and Google, as the capabilities of the Claude 4 series may provide opportunities for increased market share [17].
速递|Anthropic推出Claude 4AI模型,高端模型Opus 4持续7小时输出不宕机,抢占AI编程入口
Z Potentials· 2025-05-23 03:33
图片来源: Anthropic 在周四的首届开发者大会上, Anthropic 推出了两款新的人工智能模型,这家初创公司声称它们至少 在流行基准测试中的表现属于行业最佳。 据 Anthropic 公司介绍, Claude 4 系列模型中的新成员 Claude Opus 4 和 Claude Sonnet 4 能够分析 大型数据集、执行长期任务并采取复杂行动。该公司表示 ,这两款模型针对编程任务进行了优化, 特别适合编写和编辑代码。 付费用户和免费聊天机器人应用用户均可使用 Sonnet 4 ,但仅付费用户能访问 Opus 4 。通过亚马逊 Bedrock 平台和谷歌 Vertex AI 提供的 Anthropic API 服务, Opus 4 定价为每百万 token 15/75 美元 (输入 / 输出), Sonnet 4 则为每百万 token 3/15 美元(输入 / 输出)。 Token 是 AI 模型处理的基础数据单元。 100 万 token 约等于 75 万个单词——比《战争与和平》全文 字数还多出约 16.3 万词。 Anthropic 推出 Claude 4 系列模型之际,该公司正寻求大幅提 ...
Claude 4发布:新一代最强编程AI?
Hu Xiu· 2025-05-23 00:30
Core Insights - Anthropic has officially launched the Claude 4 series models: Claude Opus 4 and Claude Sonnet 4, emphasizing their practical capabilities over theoretical discussions [2][3] - Opus 4 is claimed to be the strongest programming model globally, excelling in complex and long-duration tasks, while Sonnet 4 enhances programming and reasoning abilities for better user instruction responses [4][6] Performance Metrics - Opus 4 achieved a score of 72.5% on the SWE-bench programming benchmark and 43.2% on the Terminal-bench, outperforming competitors [6][19] - Sonnet 4 scored 72.7% on SWE-bench, showing significant improvements over its predecessor Sonnet 3.7, which scored 62.3% [15][19] New Features and Capabilities - Claude 4 models can utilize tools like web searches to enhance reasoning and response quality, and they can maintain context through memory capabilities [7][23] - Claude Code has been officially released, supporting integration with GitHub Actions, VS Code, and JetBrains, allowing developers to streamline their workflows [41][43] User Experience and Applications - Early tests with Opus 4 showed high accuracy in multi-file projects, and it successfully completed a complex open-source refactoring task over 7 hours [9][11] - Sonnet 4 is positioned as a more suitable option for most developers, focusing on clarity and structured code output [14][17] Market Positioning - The models are designed to cater to different user needs: Opus 4 targets extreme performance and research breakthroughs, while Sonnet 4 focuses on mainstream application and engineering efficiency [39][40] - Pricing remains consistent with previous models, with Opus 4 priced at $15 per million tokens for input and $75 for output, and Sonnet 4 at $3 and $15 respectively [38] Future Outlook - The introduction of Claude Code and the capabilities of Claude 4 models signal a shift in how programming tasks can be automated, potentially transforming the software development landscape [59][104] - The models are expected to facilitate a new era of low-cost, on-demand software creation, altering the roles of developers and businesses in the industry [105]