Imagine with Claude - filings, earnings calls, financial reports, news

Imagine with Claude

Search documents

Founder Park· 2025-09-30 03:46

Core Viewpoint - Anthropic has launched the Claude Sonnet 4.5 model, claiming it to be the best coding model in the world, with a focus duration of over 30 hours for complex multi-step tasks, surpassing OpenAI's GPT-5 Codex [2][9]. Pricing and Cost Efficiency - The pricing for Claude Sonnet 4.5 remains the same as its predecessor, at $3 per million tokens for input and $15 per million tokens for output. Cost savings of up to 90% can be achieved through prompt caching, and batch processing can save 50% [2]. Developer Tools and Integration - Anthropic has introduced the Claude Agent SDK and an experimental feature called "Imagine with Claude" for developers, allowing integration with platforms like Amazon Bedrock and Google Cloud's Vertex AI [3][26]. Performance Metrics - In the SWE-bench Verified evaluation, Claude Sonnet 4.5 achieved industry-leading scores, with a 61.4% score in the OSWorld benchmark, significantly improving from the previous model's 42.2% [10][12]. Enhanced Features - The model includes new features such as a checkpoint function in Claude Code, context editing, and memory tools, enabling it to handle longer tasks and more complex operations [4][24]. Application and Usability - Users can interact with Claude Sonnet 4.5 through the Claude.ai website and mobile applications, with integrated functionalities for code execution and file creation directly within conversations [5][6]. Safety and Alignment - Claude Sonnet 4.5 is noted for its improved alignment and safety features, reducing undesirable behaviors such as deception and flattery, and making significant progress in defending against prompt injection attacks [24][25]. Experimental Features - The "Imagine with Claude" feature allows real-time software generation, showcasing the model's capabilities in adapting to user requests without pre-written code [31][33]. Recommendations - Anthropic recommends all users upgrade to Claude Sonnet 4.5 for enhanced performance across all applications, with updates available for both the Claude Code and developer platform [34].

Artificial Intelligence

Artificial Intelligence

刚刚，Claude Sonnet 4.5重磅发布，编程新王降临

3 6 Ke· 2025-09-30 01:32

Core Insights - Anthropic has officially released Claude Sonnet 4.5, which is defined as the world's strongest code model, showcasing significant breakthroughs in agent construction, computer usage, reasoning, and mathematical capabilities [2][3]. Performance and Benchmarking - Sonnet 4.5 achieved top performance in various authoritative tests, including a 77.2% score in SWE-bench Verified for real software coding capabilities, and a 61.4% score in OSWorld for simulating real computer tasks, up from 42.2% in the previous version [4][10][13]. - The model demonstrated a 100% success rate in high school math competitions and improved performance in graduate-level reasoning and multilingual Q&A [4][10]. New Features and Product Upgrades - The release includes significant updates across the Claude product line, such as the introduction of "Checkpoints" in Claude Code, allowing users to save progress and revert to earlier states [6]. - Claude API has added context editing features and memory tools, enabling agents to run longer and handle more complex tasks [6][34]. Developer Resources - A new core resource, Claude Agent SDK, has been introduced, providing foundational capabilities for building intelligent agents [8][9]. - The SDK is designed to support a wide range of applications beyond coding, facilitating the development of autonomous agents for complex tasks [32]. Safety and Alignment - Sonnet 4.5 is noted for its improved alignment and safety features, significantly reducing harmful behaviors and enhancing defenses against prompt injection attacks [28][31]. - The model is released under the AI Safety Level 3 framework, incorporating various protective measures, including classifiers for sensitive content [31]. Pricing and Access - The pricing for Sonnet 4.5 remains consistent with Sonnet 4, set at $3 per million tokens for input and $15 per million tokens for output [35]. - The model is accessible through multiple channels, including Claude API, Amazon Bedrock, and Google Cloud Vertex AI [37]. Industry Impact - Claude Sonnet 4.5 is positioned as a powerful tool for developers and professionals in fields such as finance, medicine, and research, marking a significant advancement in AI capabilities and safety [40].

AI编程

智能体

Artificial Intelligence

Artificial Intelligence

Claude Sonnet 4.5

Claude Code

Claude API

Claude Sonnet 4.5被炸出来了，依旧最强编程，连续30小时自主运行写代码

量子位· 2025-09-30 00:57

Core Insights - The article discusses the release of Claude Sonnet 4.5, which has shown significant improvements over its predecessor, Claude Sonnet 4, in various performance metrics [2][8]. Performance Improvements - Claude Sonnet 4.5 achieved a score of 82.0% on the SWE-bench, an increase of 1.8 percentage points from Sonnet 4 [2]. - In the OSWorld test, it scored 60.2, nearly a 50% improvement over Sonnet 4 [7]. - The model can autonomously write code for up to 30 hours, producing over 11,000 lines of code, which is a significant increase from the previous model's 7-hour capability [3][5]. Benchmark Comparisons - Claude Sonnet 4.5 outperformed other models in various benchmarks, including: - Agentic coding: 77.2% [10] - Terminal-Bench: 50.0% [10] - High school math (AIME 2025): 100% accuracy with Python and 87% without tools [9][10]. - In specialized fields like finance, healthcare, and law, it showed over 60% win rates against baseline models [11]. Safety and Alignment - The model has undergone safety training to reduce undesirable behaviors such as flattery and deception, with a significant decrease in false positives from 0.15% to 0.02% [12][13]. - Claude Sonnet 4.5 has made notable advancements in defending against immediate injection attacks [12]. Pricing and Accessibility - The pricing for Claude Sonnet 4.5 remains the same as Sonnet 4, at $3 per million input tokens and $15 per million output tokens [24]. New Features and SDK - The Claude Agent SDK has been upgraded to support the development of general autonomous agents, enhancing its capabilities beyond just coding tasks [27]. - A new feature called "Imagine with Claude" allows users to generate software in real-time based on their requirements, facilitating the creation of functional prototypes without existing templates [32].

Artificial Intelligence

Claude Sonnet 4.5

Claude Agent SDK

Imagine with Claude

Artificial Intelligence

Claude Sonnet 4.5

Claude Agent SDK

Imagine with Claude

Claude Sonnet 4.5来了！能连续编程30多小时、1.1万行代码

机器之心· 2025-09-30 00:27

Core Insights - The article discusses the recent advancements in AI models, particularly the release of Claude Sonnet 4.5 by Anthropic, which is positioned as a leading model in various benchmarks and applications [1][4][5]. Model Performance - Claude Sonnet 4.5 achieved significant performance improvements in various benchmarks, including: - 77.2% in Agentic coding [2] - 82.0% in SWE-bench Verified [2] - 61.4% in OSWorld for computer use, up from 42.2% in the previous version [11] - The model shows enhanced capabilities in reasoning and mathematics, with a perfect score of 100% in high school math competitions [12][13]. Developer Tools and Features - Anthropic introduced the Claude Agent SDK, allowing developers to create their own intelligent agents [4][35]. - New features include checkpoint functionality for saving progress, a revamped terminal interface, and native VS Code extensions [8][4]. Safety and Alignment - Claude Sonnet 4.5 is noted for being the most aligned model to human values, with improvements in reducing undesirable behaviors such as flattery and deception [27][5]. - The model is released under AI safety level 3 (ASL-3), incorporating classifiers to detect potentially dangerous inputs and outputs [32]. User Experience and Applications - Early user experiences indicate that Claude Sonnet 4.5 performs exceptionally well in specialized fields such as finance, law, and STEM [13][21]. - The "Imagine with Claude" feature allows real-time software generation without pre-defined functions, showcasing the model's adaptability [36][38].

Artificial Intelligence

Artificial Intelligence