Claude Agent SDK
Search documents
How the New York Stock Exchange deploys Anthropic's Claude
American Banker· 2026-02-25 17:49
Core Insights - The New York Stock Exchange (NYSE) is rapidly advancing its use of agentic AI, particularly through collaboration with Anthropic's Claude generative AI, marking a significant shift in operational capabilities [1][2][3] AI Implementation and Development - The NYSE has transitioned from using AI primarily for code completion to employing it as a collaborative tool capable of complex reasoning and multistep tasks, enhancing its operational efficiency [2] - The exchange is reengineering its development processes by utilizing Claude for coding, testing, and documentation, moving towards a model that integrates multiple AI solutions and platforms [6][7] Industry Trends - Major financial institutions, including JPMorgan Chase and Goldman Sachs, are similarly embedding AI into core applications, indicating a broader trend in the financial sector towards AI integration [3][4] - The shift from AI as a point solution to a more embedded role in digital banking, payments, and fraud detection is becoming increasingly common among early adopters in the industry [4] Governance and Accountability - The NYSE processes over a trillion messages on peak trading days, necessitating a focus on system resilience and accountability in AI applications [9][10] - The introduction of probabilistic AI requires continuous monitoring of outcomes and behaviors, emphasizing the need for strong governance and oversight [10][12] Data and System Thinking - Data remains a critical component in AI deployment, with a focus on ensuring quality inputs to achieve desirable outputs [11][13] - Organizations are encouraged to adopt a systems thinking approach, considering the entire ecosystem of AI applications rather than isolated components [12]
美股异动丨财捷盘前涨超6%,与Anthropic达成多年合作伙伴关系
Ge Long Hui· 2026-02-24 13:51
Core Viewpoint - Intuit (INTU.US) shares rose over 6% in pre-market trading, reaching $381.3, following the announcement of a multi-year partnership with Anthropic aimed at providing customizable AI agents for mid-market enterprises [1] Group 1 - The partnership will enable businesses to utilize Anthropic's Claude Agent SDK on the Intuit platform to build and customize secure and accurate AI agents [1] - The collaboration focuses on enhancing compliance workflows for enterprises [1]
懂了很多道理,AI 依然要发疯
3 6 Ke· 2026-02-09 06:50
Core Insights - The article discusses the current challenges faced by AI agents, particularly in handling long-term tasks without reliable support systems [1] - It identifies two main issues: the "contextual black hole" where models struggle to understand complex contexts, and the "collapse of long-term planning" where models become confused over extended tasks [1][3] - A significant paper by Anthropic titled "The Hot Mess of AI" provides empirical evidence supporting these claims, indicating that as models grow stronger, they do not necessarily become less chaotic [3][6] Group 1: Model Limitations - The paper highlights the illusion of capability in AI models, suggesting that while they may appear to improve, their performance on complex tasks does not follow a linear trajectory [3][6] - The research introduces the concept of incoherence, measuring the proportion of errors caused by variance versus bias, revealing that longer tasks lead to increased incoherence [13][14] - It concludes that larger models tend to exhibit higher incoherence on difficult tasks, contradicting the assumption that larger models are more stable [15][17] Group 2: Theoretical Framework - The authors utilize the bias-variance decomposition to analyze model performance, quantifying the distance between average predictions and actual results [8][9] - They argue that the nature of autoregressive models inherently limits their ability to function as optimizers, which is essential for achieving AGI [20][23] - The paper suggests that the complexity of tasks grows exponentially, making it difficult for models to keep pace with the demands of long-term planning [21][24] Group 3: Potential Solutions - The article proposes several avenues for improvement, including ensembling methods to reduce incoherence by averaging multiple outputs [33][34] - It also discusses the importance of structured reasoning processes to mitigate variance during complex tasks [36] - Lastly, it suggests exploring new paradigms beyond token-level autoregression, such as large concept models that focus on high-level goals rather than discrete tokens [39][40]
X @Anthropic
Anthropic· 2026-02-03 19:38
Apple's Xcode now has direct integration with the Claude Agent SDK, giving developers the full functionality of Claude Code for building on Apple platforms, from iPhone to Mac to Apple Vision Pro.Read more: https://t.co/fyZ10bhkN3 ...
怎么做 Long-running Agents,Cursor、Anthropic 给了两种截然不同的思路
Founder Park· 2026-01-20 15:00
Core Viewpoint - The article discusses advancements in long-running AI agents, focusing on two approaches: Cursor's multi-agent parallel collaboration and Anthropic's memory continuity for single agents [4][27]. Group 1: Cursor's Approach - Cursor aims to execute complex, long-term tasks by running multiple agents in parallel, similar to human team collaboration [4][8]. - The initial attempts at coordination faced challenges, including inefficiencies due to locking mechanisms and a lack of accountability among agents [10][12]. - The introduction of role differentiation among agents—Planners, Workers, and Judges—improved project coordination and scalability [15][21]. - Successful experiments included building a web browser from scratch, generating over 1 million lines of code, and migrating a large codebase, demonstrating the effectiveness of the new structure [17][19]. Group 2: Anthropic's Approach - Anthropic focuses on maintaining memory continuity for agents across multiple work sessions, addressing the limitations of context windows [27][28]. - The dual-agent system consists of an Initializer Agent to set up the project environment and a Coding Agent to execute tasks incrementally [34][39]. - This method emphasizes structured task management and thorough testing, significantly improving the accuracy of functionality verification [42][46]. - Open questions remain regarding the potential for specialized agents in various domains beyond web development [53].
Claude Agent SDK [Full Workshop] — Thariq Shihipar, Anthropic
AI Engineer· 2026-01-05 17:00
[music] Okay. Yeah, thanks for joining me. I uh I'm still on the West Coast time, so it feels like I'm doing this at like 7:00 a.m. Uh so yeah, but um glad to talk to you about the Claude agent SDK. So um yeah, I think like this is going to be like a rough agenda, but we're going to talk about we're going to talk about like what is the claud agent SDK? Why use it? There's so many other agent frameworks. What is an agent? What is an agent framework? um how do you design an agent uh using the agent SDK or or ...
今年让AI可靠地抢走你的活儿?Anthropic 首席产品官曝新年目标:大模型不拼 “更聪明”,终结“公司上AI,员工更累”尴尬
AI前线· 2026-01-03 05:33
Core Insights - The article discusses the significant growth of Anthropic's Claude Code, which has surpassed OpenAI's market share, reaching over 52% in recent months, driven by its superior coding capabilities [2] - The conversation highlights the evolution of AI in programming, emphasizing the shift towards "vibe coding" and the emergence of AI as collaborative agents in software development [3][4] Market Dynamics - Anthropic's model share was around 25% from 2024 to early 2025, but it experienced a "hockey stick" growth in the last 3 to 6 months, indicating a rapid increase in adoption [2] - The article notes that the focus on coding capabilities has made Anthropic a preferred tool among developers, leading to its penetration into various use cases beyond traditional programming [2][11] Product Development - Mike Krieger, Anthropic's Chief Product Officer, outlines the future direction of "vibe coding," emphasizing the need for AI to enhance reliability and interaction, rather than just improving model intelligence [3][4] - The development of Claude Code was a strategic decision to allow the model to operate autonomously over longer periods, reflecting a belief in the model's potential for future capabilities [8][9] User Engagement - The article mentions that the initial release of Claude Code exceeded expectations, with users applying it in diverse fields such as bioinformatics and data science, indicating its versatility beyond coding [11] - There is a recognition that many users, including non-technical individuals, are beginning to explore AI tools for various applications, suggesting a shift in user demographics [12][19] Future Vision - Looking ahead to 2026, the focus will be on creating collaborative AI agents that can handle more complex tasks and integrate seamlessly into existing workflows, moving beyond simple tool usage [20][24] - The article emphasizes the importance of ensuring high-quality outputs from AI to genuinely enhance productivity, rather than adding to the workload [19][22]
Anthropic CPO:2026 企业 AI 要真干活,先跨过这道坎
3 6 Ke· 2025-12-29 03:46
Core Insights - The main issue with AI deployment in enterprises is not the technology itself but the organizational readiness to effectively utilize AI capabilities [2][9][23] Group 1: AI Capabilities and Deployment - AI models are increasingly powerful, yet many businesses struggle to leverage them effectively, often encountering obstacles during task execution [1][9] - Anthropic's Claude is designed not just as a chatbot but as a capable colleague that can take on entire workflows, demonstrating a shift in how AI is perceived and utilized [5][6] - The annual revenue for Claude Code exceeded $1 billion within six months of its launch, indicating strong market demand for AI tools that can perform tasks rather than just generate responses [5] Group 2: Organizational Challenges - The primary barriers to effective AI implementation are organizational issues, including unclear task definitions and inadequate data management [10][15] - Many companies lack clarity on where data is stored and how to label it, making it difficult for AI to access and utilize necessary information [11][12] - Access permissions and system navigation are often complex, hindering AI's ability to function effectively within existing workflows [13][14] Group 3: Task Management and Responsibility - Effective AI deployment requires clear task assignments, similar to how one would manage a new employee, ensuring that AI understands its role and the expectations [18][20] - Organizations must establish clear boundaries of responsibility for AI, allowing it to take ownership of specific tasks while ensuring accountability for outcomes [20][22] - Successful examples, such as GitHub's collaboration with Claude, illustrate the importance of defined processes and responsibilities in AI task execution [19][22] Group 4: Future Considerations - Companies must prepare themselves by addressing data organization, access permissions, task clarity, and responsibility delineation to fully leverage AI capabilities by 2026 [23][25]
Agent元年复盘:架构之争已经结束!?
自动驾驶之心· 2025-12-24 00:58
Core Insights - The article discusses the evolution of "Agent" technology, highlighting the emergence of "Deep Agent" and "Claude Agent SDK" as leading architectures in the field [3][57]. - It emphasizes that 2025 marks a pivotal year for agents, where technology readiness is evident, but full replacement of traditional methods has not yet been achieved [5][6]. Technical Perspectives - The architecture of agents has converged towards a general form represented by Claude Code and Deep Agent, focusing on their capabilities beyond programming [3][4]. - The article notes that the core capabilities of Claude Code, such as planning and context management, are applicable to various tasks beyond coding, leading to its rebranding as Claude Agent SDK [9]. Industry Recognition - The article asserts that while agent products have generated significant revenue in sectors like recruitment and marketing, the impact is less visible domestically due to a concentration of business in overseas markets [10]. - It identifies a shift in focus from technical architecture to business restructuring, emphasizing the need for industry professionals to adapt traditional workflows to be agent-friendly [10]. Definition and Characteristics of Deep Agent - A "Deep Agent" is characterized by its industry-specific knowledge and long-running capabilities, ensuring stability and reliability in task execution [11][12]. - The article outlines that a Deep Agent must demonstrate high levels of specialization and the ability to perform complex, multi-step tasks without failure [12]. Skills and Context Management - The introduction of "Agent Skills" allows for a more dynamic and efficient way to integrate business knowledge into agents, enhancing their capabilities [22][30]. - The concept of progressive disclosure is highlighted as a key design principle, enabling agents to load information as needed rather than all at once, improving context management [32][34]. Planning and Task Management - Planning is identified as a crucial component for agents to execute long-term tasks effectively, with the ability to decompose tasks into manageable sub-tasks [47][50]. - The article discusses the importance of context isolation and parallel execution in sub-agents, which enhances efficiency and reduces context confusion [50]. System Prompt and File Management - The article emphasizes the significance of detailed system prompts in guiding agent behavior and ensuring effective task execution [52]. - A well-structured file system is proposed as a means to manage context and facilitate collaboration among agents, allowing for long-term memory and efficient information retrieval [53][56]. Conclusion on Agent Technology - The article concludes that the agent technology landscape has reached a point of convergence, with established architectures like Claude Agent SDK and Deep Agent leading the way [57][58]. - It suggests that the future of agent technology will involve further specialization and adaptation to specific business needs, leveraging the strengths of existing frameworks [69][71].
Claude Code 豪气收购一家0收入前端公司:押注一位高中辍学创始人
AI前线· 2025-12-03 04:29
Core Insights - Anthropic announced the acquisition of Bun, a developer tool startup, marking a significant step into the developer tools sector [2] - The acquisition aims to enhance the performance and stability of Claude Code and other AI coding products, leveraging Bun's infrastructure [2][4] - Bun has become an essential tool for AI programming tools, addressing efficiency issues in agent distribution and execution [3] Summary by Sections Acquisition Details - The financial terms of the acquisition are undisclosed, but it aligns with Anthropic's strategy of seeking acquisitions that enhance technological capabilities and reinforce its leadership in enterprise AI [4] - Bun's integration is expected to accelerate the development of Claude Code and related tools, with a focus on maintaining high performance and lightweight solutions [15] Bun's Impact and Growth - Bun's monthly downloads exceed 7 million, with over 82,000 stars on GitHub, indicating its popularity among developers [4] - The tool has been adopted by companies like Midjourney and Lovable to improve development speed and efficiency [4] - Bun's single-file executables facilitate the distribution of CLI tools, making it a preferred choice for many coding agents [3] Future Prospects - The acquisition is seen as a way to provide long-term stability for Bun, allowing it to focus on building the best JavaScript tools without the pressure of immediate monetization [12][15] - Bun's roadmap will continue to emphasize high-performance JavaScript toolchains and Node.js compatibility, aiming to replace Node.js as the default server-side JavaScript runtime [17] - The integration with Anthropic is expected to enhance Bun's capabilities and speed of iteration, benefiting existing users [15] Community and Open Source Commitment - Bun will remain open-source under the MIT license, with the original team continuing to develop the tool [17] - The commitment to maintaining an active development community and transparency in the development process is emphasized [17]