Foundation models
Search documents
Legal AI is splitting in two—and most people miss the difference
Yahoo Finance· 2026-03-04 09:00
Core Insights - Thomson Reuters announced that CoCounsel has reached one million users across 107 countries and territories, highlighting significant growth in legal AI adoption [1] - Anthropic introduced an expanded suite of enterprise plugins for Claude, including tools tailored for legal, finance, and HR applications, indicating a shift towards specialized AI solutions in the legal sector [1][6] - The recent developments underscore the ongoing evolution in legal AI and the importance of understanding the underlying systems that support these technologies [3][4] Industry Analysis - A recent incident involving a general counsel testing Anthropic's Claude for contract review revealed that the AI pulled information from Wikipedia, sparking debate about the readiness of foundation models for legal work [2][4] - The reactions to this incident were polarized, with skeptics claiming that foundation models are inadequate for legal tasks, while proponents argued that improvements are forthcoming; however, both sides overlooked the critical issue of system architecture [4][5] - The distinction between AI capable of automating workflows and AI that can perform authoritative legal work is becoming increasingly recognized in the market, as evidenced by the stock performance of Thomson Reuters following the announcement of CoCounsel's user milestone [7]
Building Applications with AI Agents — Michael Albada, Microsoft
AI Engineer· 2025-07-24 15:00
Agentic Development Landscape - The adoption of agentic technology is rapidly increasing, with a 254% increase in companies self-identifying as agentic in the last three years based on Y Combinator data [5] - Agentic systems are complex, and while initial prototypes may achieve around 70% accuracy, reaching perfection is difficult due to the long tail of complex scenarios [6][7] - The industry defines an agent as an entity that can reason, act, communicate, and adapt to solve tasks, viewing the foundation model as a base for adding components to enhance performance [8] - The industry emphasizes that agency should not be the ultimate goal but a tool to solve problems, ensuring that increased agency maintains a high level of effectiveness [9][11][12] Tool Use and Orchestration - Exposing tools and functionalities to language models enables agents to invoke functions via APIs, but requires careful consideration of which functionalities to expose [14] - The industry advises against a one-to-one mapping between APIs and tools, recommending grouping tools logically to reduce semantic collision and improve accuracy [17][18] - Simple workflow patterns, such as single chains, are recommended for orchestration to improve measurability, reduce costs, and enhance reliability [19][20] - For complex scenarios, the industry suggests considering a move to more agentic patterns and potentially fine-tuning the model [22][23] Multi-Agent Systems and Evaluation - Multi-agent systems can help scale the number of tools by breaking them into semantically similar groups and routing tasks to appropriate agents [24][25] - The industry recommends investing more in evaluation to address the numerous hyperparameters involved in building agentic systems [27][28] - AI architects and engineers should take ownership of defining the inputs and outputs of agents to accelerate team progress [29][30] - Tools like Intel Agent, Microsoft's Pirate, and Label Studio can aid in generating synthetic inputs, red teaming agents, and building evaluation sets [33][34][35] Observability and Common Pitfalls - The industry emphasizes the importance of observability using tools like OpenTelemetry to understand failure modes and improve systems [38] - Common pitfalls include insufficient evaluation, inadequate tool descriptions, semantic overlap between tools, and excessive complexity [39][40] - The industry stresses the importance of designing for safety at every layer of agentic systems, including building tripwires and detectors [41][42]