Multi-Agent Systems

Search documents
Infrastructure for Multi-Agent Systems
Y Combinator· 2025-08-01 22:29
AI Agent Evolution - AI agents are evolving into distributed workflows with many sub-agent calls in a single run [1] - These multi-agent systems are useful for long-running workflows and agentic map reduce jobs [1] - These systems apply human-level judgment to filter and search through large amounts of data in parallel [1] Challenges in Building Multi-Agent Systems - Building these systems requires solving traditional distributed systems problems to ensure high throughput and reliability while controlling costs [2] - New problems include writing effective agent and sub-agent prompts, handling untrusted context, and monitoring and debugging agents [2] Call to Action - The industry is looking for builders who have felt this pain in production and want to create tools to make these systems easier to build and maintain [2] - The industry aims to make operating fleets of agents as routine and reliable as deploying a web service or running a Spark job [3]
统一20+多智能体方法,MASLab震撼发布
机器之心· 2025-06-13 04:31
Core Viewpoint - OpenAI aims to achieve "organizational-level" intelligence as the ultimate goal in the five stages towards AGI (Artificial General Intelligence), where AI can manage complex processes, make high-level decisions, and coordinate large-scale operations [1] Group 1: MASLab Introduction - MASLab is a collaborative initiative launched by ten institutions, including Shanghai Jiao Tong University and Oxford University, to accelerate the healthy development of Multi-Agent Systems (MAS) [2] - MASLab provides a unified, comprehensive, and research-friendly codebase for large model multi-agent systems, facilitating ease of use and reproducibility [4] Group 2: Features of MASLab - MASLab integrates over 20 mainstream MAS methodologies, covering results from major conferences over the past two years across various fields and task types [6] - The platform ensures evaluation fairness and reproducibility through standardized input preprocessing, LLM configuration, and evaluation protocols [8] Group 3: Experimental Analysis - Researchers have conducted extensive experiments using MASLab, covering over ten evaluation benchmarks, including MATH and GPQA, and analyzing the performance of eight major models [11] - The results demonstrate the current state of MAS methods, highlighting their strengths and weaknesses [14] Group 4: Innovations and Future Directions - MASLab-ReAct, a more efficient MAS method, supports various tools and has shown superior results on the GAIA validation set, indicating significant potential for real-world applications [16] - MASLab is an open-source platform aimed at community contributions, with plans to continuously release more methods and benchmarks to foster a sustainable MAS research community [22][23]