Workflow
多智能体系统
icon
Search documents
谷歌抢跑L3级AI,Gemini连续工作40分钟,Agent自动生成评审百条创意
量子位· 2025-11-19 01:37
Core Insights - Google is advancing towards L3 AI with its Gemini system, which can autonomously execute tasks for extended periods, marking a significant step in AI development [27][30][32]. Group 1: Gemini's Capabilities - Gemini can continuously operate for 40 minutes on a single task, showcasing its ability to handle complex processes [2][19]. - The system generates over 100 creative ideas based on user input, which are then evaluated and ranked by multiple agents, providing structured feedback [3][15]. - Users only need to make final decisions, as the exploration and iteration processes are managed by the agents, significantly reducing the time spent on refining outputs [4][11]. Group 2: Multi-Agent System - The multi-agent competition system integrates long-term thinking and adversarial generation, enhancing the quality of outputs by utilizing time effectively [10][12]. - This system allows for a comprehensive generation, competition, and selection process, resulting in a well-rounded set of ideas presented to users [15][20]. - Gemini for Enterprise includes applications for creative generation and collaborative research, demonstrating its versatility in different contexts [18][26]. Group 3: Future of AI - The development of L3 AI is characterized by the ability to autonomously run tasks over extended periods, with Gemini's capabilities aligning closely with this definition [30][32]. - Speculations suggest that future agents may be able to operate for even longer durations, potentially up to 3 hours by next year [33]. - As collaborative research features evolve, Gemini may reach L4 AI status, further enhancing its capabilities [37].
用「传心术」替代「对话」,清华大学联合无问芯穹、港中文等机构提出Cache-to-Cache模型通信新范式
机器之心· 2025-10-29 07:23
Core Insights - The article discusses the rapid advancements in large language models (LLMs) and the introduction of a new communication paradigm called Cache to Cache (C2C), which enhances multi-agent systems by allowing direct communication through KV-Cache instead of traditional Text to Text (T2T) methods [2][5][10]. Limitations of Existing Text Communication - T2T communication faces significant limitations, including information loss due to dimensionality reduction, semantic ambiguity inherent in natural language, and substantial delays caused by token-by-token output generation [7][8][6]. Advantages of KV-Cache - KV-Cache inherently contains multi-dimensional semantic information from the dialogue process, improving accuracy and efficiency. Experiments show that optimized KV-Cache can significantly enhance model accuracy and facilitate effective communication between different models [11][12][29]. C2C Mechanism - The C2C framework utilizes a fusion mechanism that integrates KV-Cache from different models, ensuring compatibility and effective information transfer. This involves a residual fusion structure to maintain the original semantics of the receiver model [16][17][19]. Performance and Efficiency - C2C demonstrates substantial performance improvements over T2T, with accuracy increases of 3% to 5% and speed enhancements of up to two times. The framework allows for efficient parallel processing, avoiding the inefficiencies of one-dimensional text output [29][31][28]. Experimental Results - The article presents various experimental results showing that C2C consistently outperforms T2T across multiple benchmarks, with significant accuracy gains and reduced inference times [28][31][29]. Future Prospects - The C2C paradigm has broad applications, including enhancing collaboration in multi-agent systems, integrating multimodal models, and improving privacy-aware cloud-edge collaboration. It is positioned as a key enabling technology for the next generation of multi-agent systems [36][38][39].
当AI成为你的新同事:Gartner 2026技术趋势揭示的人机共生未来
Sou Hu Cai Jing· 2025-10-21 23:54
Core Insights - The article discusses the transformative impact of AI on workplaces and daily life, highlighting the shift from AI as a tool to AI as an autonomous colleague [6][14] - Gartner's 2026 strategic technology trends report indicates that organizations and individuals are at a crossroads of transformation due to rapid technological advancements [6][12] Group 1: AI Evolution - AI is evolving from a passive tool to an intelligent colleague capable of making autonomous decisions and actions [6][7] - Multi-agent systems (MAS) are expected to become digital employees, collaborating to complete complex tasks, with organizations automating 80% of customer-facing processes by 2028 [7][10] - Physical AI is emerging, with robots and drones performing tasks in real-world environments, enhancing human capabilities in dangerous or repetitive jobs [7][10] Group 2: Domain-Specific AI - The rise of domain-specific language models (DSLM) is driven by the need for AI that understands industry-specific knowledge and terminology [8][9] - By 2028, over half of generative AI models in enterprises are expected to be domain-specific, utilizing the expertise of professionals to train AI [9] Group 3: Security and Trust - AI security is becoming increasingly critical, with over 50% of enterprises projected to adopt dedicated AI security platforms by 2028 [10][11] - Technologies like confidential computing and digital provenance are essential for protecting sensitive information and ensuring transparency in AI systems [10][11] Group 4: Geopolitical Factors - Geopolitical considerations are influencing technology choices, with a trend towards data and service localization, particularly in Europe and the Middle East [12] - By 2030, over 75% of enterprises in these regions are expected to migrate workloads back to local jurisdictions, impacting cloud strategies and service accessibility [12] Group 5: Organizational and Personal Adaptation - Companies must view AI as integral to their operations, with 80% expected to be enhanced by AI-driven small teams by 2030 [13] - Individuals will need to develop skills to work alongside AI, with a focus on maintaining critical thinking and creativity while leveraging AI capabilities [13][14]
Office Agent:新一代多智能体系统
Sou Hu Cai Jing· 2025-10-15 04:29
Core Insights - Microsoft has launched Office Agent, a multi-agent system built on an open-source technology stack and the Anthropic Claude model, aimed at enhancing content generation efficiency for users [1][6][22] - The system employs a new development paradigm called Taste-Driven Development (TDD), which focuses on aesthetic quality in content creation [1][6][12] Group 1: Office Agent Features - Office Agent automates the entire workflow from planning to writing and refining, significantly improving the efficiency of Office content production [1][3] - The system has achieved GAIA certification, demonstrating its superior performance in handling complex workflows [1][3] - It utilizes a collaborative workflow among specialized agents, including a central planning agent and various domain-specific agents [5][6] Group 2: Taste-Driven Development (TDD) - TDD enhances the aesthetic layout of AI-generated content by analyzing high-quality presentation samples to extract core design principles [6][12][14] - The workflow includes an iterative cycle where generated content undergoes quality and aesthetic evaluation through a content self-validation module [6][18] - TDD establishes a dual perspective framework for quality assessment, focusing on both content accuracy and aesthetic appeal [18][20] Group 3: Automation and User Experience - Office Agent introduces auto-theming, which generates designs that naturally fit the content rather than relying on preset templates [12][14] - The system incorporates expert-guided style rules to ensure that generated outputs are both aligned with core instructions and aesthetically refined [14][16] - Users can interact with Office Agent to review and adjust generated content, enhancing the collaborative aspect of the tool [21][22] Group 4: Performance Metrics - Microsoft has developed the TDDEval benchmark to assess TDD's performance across PowerPoint, Excel, and Word, ensuring a comprehensive evaluation of knowledge work [16][18] - The benchmark includes a variety of test tasks, ensuring the system's robustness in diverse scenarios [16][18] Group 5: Future Developments - Office Agent is currently available to Microsoft global personal and family subscription users, with business user support coming soon [22] - The system aims to integrate further within the Microsoft ecosystem, enhancing its capabilities in knowledge work creation and refinement [22][23]
北大汇丰王小愚:中国AI投资具备三大优势,首要挑战在核心技术依赖与硬件短板
Xin Lang Cai Jing· 2025-09-22 02:02
Core Viewpoint - The central financial work conference emphasizes the importance of technology finance, green finance, inclusive finance, pension finance, and digital finance for promoting high-quality financial development. The integration of 5G, AI, and blockchain is reshaping the financial infrastructure and service landscape, presenting both opportunities and challenges for the banking industry [1][3]. Group 1: Technological Integration in Finance - The collaboration of 5G, AI, and blockchain is fundamentally restructuring the architecture and operational logic of financial systems, enhancing payment systems, investment management, and supply chain finance [3][4][5]. - Payment and settlement systems can achieve real-time and trustworthy transactions, with 5G enabling millisecond-level latency and blockchain ensuring transaction immutability and traceability [3][4]. - AI enhances investment advisory and asset management by analyzing user preferences and market data, leading to more personalized and transparent investment strategies [4][5]. Group 2: Challenges of Technological Integration - The integration of these technologies may increase complexity and systemic risks within the financial system, such as compatibility issues between distributed ledgers and centralized AI frameworks [2][7]. - Performance bottlenecks exist between blockchain's low transaction per second (TPS) capabilities and the high throughput demands of 5G [6][7]. - The potential for AI algorithm resonance could amplify market volatility, leading to systemic risks if similar AI models are widely adopted [7]. Group 3: Key Players in the Ecosystem - Two types of companies are likely to dominate the "5G + AI + blockchain" ecosystem: technology giants with integration capabilities and specialized financial technology service providers [7][8]. - Technology giants can leverage their vast user bases and data resources to create efficient technology linkages, while specialized firms can focus on specific industry needs, enhancing their competitive edge [8]. Group 4: Future Directions in AI Investment - AI investment in China is driven by scenario-based applications, policy support, and engineering efficiency, with key challenges including reliance on core technologies and hardware limitations [9][12]. - The future of AI in finance will focus on multi-agent systems for decision-making, democratization of investment through asset tokenization, and seamless cross-border payment solutions [9][10][11]. - The evolution of AI technology is expected to shift from large models to intelligent agents capable of autonomous decision-making, enhancing operational efficiency in various sectors [12]. Group 5: Current Trends and Risks in Blockchain Investment - The current blockchain investment landscape is characterized by a mix of technological innovation and speculative behavior, leading to a phenomenon where "bad money drives out good" [14][17]. - Regulatory actions have targeted misleading cryptocurrency investment practices, indicating a need for clearer distinctions between genuine technological advancements and speculative projects [17][18]. - The differentiation between technological innovation and speculative behavior is crucial, with a focus on projects that do not promise financial returns and adhere to regulatory standards [18].
马斯克“巨硬计划”新动作曝光!从0建起算力集群,6个月完成OpenAI&甲骨文15个月的工作
Sou Hu Cai Jing· 2025-09-18 06:34
Core Insights - Elon Musk's "Macrohard" initiative has rapidly established a computing cluster capable of supporting 110,000 NVIDIA GB200 GPUs within six months, achieving a power supply scale of 200MW, which is a record compared to similar projects by OpenAI and Oracle that took 15 months [1][2][4] Group 1: Project Overview - The "Macrohard" project, which started in 2021, aims to automate the entire software development lifecycle using AI agents, including coding, design, testing, and management [2][4] - The Colossus II project, initiated on March 7, 2025, plans to deploy over 550,000 GPUs, with a peak power demand expected to exceed 1.1GW, and a long-term goal of expanding to 1 million GPUs [4][5] Group 2: Infrastructure and Power Supply - To meet the substantial power requirements, xAI has acquired a former Duke Energy power plant in Mississippi, which has been temporarily approved to operate gas turbines for 12 months [4][5] - xAI has partnered with Solaris Energy Infrastructure to lease gas turbines, with 400MW currently allocated to the project, and has invested $112 million in capital expenditures for this partnership [5] Group 3: Strategic Importance - The Macrohard initiative is becoming a crucial part of Musk's business strategy, positioning Tesla as an "AI robotics company," with 80% of its future value tied to robotics [6] - The AI software developed through Macrohard will enhance Tesla's autonomous driving algorithms and factory automation, while Tesla's extensive real-world data will provide valuable training data for the Macrohard project [6]
张小珺对话OpenAI姚顺雨:生成新世界的系统
Founder Park· 2025-09-15 05:59
Core Insights - The article discusses the evolution of AI, particularly focusing on the transition to the "second half" of AI development, emphasizing the importance of language and reasoning in creating more generalizable AI systems [4][62]. Group 1: AI Evolution and Language - The concept of AI has evolved from rule-based systems to deep reinforcement learning, and now to language models that can reason and generalize across tasks [41][43]. - Language is highlighted as a fundamental tool for generalization, allowing AI to tackle a variety of tasks by leveraging reasoning capabilities [77][79]. Group 2: Agent Systems - The definition of an "Agent" has expanded to include systems that can interact with their environment and make decisions based on reasoning, rather than just following predefined rules [33][36]. - The development of language agents represents a significant shift, as they can perform tasks in more complex environments, such as coding and internet navigation, which were previously challenging for AI [43][54]. Group 3: Task Design and Reward Mechanisms - The article emphasizes the importance of defining effective tasks and environments for AI training, suggesting that the current bottleneck lies in task design rather than model training [62][64]. - A focus on intrinsic rewards, which are based on outcomes rather than processes, is proposed as a key factor for successful reinforcement learning applications [88][66]. Group 4: Future Directions - The future of AI development is seen as a combination of enhancing agent capabilities through better memory systems and intrinsic rewards, as well as exploring multi-agent systems [88][89]. - The potential for AI to generalize across various tasks is highlighted, with coding and mathematical tasks serving as prime examples of areas where AI can excel [80][82].
DeepDiver-V2来了,华为最新开源原生多智能体系统,“团战”深度研究效果惊人
量子位· 2025-09-11 10:19
允中 发自 凹非寺 量子位 | 公众号 QbitAI 采用了 "团队作战" 模式:一个Planner负责任务分解,任务分发,进度审视和成果验收,多个专业Executor并行处理子任务,通过共享文件 系统高效交换信息。 与仅通过推理框架实现的多智能体系统不同,DeepDiver-V2以多智能体形态进行训练,模型天然具备更强的角色扮演和协同推理能力。这套 系统不仅在复杂知识问答任务上取得突破,更是能够 生成数万字的高质量深度研究报告 ,在多个榜单中表现亮眼。 它基于华为openPangu Agent推出的DeepDiver-V2,这是一个专攻AI深度搜索和长文调研报告生成的模型。 目前已开源 。 性能爆表:优于同规格竞品 数字最有说服力。DeepDiver-V2-7B和DeepDiver-V2-38B和在多个权威基准测试中表现亮眼: 让智能体组团搞深度研究,效果爆表! 华为最新发布 DeepDiver-V2原生多智能体系统 。 在长文报告生成方面 ,DeepDiver-V2提出了一个全新的面向深度调研报告生成的基准测试WebPuzzle-Writing,该基准给每个调研query设 置了详细的调研范围而非开放生成 ...
A2A、MCP、Gemini……谷歌技术专家手把手教你搭建 AI Agent
Founder Park· 2025-09-02 10:21
Core Insights - The article discusses a seminar featuring Google Cloud AI expert Shi Jie, focusing on techniques for building AI agents using ADK, A2A, MCP, and Agent Engine [2] - It emphasizes the potential of Google's latest AI technologies to create collaborative, efficient, and scalable multi-agent systems [2] - The future of agent development and its impact on human-computer interaction is also explored [2] Group 1: Seminar Details - The seminar will cover how to leverage ADK, A2A, MCP, and Agent Engine to construct AI agents [6] - It aims to provide insights into utilizing Google's latest AI technology for developing highly collaborative and efficient multi-agent systems [6] - The event is targeted at AI startup leaders, technical heads, AI product managers, solution architects, developers, and AI engineers [6] Group 2: Registration Information - Participants are encouraged to scan a QR code for registration, with limited slots available and registration subject to approval [3]
LLM也具有身份认同?当LLM发现博弈对手是自己时,行为变化了
3 6 Ke· 2025-09-01 02:29
Core Insights - The research conducted by Columbia University and Montreal Polytechnic reveals that LLMs (Large Language Models) exhibit changes in cooperation tendencies based on whether they believe they are competing against themselves or another AI [1][29]. Group 1: Research Methodology - The study utilized an Iterated Public Goods Game, a variant of the Public Goods Game, to analyze LLM behavior in cooperative settings [2][3]. - The game involved multiple rounds where each model could contribute tokens to a public pool, with the total contributions multiplied by a factor of 1.6 and then evenly distributed among players [3][4]. - The research was structured into three distinct studies, each examining different conditions and configurations of the game [8][14]. Group 2: Key Findings - In the first study, when LLMs were informed they were playing against "themselves," those prompted with collective terms tended to betray more, while those prompted with selfish terms cooperated more [15][16]. - The second study simplified the rules by removing reminders and reasoning prompts, yet the behavioral differences between the "No Name" and "Name" conditions persisted, indicating that self-recognition impacts behavior beyond mere reminders [21][23]. - The third study involved LLMs truly competing against their own copies, revealing that under collective or neutral prompts, being told they were playing against themselves increased contributions, while under selfish prompts, contributions decreased [24][28]. Group 3: Implications - The findings suggest that LLMs possess a form of self-recognition that influences their decision-making in multi-agent environments, which could have significant implications for the design of future AI systems [29]. - The research highlights potential issues where AI might unconsciously discriminate against each other, affecting cooperation or betrayal tendencies in complex scenarios [29].