Workflow
多智能体系统
icon
Search documents
在AI社会抓「内鬼」?上海AI Lab推出首个多智能体极端事件解释框架
机器之心· 2026-03-04 09:15
序言:数字镜像中的极端涌现 这类极端事件的出现并非源于代码漏洞,而是 来自系统演化的自发涌现。 由 上海人工智能实验室联合上海交通大学、复旦大学、中国人民大学、同济大学 开展的一项最新研究,决定拆解这些数字镜像中的「黑天鹅」演化过程,揪出那 个藏在复杂涌现背后、诱发系统崩溃的「内鬼」。 风起于青萍之末、不稳定的害群之马、羊群效应、毒瘤行为...... —— 在数字镜像的背面,这群科学家凝视着 AI 社会的「黑天鹅」时刻。 2023 年,斯坦福「模拟小镇」(Smallville)的爆火出圈,开启了大语言模型(LLM)驱动多智能体系统(MAS)模拟人类社会的元年。 如今,学术界已经构建出了各种高度复杂、垂直领域的 MAS 沙盒 —— 从复现宏观经济运行的社会系统,到模拟股票交易的金融市场,再到推演舆论演化的社交 网络。多智能体系统,正真正成为全方位映射人类社会的数字镜像。 然而,随着系统复杂程度的攀升,一种令人不安却极具研究价值的现象随之浮现:恶性通胀、股市崩盘、群体极化…… 这些现实人类社会的 「黑天鹅」 极端事 件,竟也在这群 AI 身上精准重演了。 论文链接: https://arxiv.org/pdf/2 ...
耗费2万美元、两周写10万行Rust代码!16个Claude智能体写的C编译器,能编译Linux内核却卡在“Hello World”?
程序员的那些事· 2026-02-11 09:44
Core Insights - The article discusses a groundbreaking experiment conducted by Anthropic researcher Nicholas Carlini, where a team of 16 Claude AI agents autonomously built a Rust-based C compiler capable of compiling the Linux 6.9 kernel without human intervention [1][4][5]. Group 1: Experiment Overview - The experiment lasted approximately two weeks, involving nearly 2000 Claude Code sessions, consuming around 20 billion input tokens and 1.4 million output tokens, with an API cost close to $20,000, resulting in a C compiler with about 100,000 lines of code [4]. - The compiler demonstrated capabilities beyond previous expectations of large language model (LLM) programming abilities, achieving a 99% pass rate on major compiler test suites and successfully compiling and running the game Doom [7][5]. Group 2: Methodology and Innovation - The key innovation of this experiment lies not in the model itself but in the collaborative approach, where the AI agents were set strict goals to work independently without relying on human input [6][9]. - A simple loop framework was established to allow the agents to take on new tasks immediately after completing previous ones, running within Docker containers to prevent local machine impact [6][8]. Group 3: Challenges and Limitations - Despite the impressive outcomes, the compiler faced criticism for not being able to compile a basic "hello world" program without manual intervention, raising questions about its maturity [10][13]. - Carlini explicitly outlined several limitations of the compiler, including its inability to independently compile the Linux kernel, reliance on GCC components for assembly and linking, and lower performance compared to established compilers [14][15][20]. - The project highlighted that the real challenge was not just writing code but creating an environment that allowed the AI to operate autonomously, emphasizing the need for rigorous testing and feedback mechanisms [21].
2026开局Update:锦秋与创业者的“全速前进”
锦秋集· 2026-02-03 10:44
Group 1 - The core viewpoint of the article discusses the emergence of 1.8 million "animation super individuals" enabled by technology, suggesting that animation can become a form of "super expression" for everyone [1] - The discussion features OiiOii's founder, who emphasizes that OiiOii is not just a generative tool but an intelligent collaborative agent system composed of AI scriptwriting, storyboarding, and sound effects [1] - The conversation aims to dissect the technological experiment surrounding "identity reversal" and the reconstruction of AI productivity [1] Group 2 - The first episode focuses on the dynamics of AI entrepreneurship in China and the U.S. in 2026, highlighting how Chinese entrepreneurs can position themselves on a global stage [2] - The guests include a dual-capacity investor with a background in AI research and early-stage VC, providing insights into the underlying logic of the Sino-U.S. AI investment ecosystem [2] - Key topics include the due diligence truths of Silicon Valley VCs, funding strategies for non-native entrepreneurs, and overlooked market gaps by OpenAI [2] Group 3 - The CES discussion involved around 40 participants from AI hardware and AI agent sectors, exchanging insights on industry trends observed during the CES event [3] - The "Predict 2026" roundtable gathered AI builders to share their predictions for the year, focusing on supply-side discussions and the evolving landscape of content production and trust in a saturated market [5] - A session on AI application gaps explored the challenges and future prospects of AI deployment, with founders and practitioners sharing their experiences [7] Group 4 - A conversation with a top AI comic company centered on multimodal content and the industrialization of content production, addressing emotional expression and monetization strategies [8] - The article highlights the achievements of various companies, including Inke's recent financing round of nearly 200 million RMB, indicating strong investor interest in humanoid robotics and core components [12] - The article also mentions the successful launch of several AI products at CES 2026, showcasing advancements in humanoid robots and smart home technology [19][21][23]
Agent当上群主后,群聊变成办事大厅了
量子位· 2026-02-02 03:39
西风 发自 凹非寺 量子位 | 公众号 QbitAI 文心APP的群里,最近有点"AI多势众"。 此群非一般的群,正是文心APP最近正在内测的 行 业首个"多 人、多Agent"群聊功能 。 该怎么形容它最贴切,一进这个群,就相当于进入了一个微型"办事处",有几位 随时待命、 各司其 职的Agent专员 ,能真正替你办事、帮 你支招,沟通效率还很高的那种。 它的用处很实在。 比如年初体检季,家人对着报告单上几个箭头忧心忡忡,亲戚群里七嘴八舌,焦虑在转发和猜测中发酵。这时就可以立刻拉个文心群。 大家聊天中一旦出现"指标异常要不要紧"等健康方面的疑问,原本在线的 群聊助手Agent 就会立刻拉 文心健康管家Agent 入群,用口语化 的表述解读专业术语,区分哪些问题需要重视、哪些不必过度担心。 这既回应了当事人的具体困惑,也平复了围观亲友的紧张情绪。专业信息成了可理解、可落实的建议。 再举个栗子,几个朋友想周末特种兵式出游,以往在群里定行程,常陷入"随便都行"和"怎么都行不通"的拉扯。 但建一个文心群聊,当大家讨论"这个季节哪儿人少景好""怎么走不绕路"时,不用你手动@,群聊助手便会主动识别需求给出建议,帮你做旅 ...
头部大模型厂商基本面更新与推荐
2026-02-02 02:22
Summary of Key Points from Conference Call Records Industry Overview - The large model industry has transitioned from the Chat paradigm to the Agent paradigm, with leading companies focusing on building native Agent capabilities rather than merely pursuing parameter scale [1][5] - Major internet companies are intensifying competition for AI super entry points, with Alibaba, ByteDance, and Tencent implementing various strategies to capture high-frequency traffic [1][8] Core Companies and Their Strategies Zhiyu (智谱) - Zhiyu has developed a full-stack large model technology and open-source strategy to build an industry ecosystem, with expected revenue reaching 700-800 million RMB by 2025, but it will not achieve profitability due to high R&D and delivery costs [3][12] - The company launched the AutoGLM model, which integrates deep research and operational capabilities, and updated its GLM 4 Air base model with 320 billion parameters, achieving performance comparable to Deepseek 1 with an 8x speed improvement [2][19] Minimax - Minimax has released its second-generation agent product, MiniMax Agent 2, which transforms the interaction logic from human adaptation to agent adaptation, enhancing its competitive edge [2][4] - The company is expected to achieve revenue close to 300 million RMB by 2025 and approximately 230 million USD by 2026, with a strong focus on C-end subscriptions and application in overseas markets [3][19] Kimi - Kimi has launched the Kimi 2.5 multimodal model, which can utilize up to 100 specialized agents to perform tasks in parallel, significantly lowering the AI interaction threshold [5][6] Deepseek - Deepseek focuses on niche technological breakthroughs, particularly in OCR and visual processing, to differentiate itself in the market [6][7] Competitive Landscape - The competition among leading large model companies is becoming increasingly differentiated, with a focus on high-level reasoning capabilities, native multimodality, and collaborative execution of complex tasks [3][6][14] - Companies are moving beyond pure technical competition to consider technology, product, ecosystem, and implementation capabilities [7][15] Market Trends and Predictions - By 2028, it is expected that 60% of systems will support multi-vendor interoperability, transitioning from single-platform to agent internet systems, although cost and user experience remain constraints [10] - The market for MaaS (Model as a Service) is projected to reach a penetration rate of 70% in China by 2030, with companies like Zhiyu leveraging their API and cloud services to adjust their revenue structure [19] Challenges and Opportunities - Independent large model companies like Zhiyu and Minimax face challenges in achieving a leading position in high-level reasoning and multimodal engineering, requiring significant R&D investment and rapid product iteration [15][16] - The competition for entry points among major internet companies poses a risk of winner-takes-all scenarios, particularly if they establish one or two super entry points by 2026-2027 [15][16] Financial Performance - Minimax's performance is driven by C-end subscriptions and application fees, with a significant increase in active users and ARPU from 6 USD to 15 USD [19][20] - Zhiyu is the largest large model startup in China by revenue, with a focus on local deployment and cloud business as growth engines, while also expanding into international markets to mitigate domestic pricing wars and policy risks [20]
2026 年,商业变革者将面对什么?a16z 的最新趋势观察
3 6 Ke· 2026-01-29 10:58
Group 1: AI Capabilities and Paradigms - Vertical AI is transitioning from information retrieval to "multi-agent mode," enabling unprecedented growth in industries like healthcare, legal, and housing, with companies achieving over $100 million in annual revenue [2] - By 2026, vertical AI will unlock "multi-agent mode," allowing for collaboration across various roles in industries, enhancing efficiency and understanding of complex workflows [3] - The emergence of "Agent-native" infrastructure will be crucial, as systems evolve to handle intelligent agent-driven workloads, requiring a redesign of control planes to manage high-frequency tool calls and complex concurrency [6][7] Group 2: Education and Talent Development - The first AI-native university is expected to emerge by 2026, focusing on real-time learning and self-optimizing educational systems, with courses and academic guidance adapting based on data feedback [4][5] - This AI-native university will train graduates proficient in system orchestration, addressing the talent gap in the new economy [5] Group 3: Content Creation and Media - 2026 is anticipated to be a pivotal year for multi-modal content creation, where AI can generate and edit content across various formats, enhancing creative control for users [8][9] - Video content will evolve into interactive environments, allowing for dynamic storytelling and user engagement, blurring the lines between creator and audience [10] Group 4: AI in Business Operations - The traditional metric of "screen time" as a value delivery indicator will be replaced by more complex ROI measures, focusing on outcomes rather than usage time [11] - Companies will increasingly adopt multi-agent systems to manage complex workflows, leading to a rethinking of organizational structures and roles [19][20] Group 5: Consumer AI and Personalization - Consumer-grade AI products will shift from productivity tools to enhancing personal connections and self-awareness, with a focus on understanding users' complete life contexts [21] - The trend towards personalized products will redefine how companies approach consumer engagement, moving from mass production to individualized experiences [13] Group 6: Research and Development - AI will play a significant role in accelerating scientific discovery through autonomous laboratories capable of conducting experiments and iterating research directions [15] - The integration of AI in research workflows will foster a new style of inquiry, emphasizing the relationships between ideas and enabling novel discoveries [22][23] Group 7: Data Privacy and Security - The need for transparent and auditable data access controls will become critical as AI systems operate autonomously, necessitating a shift towards "secrets as a service" to protect sensitive information [25] Group 8: Startup Ecosystem - A new wave of startups will emerge, focusing on providing services to newly established companies, leveraging the current AI product cycle to achieve scalability [26]
北京形成人工智能闭环式产业生态
Bei Jing Shang Bao· 2026-01-25 17:18
Core Insights - The artificial intelligence industry has transitioned from a phase of technological exploration to a focus on practical applications, with a notable shift towards multi-agent systems that outperform single-agent systems in specific tasks [1] - AI is expanding beyond digital realms into the physical world, moving towards multimodal models and addressing core challenges such as temporal and spatial cognition [1] - Beijing is positioned as a central hub for AI development, benefiting from a comprehensive ecosystem that supports industry growth [1] Industry Development - By 2025, Beijing's core AI industry is expected to reach a scale of 450 billion yuan, with over 2,500 companies, accounting for approximately half of the national figures [2] - The city is home to nearly 60 listed companies and around 40 unicorns in the AI sector, including the first domestic AI chip and large model companies [2] - Beijing has 148 scholars listed in the "AI 2000 Global Most Influential Scholars" list, representing over 40% of the national total, with a total of 15,000 AI scholars in the city [2] Ecosystem and Policy Support - A comprehensive policy framework and a complete layout from foundational computing power to application scenarios have created a closed-loop industrial ecosystem in Beijing [2] - The collaboration between research institutions, enterprises, and policy levels is driving breakthroughs in new technologies and applications in the AI field [2] - There is an expectation that 2026 will be a pivotal year for the explosion of intelligent agents in China [2]
2026北京两会|对话市政协委员王仲远:北京形成了人工智能闭环式产业生态
Bei Jing Shang Bao· 2026-01-25 11:17
Core Insights - The artificial intelligence industry has transitioned from a phase of rapid development to a more pragmatic focus on application efficiency, particularly moving from single-agent systems to multi-agent systems [2][5] - Beijing is positioned as a core hub for AI development, with a comprehensive ecosystem that supports the industry through policies, talent, and technological advancements [3][6] Industry Trends - The development of foundational models, especially large language models, has slowed, while the application of these models is accelerating, emphasizing the shift towards multi-agent systems [5][9] - AI is expanding beyond digital realms into the physical world, necessitating advancements in multi-modal models and world models to tackle challenges in time-space cognition and physical reasoning [2][5] Market Potential - By 2025, Beijing's AI core industry is expected to reach a scale of 450 billion yuan, with over 2,500 companies, accounting for about half of the national figures [3] - The city is home to nearly 60 listed AI companies and around 40 unicorns, showcasing its leadership in the AI sector [3] Talent and Education - Beijing boasts a significant talent pool, with 148 individuals listed in the "AI 2000 Global Influential Scholars" ranking, representing over 40% of the national total [3][7] - The city has a complete talent development chain, supported by top universities and research institutions, fostering the growth of AI professionals [7][8] Policy and Ecosystem - The policy framework in Beijing is comprehensive and practical, supporting both disruptive innovations and the development of new research institutions, which contributes to a closed-loop industrial ecosystem [6][8] - The collaboration between research institutions, enterprises, and policy-makers is driving breakthroughs in new technologies and applications in the AI field [3][6] Future Outlook - The year 2026 is anticipated to be a pivotal year for the explosion of intelligent agents in China, with expectations for significant advancements in multi-agent systems [3][8] - The focus is on achieving commercial viability for large models, which is essential for high-quality development in the industry [9][10]
硅谷风投教父谈AI行业现状:智能需求无限,基建和应用爆发才刚刚开始
3 6 Ke· 2026-01-21 23:46
Core Insights - The discussion emphasizes that concerns about an AI bubble are misguided, as the true measure of demand is API call volume rather than stock price fluctuations [10][29] - OpenAI's growth trajectory is highlighted, with significant increases in computing power and annual recurring revenue (ARR) projected for the coming years [2][3][25] - The conversation indicates that AI is transitioning from a novelty to a necessity in various sectors, particularly in healthcare, where AI tools are increasingly utilized by professionals [13][22] Group 1: AI Bubble and Demand - Vinod Khosla argues that the concept of an AI bubble is a misconception, stating that the only limitation on demand is the availability of computing resources [10][29] - API call volume is presented as the key indicator of AI's real demand, contrasting it with the internet bubble where traffic was low despite high valuations [10][29] - The current situation shows that demand is outpacing investment, which is different from the internet bubble scenario [10][30] Group 2: OpenAI's Growth and Business Model - OpenAI's computing power is expected to grow from approximately 200 megawatts in 2023 to over 2 gigawatts by 2025, with corresponding ARR increasing from $2 billion to over $20 billion [2][3][25] - The relationship between computing investment and revenue growth is described as nearly linear, indicating that AI is in a supply-constrained phase [5][11] - OpenAI's business model has evolved into a multi-faceted structure, incorporating various products and revenue streams, including subscriptions and potential licensing [11][26] Group 3: AI in Healthcare - AI is transforming the healthcare sector, with 66% of U.S. doctors reportedly using ChatGPT in their daily work [13][22] - The regulatory environment poses challenges for AI's full integration into healthcare, particularly regarding prescription capabilities [22][23] - AI's role in healthcare is seen as a means to enhance professional knowledge and improve patient interactions [22][23] Group 4: Future Trends and Predictions - Khosla predicts that the year 2026 will mark the emergence of agent technology and multi-agent systems as core themes in AI development [6][9] - The potential for a deflationary economy is discussed, where labor and expertise costs approach zero, leading to significant societal changes [15][46] - The conversation suggests that the next decade will see a shift towards a world where many services, including education and healthcare, become significantly cheaper or even free due to advancements in AI and robotics [15][46] Group 5: Opportunities for Startups - Startups are encouraged to focus on unique data and complex workflows as their competitive advantage, rather than competing directly with large models [14][42] - The discussion highlights the importance of specialized solutions built on top of foundational AI models, as no single company can dominate all areas [41][42] - The potential for "agentic commerce" and the complexities of agent interactions are identified as emerging areas of interest for new ventures [42]
腾讯研究院AI速递 20260122
腾讯研究院· 2026-01-21 16:01
Group 1 - DeepSeek's Model 1 has been discovered in the FlashMLA codebase, potentially indicating an upcoming release, featuring a 512-dimensional architecture and support for NVIDIA's Blackwell architecture [1] - Liquid AI has launched the open-source inference model LFM2.5-1.2B-Thinking, which operates on a liquid neural network architecture and requires only 900MB of memory on mobile devices, achieving a score of 88 on MATH-500 [2] - The xAI engineer revealed that AI is being tested as a "colleague" in the MacroHard project, achieving human speeds eight times faster, and the company is considering utilizing idle computing power from approximately 4 million Tesla vehicles in North America [3] Group 2 - Research indicates that models like DeepSeek-R1 can spontaneously form multi-role debate mechanisms, significantly improving accuracy through internal social dialogue [4][5] - Medical SAM3, a new model developed by the University of Central Florida, allows for expert-level segmentation in medical imaging using only text prompts, achieving an average accuracy increase from 11.9% to 73.9% across 33 datasets [6] - Anthropic's CEO predicts that AI will fully take over software engineering roles within 6-12 months, with a significant portion of entry-level jobs expected to disappear in the next 1-5 years [7] Group 3 - The Sequoia xbench team reported that top agents can handle over 60% of 104 daily tasks, indicating that foundational agent capabilities have become commoditized [8] - OpenAI's CFO discussed the maturation of multi-agent systems by 2026, emphasizing that AI bubbles should be measured by API call volumes rather than stock prices, with productivity increases of 27-33% for cutting-edge companies [9]