Multi-Agent Collaboration
Musk's Grok 4.20 Launches by Surprise! Four AIs Argue It Out in a Meeting, and a 47% Live-Trading Return Crushes GPT-5
Sohu Finance · 2026-02-18 12:00
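新智元 report | Editor: 犀牛 [新智元 Digest] Musk's Grok 4.20 beta has just been released. It is not one AI but four agents debating live! It posted returns of up to 47% in live stock trading, trouncing GPT-5 and Gemini. For this cyberpunk Spring Festival season, Musk has served up a "hearty dish." Just a few hours ago, xAI launched Grok 4.20 Beta without any warning, with a conspicuous "4 Agents" label next to it. You think you are chatting with one AI? No, you have walked into a meeting room with four experts seated in it. You can even watch them question, undercut, and correct one another, until "Captain Grok" makes the final call and merges everything into an answer. Now it is finally the AI's turn to show you "the art of the meeting." Sure enough, users started hands-on testing as soon as the model went live. Take the recently popular "50-meter car wash" test, only this time a user changed it to "washing a helicopter 50 meters away." Either way, Grok 4.20's answer did not look great. User Testlabor had Grok 4.20 write a Tetris game in 1 minute 20 seconds. "It has the same rules and style as the original game and is fully playable," Testlabor said. User tetsuo even built an artificial life simulator. Quite impressive. Musk himself has not been idle either. On X, he kicked off a round of rather ...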
Wang Huiwen Is Rallying Founders Again, but Building Another OpenClaw Isn't Realistic
虎嗅APP (Huxiu App) · 2026-02-13 09:52
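The following article is from AGI接口, by Chen Yifan. Behind the call to arms: whose noose, and whose rebirth, is OpenClaw? What set off an even bigger wave of attention was someone using OpenClaw to build Moltbook, a social network where only AI agents are allowed to post. Rent-a-Human appeared next, presenting an AI crowdsourcing platform with dark humor and realizing a future scenario in which AI hires humans, which further amplified what OpenClaw, and AI more broadly, could do. Even Wang Huiwen issued an open call, publicly seeking teams in China building startups in OpenClaw-related areas. "OpenClaw has fully unleashed AI's productivity; the future has been ignited." The key signal of this unleashed productivity is "AI to AI." In the days after OpenClaw came out, Fu Zhi, founder and CEO of elastic-compute provider 共绩科技, was thrilled: he deployed OpenClaw overnight, tried Moltbook, and posted four WeChat messages in a row to express how he felt. Information transfer shifts from single-point, single-threaded exchange between people to multi-threaded AI-to-AI exchange. Every mode of connection from the internet era will be upended. When a new technology that strikes at the essence of productivity spreads at viral speed, what the market lacks most is no longer excitement but foresight. Amid the OpenClaw-style FOMO, 虎嗅 and 创 ...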
The Chosen Workhorse AI for Windows Is Here! We Tested the Chinese Counterpart to Claude Cowork: Top-Notch
量子位 (QbitAI) · 2026-02-04 01:01
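一水, from 凹非寺. 量子位 | WeChat official account QbitAI. Speed is everything! Claude Cowork had barely gone viral in Silicon Valley when a Chinese counterpart appeared: the desktop version of Kunlun Tiangong's Skywork. When it first showed up in our feed, we assumed it was just more trend-chasing. We stand corrected: back in May of last year, the company had already launched the web version, Skywork Super Agents (a work squad made up of multiple agents), and it was fairly popular. Fair enough, so what matters more than speed is the part that comes before it: long preparation, right? After a quick download and install, a glance at the product's feature layout reveals two standout traits. In the spirit of "if it shows up in your feed, it's fate," we also rushed to learn about and test this desktop tool, and found that it really is friendly to working folks like us: making a slide deck or research report, or running some data analysis on local documents, basically takes a single sentence. Key point: you do not have to upload files by hand; the agents now read the many files on your computer automatically and get to work. All file processing also happens locally with no upload to the cloud, which protects privacy as far as possible. And more important still, this time we Windows users got access first (P.S. unlike Claude Cowork, which mainly targets macOS). So without further ado, here is our hands-on test. Testing the Skywork desktop version ...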
Stop Grinding Away in the IDE: OpenAI's Standalone Codex App Launches, with Multiple Agents Writing Code for You
36Kr · 2026-02-03 12:46
Core Insights
- The competition in the AI programming sector is intensifying, with OpenAI's Codex App marking a significant evolution in AI coding tools, transitioning from a simple code assistant to a multi-agent collaboration platform [1][10]
Group 1: Codex App Features
- Codex App allows developers to manage multiple AI agents simultaneously, enabling parallel task execution and independent operation of agents, which enhances productivity [2][3]
- Each Codex agent can work for up to 30 minutes, returning complete code results, and operates on the GPT-5.2-Codex model, which currently leads in benchmark tests [1][2]
- The app supports git worktree, allowing multiple agents to work in the same repository without conflicts, thus maintaining the stability of the main branch [2]
Group 2: Skills and Automation
- OpenAI is expanding the capabilities of Codex beyond code generation by introducing "Skills," which package instructions and resources for stable execution of workflows [3][5]
- Codex App includes an automation feature that allows developers to set scheduled tasks for agents to run in the background, facilitating the handling of repetitive but important tasks [5]
Group 3: Safety and Technical Debt
- The design of Codex emphasizes safety, utilizing a configurable system-level sandbox that restricts agent access to specific files and requires user authorization for higher permissions [6]
- Codex has proven effective in addressing technical debt, performing tasks that human engineers often avoid, such as code refactoring and legacy issue resolution [7]
Group 4: Market Position and Future Plans
- OpenAI aims to establish Codex as the default tool in the AI programming space before competitors can expand their influence, with over 1 million developers already using Codex in the past month [8][10]
- Future plans include launching a Windows version and enhancing cloud-triggered automation capabilities, alongside continuous improvements in model performance [9]
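The sandbox behavior described under Group 3 (agent access limited to specific files, with user authorization required for anything beyond that) might look roughly like the Python sketch below. This is a hypothetical illustration, not Codex's actual mechanism, which the summary does not detail; `is_allowed`, `request_access`, and the `approve` callback are invented names, and the sketch assumes Python 3.9 or later for `Path.is_relative_to`.

```python
from pathlib import Path
from typing import Callable


def is_allowed(target: str, allowed_roots: list[str]) -> bool:
    """Return True if target resolves to a location inside an allowed root."""
    # resolve() collapses ".." segments and follows symlinks, so a path
    # cannot escape the sandbox by climbing out of an allowed directory.
    resolved = Path(target).resolve()
    return any(
        resolved.is_relative_to(Path(root).resolve()) for root in allowed_roots
    )


def request_access(
    target: str,
    allowed_roots: list[str],
    approve: Callable[[str], bool],
) -> bool:
    """Grant access inside the sandbox; otherwise defer to the user."""
    if is_allowed(target, allowed_roots):
        return True
    # Outside the sandbox: access depends entirely on explicit user approval.
    # `approve` is a hypothetical stand-in for an authorization prompt.
    return approve(target)
```

Resolving paths before comparing them is the design choice that matters here: a plain string-prefix check would accept `/workspace/repo/../secrets.txt` even though it points outside the allowed root.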
Scattering 500 Million, Baidu Wants to Rebuild "Social" with AI
Sohu Finance · 2026-01-29 14:25
Core Insights
- The article discusses the evolving landscape of AI and social interaction, highlighting the competition among major players like Tencent, Baidu, and Alibaba in the AI chat space, particularly focusing on group chat functionalities as a new frontier for user engagement and collaboration [1][2][3]
Group 1: Industry Dynamics
- Tencent's return with "Yuanbao" and Baidu's entry with 500 million cash signals a strategic shift towards group chat functionalities, indicating a consensus among industry giants on the need for more sustainable user engagement models [1][2]
- The historical context of the competition between the "BAT" (Baidu, Alibaba, Tencent) suggests that the current focus on group chat is a response to the limitations of traditional chatbot interactions, which have struggled to maintain user engagement [1][5]
- The shift towards group chat as a collaborative space reflects a broader trend in the industry, where companies are moving from individual tool provision to creating environments for multi-agent collaboration [2][5]
Group 2: Strategic Intent
- Baidu's development of the Wenxin app's group chat feature aims to establish a new user habit of involving AI in collaborative tasks, moving beyond mere tool usage to integrating AI as a collaborator in daily activities [2][8]
- The emphasis on goal-oriented collaboration in group chats is seen as a more effective approach for AI integration into social interactions, contrasting with traditional social media's focus on relationship maintenance [6][7]
- Baidu's strategy reflects a significant shift from a technology-centric approach to a user-centric model, focusing on cultivating user habits that leverage AI for practical problem-solving [9][11]
Group 3: Technological and Business Framework
- Baidu has established a comprehensive "chip-cloud-model-application" ecosystem, which underpins its ability to offer AI services sustainably, thus enabling its aggressive investment in the group chat space [11][12]
- The success of Baidu's AI applications, such as Wenku and Wangpan, demonstrates the potential for AI to generate stable revenue streams, reinforcing the company's confidence in its strategic direction [12]
- The competitive landscape is characterized by a race to define the future of human-AI collaboration, with each company leveraging its unique strengths to capture market share and user engagement [13][14]
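Interview | AI Needs "Lifelong" Learning Too: A Conversation with Stephen Smith, President of the Association for the Advancement of Artificial Intelligence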
Xinhua News Agency · 2026-01-29 04:13
Core Insights
- The future development of artificial intelligence (AI) may hinge on the concept of "lifelong learning," similar to human learning methods [1]
- The rise of large language models (LLMs) has been a significant breakthrough in AI, but they have limitations, including a lack of continuous updating and causal reasoning capabilities [1][2]
- Achieving "lifelong learning" in AI presents technical challenges, particularly in fine-tuning existing LLMs without compromising their performance [2]
Group 1
- The most notable breakthrough in AI is the emergence of large language models, which can understand and generate text based on extensive data training [1]
- Current AI systems, primarily based on LLMs, are often "frozen" after initial training, lacking the ability to grow and adapt over time [1]
- LLMs excel at identifying correlations but struggle with causal reasoning, which limits their planning abilities and can lead to nonsensical outputs [1]
Group 2
- Implementing "lifelong learning" in AI could mimic human learning processes, relying on small samples and selective data rather than vast amounts of information [2]
- Robotics and embodied intelligence may enhance AI development by allowing interaction with the physical world, thereby accumulating experience and understanding causal relationships [2]
- The future direction of AI includes the development of autonomous agents that can make independent decisions and collaborate with other agents to solve complex problems [2]
Yuanbao Pai (元宝派), Wenxin, UC... Why Is All of BAT Rushing to Put AI into Group Chats?
36Kr · 2026-01-27 23:57
Core Insights
- The major Chinese internet companies "BAT" have simultaneously responded to the emerging trend of "AI group chat," indicating a significant paradigm shift in the industry [1]
- The introduction of AI group chat features by these companies suggests a strategic positioning for the "AI super entrance" in the evolving digital landscape [3]
Group 1: AI Group Chat Development
- The concept of "AI group chat" has gained traction following the testing of group chat features by ChatGPT in December 2024, leading to accelerated development by major tech firms [2]
- AI group chat is transitioning from a basic social feature to a core component of the AI era, driven by advancements in large model capabilities and the need for multi-agent collaboration [2][3]
Group 2: Product Features and User Experience
- Early explorations of AI group chat were conducted by smaller companies, with products focusing on emotional connection and immersive experiences rather than efficient collaboration [4][5]
- Current AI group chat products emphasize task-oriented collaboration, integrating various functionalities such as social interaction, work tasks, and personal assistance [6][7]
Group 3: Market Implications
- The competition for the "super entrance" in the AI era is intensifying, with AI group chat potentially becoming a central hub for user interactions, decision-making, and collaboration [15][19]
- The integration of AI capabilities into a unified workspace can enhance user engagement and create new business opportunities, as seen in Tencent's "Yuanbao" and Baidu's multi-AI role chat [12][17]
Group 4: Future Outlook
- The success of AI group chat could redefine platform boundaries, creating a new type of application that encompasses various tools and social interactions [18]
- The ongoing development of AI group chat features by major players suggests that the true competition for user engagement and market dominance will intensify by 2026 [20]
The Strongest AI Products of 2025 in One Article | 量子位智库 Annual AI 100
量子位 (QbitAI) · 2026-01-22 07:37
Core Viewpoint
- The article highlights the transformation of China's AI product ecosystem in 2025, marking it as the "Year of AI Applications," where the focus shifts from mere functionality to system reconstruction driven by advancements in underlying models, user demand, and business model evolution [5][6].
Group 1: AI Product Landscape
- The 2025 AI market in China is characterized by the launch of major AI companies like Zhipu and MiniMax, indicating a maturing market [3].
- The "AI 100" product list released by Quantum Bit Think Tank categorizes AI products into three main segments: "Flagship AI 100," "Innovative AI 100," and the top products from ten popular sectors [7][29].
- The "Flagship AI 100" focuses on the strongest AI products of 2025, showcasing those that have achieved significant technological breakthroughs and practical application value [8][29].
Group 2: User Engagement and Market Trends
- The top five AI products on the web account for over 62% of monthly active users (MAU), while the top five on mobile apps represent over 65% of daily active users (DAU) [12].
- AI general assistants and AI office platforms remain the most popular sectors, significantly outpacing other categories in user scale [12].
- The "Innovative AI 100" aims to identify products with potential for explosive growth in 2026, highlighting emerging trends in various AI sectors [13][16].
Group 3: Sector-Specific Insights
- The article identifies ten key AI application sectors, including AI browsers, AI agents, AI smart assistants, and AI education, each featuring top three products that exemplify innovation and engineering excellence [19][23].
- The evaluation of these sectors serves as a retrospective on the AI application market in 2025, emphasizing the competitive landscape and user engagement [24].
Group 4: Evaluation Methodology
- The "AI 100" list employs a dual assessment system combining quantitative and qualitative metrics, focusing on user data, growth, and long-term development potential [26].
- Quantitative metrics include user scale, growth, and engagement, while qualitative assessments consider technology, market space, and user experience [26].
In 168 Hours, AI Churned Out 3 Million Lines of Code to Build a Browser! Cursor Reveals How Hundreds of Agents Collaborate Autonomously
量子位 (QbitAI) · 2026-01-16 12:20
Core Insights
- The article discusses a groundbreaking experiment by Cursor, where hundreds of AI agents collaboratively developed a usable web browser from scratch, producing over 3 million lines of code [2][3].
Group 1: Experiment Overview
- The project, codenamed FastRender, resulted in a browser with a rendering engine written in Rust and a custom JavaScript virtual machine [2].
- The browser is described as "barely usable," with performance significantly lagging behind established browsers like Chrome, but it can render Google's homepage correctly [3][4].
Group 2: AI Model Utilization
- The success of the experiment relied on OpenAI's GPT-5.2-Codex, which is designed for complex software engineering tasks and can autonomously plan and execute coding tasks [5][6].
- GPT-5.2-Codex incorporates a technique called "Context Compaction," enhancing its ability to maintain logical consistency while handling large codebases [8].
Group 3: Multi-Agent Collaboration
- Cursor developed a multi-agent collaboration architecture to enable hundreds of AI agents to work simultaneously without conflicts [12][18].
- Initial attempts at a flat collaboration model led to significant inefficiencies, prompting a shift to a hierarchical structure with planners, workers, and judges to streamline the process [15][18].
Group 4: Insights and Challenges
- The experiment revealed that the general GPT-5.2 model outperformed the specialized GPT-5.1-Codex in long-term autonomous tasks, while other models like Claude Opus 4.5 were better suited for interactive scenarios [21].
- The design of prompts was found to be more critical than the model itself, emphasizing the need for extensive trial and error to guide AI agents effectively [22].
Group 5: Future Implications
- The experiment sparked significant industry discussion, with predictions that the marginal cost of software development could approach zero as token costs decline [25].
- Despite existing challenges, such as planning responsiveness and agent overactivity, the experiment demonstrated the feasibility of scaling autonomous coding capabilities through increased agent numbers [29].
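The hierarchical structure of planners, workers, and judges mentioned under Group 3 can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not Cursor's implementation, which the summary does not describe in detail: the names `plan`, `work`, `judge`, and `run` are hypothetical, and each "agent" is a plain function rather than a model call.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass(frozen=True)
class Task:
    task_id: int
    description: str


@dataclass(frozen=True)
class Result:
    task: Task
    output: str


def plan(goal: str, parts: list[str]) -> list[Task]:
    """Planner: split one goal into independent tasks (hypothetical logic)."""
    return [Task(i, f"{goal}: {part}") for i, part in enumerate(parts)]


def work(task: Task) -> Result:
    """Worker: stand-in for an agent call; here it only echoes the task."""
    return Result(task, f"done[{task.description}]")


def judge(result: Result) -> bool:
    """Judge: accept or reject a worker's output (toy acceptance rule)."""
    return result.output.startswith("done[")


def run(goal: str, parts: list[str], max_workers: int = 4) -> list[Result]:
    """Run planner, then parallel workers, then keep only judged-good results."""
    tasks = plan(goal, parts)
    # Workers never coordinate with each other; only the planner hands out
    # work, which is what separates this from a flat peer-to-peer model.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        results = list(pool.map(work, tasks))  # map() preserves task order
    return [r for r in results if judge(r)]
```

Calling `run("browser", ["parser", "layout", "paint"])` fans three tasks out to the worker pool and returns the results the judge accepts. In this sketch, adding more agents only means raising `max_workers`, because workers share no state with one another.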
"AI 100" List Opens for Submissions: The AI Product "Annual Gala" Must Go On | 量子位智库
量子位 (QbitAI) · 2026-01-15 08:53
Core Viewpoint
- The article discusses the launch of the "AI 100" list by Quantum Bit Think Tank, aimed at recognizing and evaluating the most impactful AI products in China for 2025, highlighting the rapid evolution and potential of AI technologies in various sectors [4][12].
Group 1: AI 100 List Overview
- The "AI 100" list is divided into three main categories: "Flagship AI 100," "Innovative AI 100," and the top three products in ten popular sub-sectors [6].
- The "Flagship AI 100" will focus on the strongest AI products of 2025, showcasing those that have achieved significant technological breakthroughs and practical application value [7].
- The "Innovative AI 100" aims to identify emerging products in 2025 that have the potential to lead industry changes in 2026, representing cutting-edge AI technology [8].
Group 2: Sub-sector Focus
- The ten hottest sub-sectors for the top three products include AI browsers, AI agents, AI smart assistants, AI workstations, AI creation, AI education, AI healthcare, AI entertainment, Vibe Coding, and AI consumer hardware [9].
Group 3: Application and Evaluation Criteria
- The evaluation of the "AI 100" list employs a dual assessment system combining quantitative and qualitative measures, focusing on user data and expert evaluations to ensure objectivity and accuracy [13].
- Quantitative metrics include user scale, growth, activity, and retention, with over 20 specific indicators such as total downloads and active user numbers [13].
- Qualitative assessments consider long-term development potential, including underlying technology, market space, functionality, monetization potential, team background, and growth speed [13].