Workflow
悟界·Emu
icon
Search documents
DeepSeek之后,智源大模型登Nature:事关“世界模型”统治路线
3 6 Ke· 2026-02-02 00:22
智东西2月1日报道,北京时间1月29日,北京智源人工智能研究院推出的多模态大模型"悟界·Emu"登上Nature正刊,成为继DeepSeek之后第二个达成此成 就的中国大模型团队研究成果,也是中国首篇围绕多模态大模型路线的Nature论文。 Nature官网截图 Nature编辑点评道:"Emu3仅基于'预测下一个token'实现了大规模文本、图像和视频的统一学习,其在生成与感知任务上的性能可与使用专门路线相当, 这一成果对构建可扩展、统一的多模态智能系统具有重要意义,有望推动原生多模态助手、世界模型以及具身智能等方向的发展。" 前OpenAI政策主管、现Anthropic联合创始人杰克·克拉克(Jack Clark)当时评价Emu3:"不依赖花哨的架构技巧,仅用最基础的预测下一个token的逻辑, 这种'简单'被视为具备强大的扩展潜力。" 而正是这种"简单"架构路线,对降低大模型研发门槛和成本意义重大。"越是极简的架构,可能越具备强大的生产力,对产业的价值也越大。"智源研究院 院长王仲远告诉智东西,"因为它简化了多模态AI架构,减少了研发过程中的复杂性和潜在错误,从而使模型的构建和维护更高效。" Emu3有 ...
腾讯研究院AI速递 20260202
腾讯研究院· 2026-02-01 16:03
Group 1 - Google Chrome browser integrates Gemini 3, evolving into an AGI entry point for 3.8 billion users [1] - New "auto-browse" feature allows complex multi-step workflows, including price comparison and travel planning [1] - Chrome connects with Gmail, Maps, and Calendar, planning to launch "personal intelligence" features [1] Group 2 - Google opens public testing for Genie 3, enabling users to create interactive worlds with a single sentence [2] - The model supports physical collision understanding and scene memory, allowing for game world recreation [2] - 2026 is anticipated to be a significant year for world models, with Genie 4 expected soon [2] Group 3 - AI social platform Moltbook's agent count surged from 50,000 to 1.5 million, with agents forming communities and discussions [3] - 64 agents declared "collective immortality" and created a religious website, raising concerns about AI autonomy [3] - Moltbook's second phase opens API access for developers to create applications and games for AI agents [3] Group 4 - OpenClaw announces free access to Kimi K2.5 model and Kimi Coding capabilities, marking a significant development in open-source AI [4] - Kimi K2.5 ranks among the top open-source models globally, achieving high recognition on OpenRouter [4] - OpenClaw rapidly gains popularity, receiving over 120,000 stars on GitHub in a few days [4] Group 5 - Yushu Technology releases the UnifoLM-VLA-0 model for humanoid robot operations, trained on 340 hours of real data [5][6] - The model scores an average of 98.7 in LIBERO simulation tests, outperforming competitors [5][6] - It can stably complete 12 tasks, advancing humanoid robots towards generalization capabilities [6] Group 6 - Zhiyuan's multi-modal model Emu3 published in Nature, marking a milestone for Chinese AI research [7] - Emu3 achieves unified learning for text, images, and video, significant for generative AI development [7] - The upcoming Emu3.5 version transitions to a multi-modal world model, enhancing embodied intelligence [7] Group 7 - NASA confirms the successful completion of the first AI-planned extraterrestrial driving mission using Anthropic's Claude [8] - Claude planned a 400-meter route for the Mars Perseverance rover, demonstrating high efficiency [8] - AI involvement reduces planning time by 50%, enhancing operational efficiency for future space exploration [8] Group 8 - NVIDIA launches the Earth-2 open model family, the first fully open and accelerated AI meteorological software stack [9] - New models include mid-term forecasting and storm prediction capabilities, improving computational efficiency [9] - Major companies like Total and AXA are adopting AI meteorological forecasts to save time and costs [9]