多模态世界学习
Search documents
智源研究院发布2026十大AI技术趋势:“技术泡沫”是假命题
Xin Jing Bao· 2026-01-09 03:52
Core Insights - The Beijing Zhiyuan Artificial Intelligence Research Institute has released its predictions for the top ten AI technology trends for 2026, focusing on foundational models, AI applications, and key industries [1] Group 1: Foundational Models - The institute believes that world models will become a consensus direction for AGI, as high-quality text data is nearly exhausted. AI must learn not only language but also the rules governing the physical world, necessitating the processing of multimodal information such as images, sounds, time, and space [3] - In the realm of embodied intelligence, the number of companies has exceeded 230, but many exhibit homogeneity in their business models, potentially leading to industry "clearing." The introduction of world models may serve as a crucial technological anchor for the next stage of embodied intelligence [3] Group 2: Consumer Applications - The competition in consumer AI applications is becoming clearer, with a focus on "super applications" characterized by "All in One" functionality, moving beyond single-tool attributes to create a closed loop from information acquisition to task planning and problem-solving [3] - Despite the presence of major players in the general market, there are still opportunities for breakthroughs in high-barrier vertical fields such as health and education, where vertical applications demonstrate differentiated competitiveness [3] Group 3: Reasoning Capabilities - The institute asserts that the notion of a "technology bubble" is a false proposition, as reasoning optimization has not yet reached its ceiling. Progress in this area will remain a key factor supporting the large-scale application of AI in 2026 [4]
训练仍有巨大的Scaling空间!智源研究院王仲远:视频数据还未被充分利用 | MEET2026
Xin Lang Cai Jing· 2025-12-24 09:47
编辑部 整理自 MEET2026 量子位 | 公众号 QbitAI 全球互联网的文本数据已基本挖掘完毕,但视频数据还未被充分利用。 智源研究院的多模态世界模型悟界·Emu3.5,就是一个从视频中学习,而非仅依赖文本的大模型。 在量子位MEET2026智能未来大会上,北京智源人工智能研究院院长王仲远提到: 当前人工智能正处于第三次浪潮的关键拐点:大模型不仅推动AI从弱智能向通用智能跨越,更有望让机器人从1.0专用时代迈入2.0通用时代。 为此,智源研究院发布"悟界"系列大模型,锚定AI从数字世界进入物理世界的核心方向。 智源的Emu3.5与具身大脑全栈技术体系,就成为支撑这一技术演进趋势的两大基石。 MEET2026智能未来大会上,王仲远还说,要实现AI与物理世界的深度交互,需突破多模态理解与具身执行的核心技术瓶颈。 目前,悟界系列已在多模态学习范式、跨机器人本体适配等领域取得关键进展,且多项成果已开源开放,助力产业协同创新。 为了完整体现王仲远的思考,在不改变原意的基础上,量子位对演讲内容进行了编辑整理,希望能给你带来更多启发。 MEET2026智能未来大会是由量子位主办的行业峰会,近30位产业代表与会讨论。 ...
训练仍有巨大的Scaling空间!智源研究院王仲远:视频数据还未被充分利用 | MEET2026
量子位· 2025-12-24 07:20
编辑部 整理自 MEET2026 量子位 | 公众号 QbitAI 全球互联网的文本数据已基本挖掘完毕,但视频数据还未被充分利用。 智源研究院的多模态世界模型悟界·Emu3.5,就是一个从视频中学习,而非仅依赖文本的大模型。 智源的Emu3.5与具身大脑全栈技术体系,就成为支撑这一技术演进趋势的两大基石。 MEET2026智能未来大会上,王仲远还说,要实现AI与物理世界的深度交互,需突破多模态理解与具身执行的核心技术瓶颈。 目前,悟界系列已在多模态学习范式、跨机器人本体适配等领域取得关键进展,且多项成果已开源开放,助力产业协同创新。 为了完整体现王仲远的思考,在不改变原意的基础上,量子位对演讲内容进行了编辑整理,希望能给你带来更多启发。 在量子位MEET2026智能未来大会上,北京智源人工智能研究院院长 王仲远 提到: 当前人工智能正处于第三次浪潮的关键拐点:大模型不仅推动 AI从弱智能向通用智能跨越 , 更有望 让机器人从1.0专用时代迈入2.0通 用时代 。 为此,智源研究院发布"悟界"系列大模型,锚定 AI从数字世界进入物理世界 的核心方向。 MEET2026智能未来大会是由量子位主办的行业峰会,近30 ...