原生多模态世界模型Emu3

Search documents
刚刚,智源全新「悟界」系列大模型炸场!AI第一次真正「看见」宏观-微观双宇宙
机器之心· 2025-06-06 09:36
Core Viewpoint - The article discusses the advancements in AI technology, particularly focusing on the launch of the "Wujie" series of large models by Zhiyuan Institute, which signifies a shift from digital to physical world modeling and understanding at both macro and micro levels [4][8][40]. Group 1: AI Advancements and Trends - The AI field remains vibrant and rapidly evolving, with significant developments in reinforcement learning and various AI domains such as intelligent agents and multimodal models [2][3]. - The annual Zhiyuan Conference showcased insights from leading experts, including Turing Award winners, on the future paths of AI [3]. - The "Wujie" series represents a new phase in large model exploration, focusing on bridging the gap between virtual and physical worlds [4][7]. Group 2: "Wujie" Series Features - The "Wujie" series includes several key models: Emu3 (multimodal world model), Brainμ (brain science model), RoboOS 2.0 (embodied intelligence framework), and OpenComplex2 (microscopic life model) [6][15][34]. - Emu3 is the first native multimodal world model, integrating various modalities like text, images, and brain signals into a unified representation [14]. - Brainμ is a groundbreaking model in brain science, capable of processing over 1 million neural signal data units and supporting various neuroscience tasks [15][19]. Group 3: Embodied Intelligence Development - The embodied intelligence sector has become a strategic focus, with the introduction of RoboOS 2.0 and RoboBrain 2.0, which enhance the capabilities of embodied AI systems [20][22]. - RoboOS 2.0 introduces a user-friendly framework for developers, significantly reducing the complexity of deploying robotic systems [24]. - RoboBrain 2.0 is noted for its superior performance in task planning and spatial reasoning, achieving a 74% improvement in task planning accuracy compared to its predecessor [27]. Group 4: Microscopic Life Modeling - OpenComplex2 marks a significant advancement in modeling microscopic life, capable of predicting static and dynamic structures of biological molecules [34][38]. - The model has demonstrated its effectiveness by successfully predicting protein structures in a competitive evaluation, showcasing its potential in life sciences [36]. - OpenComplex2 aims to revolutionize drug discovery and biological research by providing a new modeling pathway for understanding molecular dynamics [38]. Group 5: Future Directions - The "Wujie" series reflects a strategic upgrade in AI paradigms, emphasizing the importance of modeling the physical world and integrating various AI domains [40]. - The future of large models is expected to extend beyond traditional applications, influencing systems that understand and change the world [41].
智源发布“悟界”系列大模型,宣布围绕物理AGI进行布局
Xin Lang Ke Ji· 2025-06-06 02:51
Group 1 - The core viewpoint of the news is the launch of the "Wujie" large model series by the Beijing Zhiyuan Artificial Intelligence Research Institute, focusing on advancements in physical AGI and breaking the boundaries between virtual and real worlds [1] - The "Wujie" series includes four models: Emu3, Brainμ, RoboBrain 2.0, and OpenComplex2, each targeting different aspects of multi-modal learning and applications in neuroscience [1] - Emu3, set to be released in October 2024, utilizes a novel token prediction paradigm for unified multi-modal learning, allowing for the encoding of images/videos into discrete symbol sequences that are isomorphic to text [1] Group 2 - Brainμ is built on the Emu3 architecture and tokenizes brain signals from various neuroscience modalities, enabling multi-directional mapping between brain signals and other modalities like text and images [2] - The model has been pre-trained on over 1 million units of neural signals and aims to support a wide range of applications in neuroscience, from basic research to clinical studies and brain-computer interfaces [2] - Collaborations with leading neuroscience laboratories and research teams in China, including institutions like Tsinghua University and Peking University, are being established to expand the scientific and industrial applications of Brainμ [2]