Spatial Intelligence
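One year after founding her startup, Fei-Fei Li launches Marble, her first commercially available world model, which generates 3D worlds from any modality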
Founder Park· 2025-11-13 14:06
Core Insights
- World Labs has officially launched its first commercial generative multimodal world model product, Marble, which supports a wider range of input modalities than its earlier preview version [2][4]
- The concept of "spatial intelligence" introduced by Fei-Fei Li is highlighted as the next frontier in AI, emphasizing its importance in transforming how humans create and interact with both real and virtual worlds [25][29]

Product Features
- Marble allows users to generate a complete 3D world from various inputs such as images, text, or videos, enabling detailed and rich 3D environments [5][10]
- The platform offers both a free version and a Pro version tailored for professionals in fields like game development, film effects, architecture, and robotics [8]
- Key capabilities of Marble include:
  - Multimodal 3D world generation based on user inputs [10]
  - Support for multiple image inputs to create cohesive 3D spaces [13]
  - Advanced editing tools for fine-tuning generated worlds, including local adjustments and global style changes [18][20]
  - An experimental tool called Chisel for advanced users to manipulate the spatial layout of the generated worlds [21]
  - Options to expand and combine worlds for larger, more complex environments [22][26]
  - Export capabilities in various formats for use in professional software and platforms [23][24]

Importance of Spatial Intelligence
- Spatial intelligence is deemed crucial for AI's evolution, as it will reshape various fields such as storytelling, creativity, robotics, and scientific discovery [29][40]
- Current AI models, while strong in language processing, lack a robust understanding of the physical world, which limits their practical applications [30][38]
- The development of spatial intelligence can lower the barriers for creating 3D environments, enabling non-professionals to build and experience virtual worlds [41]
- It is also essential for advancing embodied intelligence in robotics, allowing machines to interact safely and effectively with the physical world [41]
- The potential applications of spatial intelligence extend to scientific research, healthcare, and education, enabling simulations and explorations beyond human perceptual capabilities [42][43]
Fei-Fei Li's world model is here: generate a 3D world from a single sentence, and AI is truly beginning to understand reality
36Ke · 2025-11-13 11:42
Core Insights
- The launch of Marble by World Labs marks the first public product in the realm of world models, showcasing significant advancements in spatial intelligence and AI capabilities [1][2][3]

Group 1: Marble's Core Capabilities
- Marble features three main capabilities: multimodal generation, AI-native world editing, and a practical production workflow [1]
- It can reconstruct a complete 3D world from various inputs, including text, images, and videos, allowing for a seamless creative process [4][7]
- Users can edit generated worlds much as they would real scenes, enabling continuous refinement and expansion of 3D environments [13][14]

Group 2: Applications and Integration
- Marble allows generated worlds to be exported in various formats compatible with industry-standard tools like Unreal, Unity, and Blender, facilitating integration into game and film production workflows [15][17]
- The platform supports high-quality rendering and video generation, enhancing the usability of created worlds in real-world applications [18][19]

Group 3: Theoretical Foundations and Future Implications
- The development of Marble is rooted in the concept of spatial intelligence, which is essential for AI to interact with the physical world [20][21]
- A mature world model must possess generative, multimodal, and interactive capabilities, which are foundational for future advancements in robotics and scientific research [22][23][24]
- Marble's release signifies a step towards achieving comprehensive spatial intelligence, paving the way for future applications in automation and simulation [27]
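Former DeepSeek core member Luo Fuli takes center stage at Xiaomi; Lei Jun was rumored to have recruited her with an annual salary in the ten-million-yuan range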
程序员的那些事· 2025-11-13 11:24
Core Insights
- Luo Fuli has officially joined Xiaomi as the head of the MiMo team, marking a significant step in the company's AI ambitions [1][3]
- The evolution of intelligence is transitioning from the language domain to the physical world, aiming to unlock multi-modal spatial intelligence, which is crucial for achieving true Artificial General Intelligence (AGI) [4]

Timeline and Background
- Rumors about Luo Fuli joining Xiaomi surfaced last year, with reports indicating she was recruited by Lei Jun with a salary of tens of millions [5][10]
- Key dates include the launch of DeepSeek-V3 on December 25, followed by media reports of Xiaomi assembling a GPU cluster [6][7]
- On December 31, Lei Jun publicly shared Xiaomi's ambitions in AI during a New Year's live stream [8]

Luo Fuli's Profile
- Luo Fuli holds a Bachelor's degree in Computer Science from Beijing Normal University and a Master's in Computational Linguistics from Peking University, with numerous publications in top NLP conferences [15]
- She has worked at Alibaba's DAMO Academy and DeepSeek, contributing to the development of various deep learning models [17]
- Her academic work has garnered over 11,000 citations, with approximately 8,000 citations added in the past year alone [18]

Xiaomi's AI Strategy
- The MiMo project is central to Xiaomi's efforts in advancing large model research, with a focus on spatial intelligence [24]
- Spatial intelligence aims to bridge the gap between information AI and physical AI, aligning with Xiaomi's ecosystem of people, vehicles, and homes [26]
Luo Fuli takes center stage at Xiaomi in her first official announcement since leaving DeepSeek
36氪· 2025-11-13 10:26
Core Viewpoint
- The article highlights the appointment of Luo Fuli as the head of Xiaomi's MiMo team, focusing on advancing spatial intelligence as a key step towards achieving Artificial General Intelligence (AGI) [1][3][23].

Group 1: Appointment and Background
- Luo Fuli officially announced her role at Xiaomi on November 12, leading the MiMo team [1].
- She previously worked at DeepSeek and was reportedly recruited by Lei Jun with a salary of tens of millions [4][7].
- Luo has a strong academic background, with over 11,000 citations for her research papers, indicating her prominence in the field [17][18].

Group 2: MiMo Team and Objectives
- The MiMo team aims to unlock multi-modal spatial intelligence, which includes perception, reasoning, generation, and action capabilities [4][23].
- Luo's vision aligns with the broader goal of integrating information AI with physical AI, creating a seamless connection between the digital and physical worlds [25].

Group 3: Industry Context and Implications
- The concept of spatial intelligence has gained attention, with AI experts like Fei-Fei Li discussing its significance for embodied intelligence and AGI [24].
- Xiaomi's focus on spatial intelligence is seen as a strategic move, leveraging its ecosystem that includes people, vehicles, and homes [25].
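Build dreams this weekend! Fei-Fei Li's world model officially opens to the public, with upgraded capabilities and a free version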
机器之心· 2025-11-13 08:26
Core Insights
- The article discusses the launch of Marble, a multi-modal generative world model developed by World Labs, Fei-Fei Li's spatial intelligence company, which is now available for public use, allowing users to create 3D worlds easily [3][4].

Features and Capabilities
- Marble has undergone significant upgrades since its preview release, now supporting more generation methods, deeper editing capabilities, and a wider range of output formats, making it suitable for professional applications such as game development, film effects, architectural design, and robotic simulation [4].
- The platform offers both a free version and a membership version, which differ in the number of worlds that can be generated, the range and depth of editing features, and commercial licensing [6].

Multi-Modal Input
- Marble's core upgrade is its extensive multi-modal capability, allowing users to input various types of information, including multiple images, to create more refined 3D worlds [7][12].
- Users can provide different reference images for different areas of the world, producing a more cohesive 3D space [7].

Editing and Iteration
- Marble allows for iterative creation: users can modify a world after it is generated, including object removal, local adjustments, and structural reconfigurations [12][20].
- The platform supports input from multiple real-world photos or short video clips to inspire virtual world creation, with seamless transitions between views [14].

Expansion and Detail Enhancement
- Users can expand specific areas of the generated world to fill in missing details and enhance clarity, particularly in areas that were less well defined during initial generation [23][24].
- The platform also allows multiple worlds to be combined based on user-defined relationships, facilitating the construction of larger spaces [25].

Output and Rendering
- Marble enables users to export created worlds in various formats, including high-fidelity Gaussian Splat representations and triangle meshes, ensuring compatibility with industry-standard tools [27][28].
- Users can render worlds as videos with pixel-level control over camera movements and pacing, enhancing the creative process [31].

Collaborative Exploration
- The company has launched Marble Labs to collaborate with artists, designers, and engineers to explore new creative paradigms and best practices [36].
- Marble is positioned as a step towards "spatial intelligence," with future plans to enhance interactivity and expand applications in simulation and robotics [37].
星源智's T5 domain controller debuts at the Baidu World conference, powering the 智元精灵 G2 and opening a new era for robots
Zheng Quan Ri Bao Wang· 2025-11-13 06:11
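(Reporter Yuan Chuanxi) On November 13, at Baidu World 2025, 北京星源智机器人科技有限公司 (hereinafter "星源智") made a high-profile appearance with its self-developed T5, a domain controller for the robot's "large and small brain" (大小脑). 智元机器人 and 星源智 have entered a deep partnership and are exhibiting jointly. The 智元精灵 G2, a new-generation industrial-grade interactive embodied working robot that carries the controller and was released in October this year, appeared at the booth at the same time, showcasing its capabilities in full and letting visitors experience up close the potential of this technical breakthrough to transform the industry. According to the report, 星源智 was incubated by 北京智源研究院 and is committed to realizing multimodal spatial intelligence and building a general-purpose embodied brain for the physical world. The company says it has world-leading embodied multimodal large model and spatial intelligence capabilities, and it has built an extensive embodied technology stack that includes multimodal perception and digital reconstruction of the physical world, multimodal embodied world models, motion control of the robot body, embodied foundation models for perception, planning, decision-making, navigation, and manipulation, and integrated hardware-software on-device embodied systems. The T5 is described as a domain control platform that combines very high compute, low power consumption, and high performance. It carries NVIDIA's latest Jetson Thor processor, delivers up to 2070 TFLOPS of compute, can accelerate large Transformer models, and broadly supports advanced algorithms such as deep learning and computer vision, meeting robots' needs for real-time perception, intelligent decision-making, and precise control. ...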
Fei-Fei Li's 3D world model enters public beta, and users are already going wild
量子位· 2025-11-13 05:38
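Wen Le, reporting from Aofeisi. 量子位 | WeChat official account QbitAI. Just today, Fei-Fei Li released a brand-new world model and opened it for public beta, so anyone can try it. The model is called Marble, a new 3D world generation model from World Labs, the company Fei-Fei Li founded, and it hits the accelerator for the world model field: anyone can easily get a 3D world of their own. "Creativity is intelligence having fun." No professional modeling team is needed; ordinary people can use text, photos, or even short videos to generate editable, downloadable 3D worlds of their own, and can also experience them immersively in VR. Within a short while, people were already flooding feeds with their own 3D worlds. Users are already going wild: with one click you can "travel" from a pixel world into a colorful art alley, where the street is paved with brick and stone, the shops on both sides have chalkboard menus at their doors, and a bicycle leans by the roadside, giving a relaxed, everyday neighborhood feel. Or a cyberpunk world where futuristic and retro styles intertwine. A sleeping pod with mechanical elements. A European-style courtyard full of natural atmosphere. Marble also has built-in AI-native world editing tools. Edits can be small and local, such as removing an object or touching up an area, or they can be sweeping: swapping objects, changing the visual style, or restructuring large regions of the world. In short, Marble has been well received since it opened: it is very simple to use. According to the official technical report, Marble can generate 3D worlds not only from short text prompts and single-image prompts, but also from multiple images, different view ...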
"Godmother of AI" Fei-Fei Li releases her first commercial world model
第一财经· 2025-11-13 02:15
Core Insights
- World Labs, founded by AI expert Fei-Fei Li, launched its first product, Marble, which is supported by a multimodal world model designed to create high-fidelity, persistent 3D environments from various inputs [2][5]
- Marble offers a freemium model with four subscription tiers, ranging from a free version to a premium version priced at $95 per month, allowing for extensive generation capabilities [5]
- Fei-Fei Li emphasizes the importance of spatial intelligence as the next frontier in AI, arguing that current AI models lack a true understanding of the physical world [6][8]

Product Features
- Marble supports large-scale multimodal input and includes a creative center called Marble Labs, enhancing user experience [5]
- The product differentiates itself by generating persistent 3D environments that can be exported in various formats, reducing scene distortion and inconsistency [5]
- The real-time model RTFM can run on a single H100 GPU, but Marble's unique selling point is its ability to create downloadable 3D worlds [5]

Market Position
- Marble is the first commercially available product in the world model space, while competitors like Google's Genie and others are still in limited preview or demo stages [8]
- The overall interaction quality of Marble has been positively reviewed, although there is room for improvement in detail precision [8]

Future Outlook
- In the short term, spatial intelligence is expected to empower creativity in industries such as film, gaming, and architecture [8]
- Mid-term implications include advancements in embodied intelligent robotics, enhancing collaboration in domestic and laboratory settings [8]
- Long-term potential includes revolutionary applications in science, healthcare, and education through simulation and immersive learning experiences [8]

Company Growth
- World Labs has raised approximately $230 million in funding, achieving a valuation exceeding $1 billion, making it a new unicorn in the AI sector [9]
- The company's investors include prominent firms such as a16z, Radical Ventures, NVIDIA NVentures, AMD Ventures, and Intel Capital [9]
- Future plans involve deepening the understanding of three-dimensionality and physicality, with aspirations to integrate augmented reality and robotics [9]
"Godmother of AI" Fei-Fei Li releases her first commercial world model; spatial intelligence draws closer
Di Yi Cai Jing· 2025-11-13 01:37
Core Insights
- World Labs, founded by AI expert Fei-Fei Li, launched its first product, Marble, which is supported by a multimodal world model designed to create high-fidelity, persistent 3D worlds from a single image, video, or text prompt [1][4].

Product Features
- Marble has expanded its functionality since its preview release two months ago, now supporting large-scale multimodal input and introducing Marble Labs as a creative center [4].
- The product offers four subscription tiers, including a free version with 4 generations limited to text and image input, a standard version at $20/month with multi-image and video input, and a premium version at $95/month allowing 75 generations and full feature access [4].
- Unlike competitors, Marble generates persistent, downloadable 3D environments rather than dynamically generated worlds, significantly reducing scene distortion and inconsistency [4].

Industry Context
- Fei-Fei Li argues that current AI models, primarily large language models, lack a true understanding of the physical world, which is essential for achieving genuine machine intelligence [5].
- The concept of spatial intelligence is highlighted as a key breakthrough for AI, enabling machines to understand and interact with the three-dimensional world [5].
- Competitors like Google and Decart are still in the research or demo phase, making Marble the first commercially available product in the world model space [5].

Future Outlook
- In the short term, spatial intelligence is expected to empower creativity in industries such as film, gaming, and architecture by providing tools for rapid 3D environment generation [6].
- In the medium term, it may drive the development of embodied intelligent robots, enhancing their role as collaborators in various settings [6].
- Long-term implications include potential revolutions in science, healthcare, and education through simulations and immersive learning experiences [6].

Company Growth
- World Labs has raised approximately $230 million, achieving a valuation exceeding $1 billion, making it a new unicorn in the AI sector [6].
- The company's investors include prominent firms such as a16z, Radical Ventures, NVIDIA NVentures, AMD Ventures, and Intel Capital [6].
- Future plans involve focusing on models that deeply understand three-dimensionality, physicality, and concepts of space and time, with aspirations to support augmented reality and robotics [6].
"Godmother of AI" Fei-Fei Li releases her first commercial world model, bringing spatial intelligence closer
Di Yi Cai Jing· 2025-11-13 01:31
Core Insights
- World Labs, founded by AI expert Fei-Fei Li, launched its first product, Marble, which is described as the foundation for building a spatially intelligent future [1][4]
- Marble utilizes a multi-modal world model to create high-fidelity, persistent 3D environments from a single image, video, or text prompt [1][4]
- The product is now publicly available with expanded features, including a freemium model and four subscription tiers, ranging from a free version to a flagship version priced at $95 per month [4]

Product Features
- Marble supports large-scale multi-modal input and includes a creative center called Marble Labs [4]
- The subscription options include a free version with limited capabilities and paid versions that allow for more extensive generation and advanced editing [4]
- Unlike competitors, Marble generates persistent, downloadable 3D environments, reducing scene distortion and inconsistency [4][5]

Industry Context
- Fei-Fei Li argues that spatial intelligence is crucial for achieving true machine intelligence, as it allows for a comprehensive understanding of the physical world [5]
- Other companies, such as Google, are also exploring world models, but Marble is the first commercially available product in this space [5]
- Industry evaluations indicate that Marble's interaction quality is strong, though its detail precision has room for improvement [5]

Future Implications
- In the short term, spatial intelligence is expected to empower creativity in industries like film, gaming, and architecture [5][6]
- In the medium term, it may drive the development of embodied intelligent robots for collaboration in various settings [6]
- In the long term, spatial intelligence could revolutionize fields such as science, healthcare, and education through enhanced simulations and immersive learning experiences [6]

Company Growth
- World Labs has raised approximately $230 million, achieving a valuation exceeding $1 billion, making it a new unicorn in the AI sector [6]
- The company's investors include prominent firms such as a16z, Radical Ventures, NVIDIA NVentures, AMD Ventures, and Intel Capital [6]
- Future plans involve focusing on models that deeply understand three-dimensionality, physicality, and concepts of space and time, with aspirations to support augmented reality and robotics [6]