Workflow
空间智能
icon
Search documents
周末来造梦!李飞飞世界模型正式开放,能力升级,有免费版
机器之心· 2025-11-13 08:26
Core Insights - The article discusses the launch of Marble, a multi-modal generative world model developed by Fei-Fei Li's "Spatial Intelligence" team, which is now available for public use, allowing users to create 3D worlds easily [3][4]. Features and Capabilities - Marble has undergone significant upgrades since its preview release, now supporting more generation methods, deeper editing capabilities, and a wider range of output formats, making it suitable for various professional applications such as game development, film effects, architectural design, and robotic simulation [4]. - The platform offers both a free version and a membership version, with differences in the number of worlds that can be generated, the range and depth of editing features, and commercial licensing [6]. Multi-Modal Input - Marble's core upgrade is its heavy multi-modal capability, allowing users to input various types of information, including multiple images, to create more refined 3D worlds [7][12]. - Users can provide different reference images for various areas of the world, enabling a more cohesive 3D space [7]. Editing and Iteration - Marble allows for iterative creation, where users can modify the generated world post-creation, including object removal, local adjustments, and structural reconfigurations [12][20]. - The platform supports input from multiple real-world photos or short video clips to inspire virtual world creation, with seamless transitions between views [14]. Expansion and Detail Enhancement - Users can expand specific areas of the generated world to fill in missing details and enhance clarity, particularly in areas that may have been less defined during initial generation [23][24]. - The platform also allows for the combination of multiple worlds based on user-defined relationships, facilitating the construction of larger spaces [25]. Output and Rendering - Marble enables users to export created worlds in various formats, including high-fidelity Gaussian Splat representations and triangle meshes, ensuring compatibility with industry-standard tools [27][28]. - Users can render worlds as videos with pixel-level control over camera movements and pacing, enhancing the creative process [31]. Collaborative Exploration - The company has launched Marble Labs to collaborate with artists, designers, and engineers to explore new creative paradigms and best practices [36]. - Marble is positioned as a step towards "spatial intelligence," with future plans to enhance interactivity and expand applications in simulation and robotics [37].
星源智T5域控制器亮相百度大会 赋能智元精灵G2开启机器人新纪元
Zheng Quan Ri Bao Wang· 2025-11-13 06:11
Core Insights - Baidu World Conference 2025 showcased the T5 domain controller developed by Beijing Xingyuan Intelligent Robot Technology Co., Ltd, highlighting its advanced capabilities in robotics [1] Company Overview - Xingyuan Intelligent focuses on multi-modal spatial intelligence and aims to create a universal embodied brain for the physical world [1] - The company was incubated by the Beijing Academy of Artificial Intelligence and possesses leading capabilities in embodied multi-modal large models and spatial intelligence [1] Product Details - The T5 controller features high computational power of 2070 TFLOPS, low power consumption, and high performance, supporting advanced algorithms like deep learning and computer vision [1] - The T5 is equipped with NVIDIA's latest Jetson Thor processor, enhancing its ability to meet the demands of real-time perception, intelligent decision-making, and precise control in robotics [1] Collaboration and Industry Impact - A deep collaboration has been established between Zhiyuan Robotics and Xingyuan Intelligent, showcasing the new generation industrial interactive embodied robot, Zhiyuan Spirit G2, at the conference [1] - The exhibition demonstrated the potential industry transformation brought by the technological breakthroughs represented by the T5 controller [1]
李飞飞3D世界模型公测,网友已经玩疯了
量子位· 2025-11-13 05:38
Core Insights - The article discusses the launch of a new 3D world generation model called Marble, developed by World Lab, founded by Fei-Fei Li, which is now open for public testing [1][3][34] - Marble allows users to easily create personalized 3D worlds using text, photos, or short videos, significantly lowering the barrier for entry in 3D modeling [4][15][35] Group 1: Features and Functionality - Marble can generate 3D worlds from simple text prompts or single images, and it supports multiple images from different angles to create a cohesive environment [17][35] - Users can customize their 3D spaces by uploading multiple images to define layouts and can edit elements within the generated worlds, such as removing objects or changing styles [19][21] - The platform includes an AI-native world editing tool, allowing for both minor and extensive modifications to the created environments [21][33] Group 2: Export and Compatibility - Users can export their created worlds in two formats: Gaussian point cloud for high fidelity rendering and triangle mesh for compatibility with various industry-standard tools [29] - The generated 3D worlds can also be rendered into videos, which can be enhanced with additional details and dynamic elements [31] Group 3: Future Developments - Marble aims to enhance interactivity in future updates, allowing users to not only create but also interact with elements within their 3D worlds [36][37] - The development team emphasizes that the current features are just the foundation, with plans to incorporate real-time interactions in the generated environments [36][37]
“AI教母”李飞飞发布首款商用世界模型
第一财经· 2025-11-13 02:15
Core Insights - World Labs, founded by AI expert Fei-Fei Li, launched its first product, Marble, which is supported by a multimodal world model designed to create high-fidelity, persistent 3D environments from various inputs [2][5] - Marble offers a freemium model with four subscription tiers, ranging from a free version to a premium version priced at $95 per month, allowing for extensive generation capabilities [5] - Fei-Fei Li emphasizes the importance of spatial intelligence as the next frontier in AI, arguing that current AI models lack a true understanding of the physical world [6][8] Product Features - Marble supports large-scale multimodal input and includes a creative center called Marble Labs, enhancing user experience [5] - The product differentiates itself by generating persistent 3D environments that can be exported in various formats, reducing scene distortion and inconsistency [5] - The real-time model RTFM can run on a single H100 GPU, but Marble's unique selling point is its ability to create downloadable 3D worlds [5] Market Position - Marble is the first commercially available product in the world model space, while competitors like Google's Genie and others are still in limited preview or demo stages [8] - The overall interaction quality of Marble has been positively reviewed, although there is room for improvement in detail precision [8] Future Outlook - In the short term, spatial intelligence is expected to empower creativity in industries such as film, gaming, and architecture [8] - Mid-term implications include advancements in embodied intelligent robotics, enhancing collaboration in domestic and laboratory settings [8] - Long-term potential includes revolutionary applications in science, healthcare, and education through simulation and immersive learning experiences [8] Company Growth - World Labs has raised approximately $230 million in funding, achieving a valuation exceeding $1 billion, making it a new unicorn in the AI sector [9] - The company’s investors include prominent firms such as a16z, Radical Ventures, NVIDIA NVentures, AMD Ventures, and Intel Capital [9] - Future plans involve deepening the understanding of three-dimensionality and physicality, with aspirations to integrate augmented reality and robotics [9]
“AI教母”李飞飞发布首款商用世界模型 空间智能更近了
Di Yi Cai Jing· 2025-11-13 01:37
Core Insights - World Labs, founded by AI expert Fei-Fei Li, launched its first product, Marble, which is supported by a multimodal world model designed to create high-fidelity, persistent 3D worlds from a single image, video, or text prompt [1][4]. Product Features - Marble has expanded its functionalities since its preview release two months ago, now supporting large-scale multimodal input and introducing Marble Labs as a creative center [4]. - The product offers four subscription tiers: a free version with 4 generations limited to text and image input, a standard version at $20/month with multi-image and video input, and a premium version at $95/month allowing 75 generations and full feature access [4]. - Unlike competitors, Marble generates persistent, downloadable 3D environments rather than dynamically generated worlds, significantly reducing scene distortion and inconsistency [4]. Industry Context - Fei-Fei Li argues that current AI models, primarily large language models, lack a true understanding of the physical world, which is essential for achieving genuine machine intelligence [5]. - The concept of spatial intelligence is highlighted as a key breakthrough for AI, enabling machines to understand and interact with the three-dimensional world [5]. - Competitors like Google and Decart are still in the research or demo phase, making Marble the first commercially available product in the world model space [5]. Future Outlook - In the short term, spatial intelligence is expected to empower creativity in industries such as film, gaming, and architecture by providing tools for rapid 3D environment generation [6]. - In the medium term, it may drive the development of embodied intelligent robots, enhancing their role as collaborators in various settings [6]. - Long-term implications include potential revolutions in science, healthcare, and education through simulations and immersive learning experiences [6]. Company Growth - World Labs has raised approximately $230 million, achieving a valuation exceeding $1 billion, making it a new unicorn in the AI sector [6]. - The company’s investors include prominent firms such as a16z, Radical Ventures, NVIDIA NVentures, AMD Ventures, and Intel Capital [6]. - Future plans involve focusing on models that deeply understand three-dimensionality, physicality, and concepts of space and time, with aspirations to support augmented reality and robotics [6].
“AI教母”李飞飞发布首款商用世界模型,空间智能更近了
Di Yi Cai Jing· 2025-11-13 01:31
Core Insights - World Labs, founded by AI expert Fei-Fei Li, launched its first product, Marble, which is described as the foundation for building a spatially intelligent future [1][4] - Marble utilizes a multi-modal world model to create high-fidelity, persistent 3D environments from a single image, video, or text prompt [1][4] - The product is now publicly available with expanded features, including a freemium model and four subscription tiers, ranging from a free version to a flagship version priced at $95 per month [4] Product Features - Marble supports large-scale multi-modal input and includes a creative center called Marble Labs [4] - The subscription options include a free version with limited capabilities and paid versions that allow for more extensive generation and advanced editing [4] - Unlike competitors, Marble generates persistent, downloadable 3D environments, reducing scene distortion and inconsistency [4][5] Industry Context - Fei-Fei Li argues that spatial intelligence is crucial for achieving true machine intelligence, as it allows for a comprehensive understanding of the physical world [5] - Other companies, such as Google, are also exploring world models, but Marble is the first commercially available product in this space [5] - The industry evaluation indicates that while Marble's interaction effects are strong, there is room for improvement in detail precision [5] Future Implications - In the short term, spatial intelligence is expected to empower creativity in industries like film, gaming, and architecture [5][6] - Mid-term, it may drive the development of embodied intelligent robots for collaboration in various settings [6] - Long-term, spatial intelligence could revolutionize fields such as science, healthcare, and education through enhanced simulations and immersive learning experiences [6] Company Growth - World Labs has raised approximately $230 million, achieving a valuation exceeding $1 billion, making it a new unicorn in the AI sector [6] - The company’s investors include prominent firms such as a16z, Radical Ventures, NVIDIA NVentures, AMD Ventures, and Intel Capital [6] - Future plans involve focusing on models that deeply understand three-dimensionality, physicality, and concepts of space and time, with aspirations to support augmented reality and robotics [6]
锦秋基金被投企业流形空间3个月融资亿元,证明世界模型也需要预训练 |Jinqiu Spotlight
锦秋集· 2025-11-12 12:44
Core Insights - The article discusses the emergence and potential of world models in AI, particularly focusing on the company Manifold AI and its CEO Wu Wei's vision for developing a robust world model that can understand and predict the physical world [7][10][22]. Investment and Company Overview - Jinqiu Fund has invested in Manifold AI, which has quickly raised over 100 million in seed and angel rounds within three months of its establishment [4][6]. - Jinqiu Fund emphasizes a long-term investment philosophy, seeking breakthrough technologies and innovative business models in general artificial intelligence startups [5]. Technology and Market Trends - The concept of world models is gaining traction, with significant discussions in Silicon Valley about their capabilities, including generative, multimodal, and interactive features [8][9]. - Wu Wei argues that world models can provide superior predictive capabilities compared to Vision-Language-Action (VLA) models, which are limited by their reliance on past experiences [18][22]. Technical Development and Challenges - The development of world models is still in its early stages, with various approaches being explored, including explicit physical modeling and latent space interaction [25][30]. - Manifold AI aims to create a "bodily world model" that can transfer and unify across different scales, contrasting with the top-down strategies of many international teams [33]. Strategic Focus and Market Positioning - Manifold AI prioritizes the robotics and drone sectors over autonomous driving due to the fragmented nature of these markets, which allows for more opportunities for innovation [43][44]. - The company is focused on enabling hardware to possess autonomous reasoning capabilities, moving away from human-controlled operations [46]. Future Goals and Product Development - The company plans to release its first generation of base models based on the World Model Architecture (WMA) by late 2025 to early 2026, aiming to drive advancements in Physical AI Agents [51]. - Wu Wei emphasizes the importance of pre-training models to understand physical world dynamics, which can reduce deployment costs significantly [37][40].
李飞飞揭大模型“死穴”:不会空间智能,再能聊也是纸上谈兵
3 6 Ke· 2025-11-12 11:47
当科技界仍深陷于大模型"参数内卷"时,斯坦福大学教授、World Labs联合创始人李飞飞教授指向了一个更本质的瓶颈:当前AI被困在由文本和 二维图像构成的"扁平世界"里,它与我们生活其中的、立体的、受物理规律支配的现实严重脱节。 11月11日,在她刷屏的一篇长文中,李飞飞鲜明指出,空间智能,正是打破这层认知隔膜的关键。它不仅代表了人工智能演进的下一个前沿,更 是AI真正融入物理世界、从"对话工具"蜕变为"行动伙伴"的转折点。 本文梳理了李飞飞在这篇长文中对于空间智能的技术路径与应用前景系统阐述,并结合多位产业实践者的洞察,共同展望这一变革性力量将如何 重塑人机关系与产业生态。 从语言到世界,空间智能是AI的破晓之光 当前人工智能,特别是生成式AI已在创意、效率与沟通方面深刻改变了世界。 然而,李飞飞指出,当前AI在诸多关键领域应用的宏伟愿景还远未实现。自主机器人的发展尚未走出实验室与特定场景,其"融入日常生活"的愿 景仍停留于概念推演; 在科学研究中,AI虽展现出潜力,但距离真正实现疾病诊疗、新材料研发与基础物理探索的效率革命,仍有相当距离; 而在创意赋能方面,无论是辅助学生理解复杂抽象概念、支持建筑师进行 ...
罗福莉C位亮相小米,离职DeepSeek后首次官宣
量子位· 2025-11-12 08:01
Core Insights - Luo Fuli has officially announced her position at Xiaomi, leading the MiMo team to advance the development of multi-modal spatial intelligence, a key step towards achieving Artificial General Intelligence (AGI) [1][3][7] Group 1: Background and Context - Rumors about Luo Fuli joining Xiaomi surfaced at the end of last year, with reports indicating that she was recruited by Lei Jun with a salary of tens of millions [4][10] - Significant events include the launch of DeepSeek-V3 on December 25, followed by media reports of Xiaomi assembling a GPU cluster [5][6] - Luo Fuli's name appeared in Xiaomi's AI team papers as an independent researcher prior to her official announcement [11][20] Group 2: Luo Fuli's Profile - Luo Fuli holds a Bachelor's degree in Computer Science from Beijing Normal University and a Master's degree in Computational Linguistics from Peking University, with numerous publications in top NLP conferences [15][17] - She has over 11,000 citations for her academic papers, with approximately 8,000 citations added in the current year alone [18] - Luo previously worked at Alibaba's DAMO Academy and DeepSeek, contributing to the development of various deep learning models [17] Group 3: Xiaomi's AI Ambitions - Xiaomi aims to enter the deep waters of AI following the establishment of its automotive business, with a focus on spatial intelligence [9][24] - The concept of spatial intelligence, as articulated by Luo Fuli, involves bridging the gap between information AI and physical AI, which aligns with Xiaomi's ecosystem of people, vehicles, and homes [23][25]
巴菲特宣告“谢幕”:年底卸任CEO,将加快捐赠速度|首席资讯日报
首席商业评论· 2025-11-12 05:15
Group 1 - Warren Buffett announced his retirement as CEO of Berkshire Hathaway by the end of this year, indicating a shift in management and an acceleration in his charitable donations [2] - SoftBank Group reported a net profit of 2.50 trillion yen for the second quarter, with net sales of 1.92 trillion yen [3] - The lawsuit result for "Chai Dui Dui" case revealed that the defendants must cease infringement and pay a total of 2.6 million yuan in compensation to Pang Donglai [4] Group 2 - AMD completed the acquisition of AI inference startup MK1, integrating its team into AMD's AI division to enhance software innovation [5] - A merger training conference was held in Shenzhen, discussing policies and practical paths for mergers and acquisitions, with over 180 representatives from various sectors attending [6] - In October, China's new energy vehicle sales exceeded 50% of total new car sales for the first time, indicating strong growth in the sector [7] Group 3 - Beijing has completed its housing construction tasks for 2025, including 17 new projects and a total of 19,800 housing units [8] - Stanford professor Fei-Fei Li published an article discussing spatial intelligence as the next frontier in AI, emphasizing the need for machines to understand the physical world [9] - The film "Demon Slayer: Infinity Castle Chapter" set a record for pre-sale box office for imported animated films in China, surpassing 1.199 billion yuan [10] Group 4 - Burger King announced a strategic partnership with CPE Yuanfeng, which will invest 350 million USD to support the expansion and innovation of Burger King in China [11] - COMAC's C919 aircraft is set to participate in the 2025 Dubai Airshow, marking its first display in the Middle East [12] - In 2024, the market share of domestic industrial robots in China is expected to exceed 50% for the first time, reaching 58.5% with a sales volume of 177,000 units [13]