世界模型
Search documents
“AI教母”李飞飞发布首款商用世界模型
第一财经· 2025-11-13 02:15
Core Insights - World Labs, founded by AI expert Fei-Fei Li, launched its first product, Marble, which is supported by a multimodal world model designed to create high-fidelity, persistent 3D environments from various inputs [2][5] - Marble offers a freemium model with four subscription tiers, ranging from a free version to a premium version priced at $95 per month, allowing for extensive generation capabilities [5] - Fei-Fei Li emphasizes the importance of spatial intelligence as the next frontier in AI, arguing that current AI models lack a true understanding of the physical world [6][8] Product Features - Marble supports large-scale multimodal input and includes a creative center called Marble Labs, enhancing user experience [5] - The product differentiates itself by generating persistent 3D environments that can be exported in various formats, reducing scene distortion and inconsistency [5] - The real-time model RTFM can run on a single H100 GPU, but Marble's unique selling point is its ability to create downloadable 3D worlds [5] Market Position - Marble is the first commercially available product in the world model space, while competitors like Google's Genie and others are still in limited preview or demo stages [8] - The overall interaction quality of Marble has been positively reviewed, although there is room for improvement in detail precision [8] Future Outlook - In the short term, spatial intelligence is expected to empower creativity in industries such as film, gaming, and architecture [8] - Mid-term implications include advancements in embodied intelligent robotics, enhancing collaboration in domestic and laboratory settings [8] - Long-term potential includes revolutionary applications in science, healthcare, and education through simulation and immersive learning experiences [8] Company Growth - World Labs has raised approximately $230 million in funding, achieving a valuation exceeding $1 billion, making it a new unicorn in the AI sector [9] - The company’s investors include prominent firms such as a16z, Radical Ventures, NVIDIA NVentures, AMD Ventures, and Intel Capital [9] - Future plans involve deepening the understanding of three-dimensionality and physicality, with aspirations to integrate augmented reality and robotics [9]
“AI教母”李飞飞发布首款商用世界模型 空间智能更近了
Di Yi Cai Jing· 2025-11-13 01:37
Core Insights - World Labs, founded by AI expert Fei-Fei Li, launched its first product, Marble, which is supported by a multimodal world model designed to create high-fidelity, persistent 3D worlds from a single image, video, or text prompt [1][4]. Product Features - Marble has expanded its functionalities since its preview release two months ago, now supporting large-scale multimodal input and introducing Marble Labs as a creative center [4]. - The product offers four subscription tiers: a free version with 4 generations limited to text and image input, a standard version at $20/month with multi-image and video input, and a premium version at $95/month allowing 75 generations and full feature access [4]. - Unlike competitors, Marble generates persistent, downloadable 3D environments rather than dynamically generated worlds, significantly reducing scene distortion and inconsistency [4]. Industry Context - Fei-Fei Li argues that current AI models, primarily large language models, lack a true understanding of the physical world, which is essential for achieving genuine machine intelligence [5]. - The concept of spatial intelligence is highlighted as a key breakthrough for AI, enabling machines to understand and interact with the three-dimensional world [5]. - Competitors like Google and Decart are still in the research or demo phase, making Marble the first commercially available product in the world model space [5]. Future Outlook - In the short term, spatial intelligence is expected to empower creativity in industries such as film, gaming, and architecture by providing tools for rapid 3D environment generation [6]. - In the medium term, it may drive the development of embodied intelligent robots, enhancing their role as collaborators in various settings [6]. - Long-term implications include potential revolutions in science, healthcare, and education through simulations and immersive learning experiences [6]. Company Growth - World Labs has raised approximately $230 million, achieving a valuation exceeding $1 billion, making it a new unicorn in the AI sector [6]. - The company’s investors include prominent firms such as a16z, Radical Ventures, NVIDIA NVentures, AMD Ventures, and Intel Capital [6]. - Future plans involve focusing on models that deeply understand three-dimensionality, physicality, and concepts of space and time, with aspirations to support augmented reality and robotics [6].
“AI教母”李飞飞发布首款商用世界模型,空间智能更近了
Di Yi Cai Jing· 2025-11-13 01:31
Core Insights - World Labs, founded by AI expert Fei-Fei Li, launched its first product, Marble, which is described as the foundation for building a spatially intelligent future [1][4] - Marble utilizes a multi-modal world model to create high-fidelity, persistent 3D environments from a single image, video, or text prompt [1][4] - The product is now publicly available with expanded features, including a freemium model and four subscription tiers, ranging from a free version to a flagship version priced at $95 per month [4] Product Features - Marble supports large-scale multi-modal input and includes a creative center called Marble Labs [4] - The subscription options include a free version with limited capabilities and paid versions that allow for more extensive generation and advanced editing [4] - Unlike competitors, Marble generates persistent, downloadable 3D environments, reducing scene distortion and inconsistency [4][5] Industry Context - Fei-Fei Li argues that spatial intelligence is crucial for achieving true machine intelligence, as it allows for a comprehensive understanding of the physical world [5] - Other companies, such as Google, are also exploring world models, but Marble is the first commercially available product in this space [5] - The industry evaluation indicates that while Marble's interaction effects are strong, there is room for improvement in detail precision [5] Future Implications - In the short term, spatial intelligence is expected to empower creativity in industries like film, gaming, and architecture [5][6] - Mid-term, it may drive the development of embodied intelligent robots for collaboration in various settings [6] - Long-term, spatial intelligence could revolutionize fields such as science, healthcare, and education through enhanced simulations and immersive learning experiences [6] Company Growth - World Labs has raised approximately $230 million, achieving a valuation exceeding $1 billion, making it a new unicorn in the AI sector [6] - The company’s investors include prominent firms such as a16z, Radical Ventures, NVIDIA NVentures, AMD Ventures, and Intel Capital [6] - Future plans involve focusing on models that deeply understand three-dimensionality, physicality, and concepts of space and time, with aspirations to support augmented reality and robotics [6]
腾讯研究院AI速递 20251113
腾讯研究院· 2025-11-12 16:08
Group 1: Generative AI Developments - Meta's Chief AI Scientist LeCun is leaving the company due to strategic disagreements, focusing on "world models" in a new startup [1] - Google's AI model successfully transcribed an 18th-century ledger with a character error rate of only 1.7%, showcasing advanced abstract reasoning capabilities [2] - ElevenLabs launched the Scribe v2 Realtime model, achieving a 93.5% accuracy rate across 90 languages with a latency of just 150 milliseconds [3] Group 2: AI in Communication and Music - OpenAI is set to introduce a group chat feature for ChatGPT, allowing users to share conversation links while maintaining privacy [4] - An AI-generated song topped the Billboard country digital singles chart, raising concerns about the competition between AI and human artists [5] Group 3: Investment and Financing in AI - The AI company Jiga Vision completed a financing round of over 100 million yuan, with investments from Huawei and other funds [6] - Gamma, an AI presentation tool, raised $68 million in Series B funding, achieving a valuation of $2.1 billion and generating an annual recurring revenue of $100 million [9] Group 4: Programming Language Trends - TypeScript has surpassed Python as the most widely used programming language on GitHub, with a 66% year-over-year increase in contributors [8]
锦秋基金被投企业流形空间3个月融资亿元,证明世界模型也需要预训练 |Jinqiu Spotlight
锦秋集· 2025-11-12 12:44
Core Insights - The article discusses the emergence and potential of world models in AI, particularly focusing on the company Manifold AI and its CEO Wu Wei's vision for developing a robust world model that can understand and predict the physical world [7][10][22]. Investment and Company Overview - Jinqiu Fund has invested in Manifold AI, which has quickly raised over 100 million in seed and angel rounds within three months of its establishment [4][6]. - Jinqiu Fund emphasizes a long-term investment philosophy, seeking breakthrough technologies and innovative business models in general artificial intelligence startups [5]. Technology and Market Trends - The concept of world models is gaining traction, with significant discussions in Silicon Valley about their capabilities, including generative, multimodal, and interactive features [8][9]. - Wu Wei argues that world models can provide superior predictive capabilities compared to Vision-Language-Action (VLA) models, which are limited by their reliance on past experiences [18][22]. Technical Development and Challenges - The development of world models is still in its early stages, with various approaches being explored, including explicit physical modeling and latent space interaction [25][30]. - Manifold AI aims to create a "bodily world model" that can transfer and unify across different scales, contrasting with the top-down strategies of many international teams [33]. Strategic Focus and Market Positioning - Manifold AI prioritizes the robotics and drone sectors over autonomous driving due to the fragmented nature of these markets, which allows for more opportunities for innovation [43][44]. - The company is focused on enabling hardware to possess autonomous reasoning capabilities, moving away from human-controlled operations [46]. Future Goals and Product Development - The company plans to release its first generation of base models based on the World Model Architecture (WMA) by late 2025 to early 2026, aiming to drive advancements in Physical AI Agents [51]. - Wu Wei emphasizes the importance of pre-training models to understand physical world dynamics, which can reduce deployment costs significantly [37][40].
95后AI才女,官宣加入小米,雷军千万年薪挖人
3 6 Ke· 2025-11-12 12:14
雷军去年开出千万年薪挖角的95后AI才女,如今终于官宣入职小米。 2024年12月底多家媒体报道,小米创始人雷军亲自出面,想用千万年薪招揽,曾在国际顶会发表8篇论文、DeepSeek-V2关键开发者之一的罗福莉,领导 小米AI大模型团队。 但在那之后,双方都没给出官方消息,罗福莉也因为不想被过度打扰,慢慢淡出了公众视野,"我不是天才少女,只想安安静静做难而正确的事情。" 她到底有没有加入小米,也成了谜。 直到11月12日,罗福莉发了条朋友圈,正式确认已经加入小米。 尘埃落定,加入Xiaomi MiMo团队 虽说这是罗福莉的正式官宣,但她与小米之间早已有了不少羁绊。 今年2月,罗福莉的家属曾透露她已到新岗位上班,但当时小米的员工系统中并未出现她的名字,这让她的去向多了一层悬念。 9月,罗福莉在知乎上评论了小米语音大模型开源的帖子,直言"小米开源了一个语音大模型,非常强!建议马上实装"。 到了10月,她的名字出现在小米论文中,以通讯作者身份位列作者最后一位。 这篇论文由"北京大学计算机学院多媒体信息处理国家重点实验室",以及"小米大模型核心团队"联合署名,却并未标注罗福莉所属团队。 因此外界推测她可能是合作研究, ...
Meta首席AI科学家Yann LeCun被曝将离职,投身“世界模型”创业
Guo Ji Jin Rong Bao· 2025-11-12 12:12
Core Insights - Meta is undergoing significant changes in its AI strategy, with key personnel departures including Yann LeCun, the Chief AI Scientist, who plans to start a new AI startup focused on "world models" [1][3] - Mark Zuckerberg is shifting the company's focus from foundational research to practical applications, as evidenced by the hiring of Alexandr Wang to lead the new Meta Superintelligence Labs with a substantial investment of $14.3 billion [1][2] - Internal policies at Meta have restricted academic freedom within the FAIR lab, leading to dissatisfaction among members and contributing to LeCun's potential departure [2][3] Group 1 - Yann LeCun's departure is part of a broader trend of leadership changes in Meta's AI division, which is facing challenges from competitors like OpenAI and Google [1][3] - The company has initiated layoffs affecting around 600 employees, particularly in the FAIR lab, while the newly formed TBD Lab remains unaffected [3] - LeCun's vision for AI emphasizes "world models" that understand the physical world through video and spatial data, contrasting with Meta's current focus on large language models (LLMs) [3][4] Group 2 - Meta's strategic pivot includes a new policy requiring additional scrutiny of research outputs from the FAIR lab, which has been perceived as a limitation on academic freedom [2] - Competitors like Google DeepMind and NVIDIA are also investing in "world models," indicating a growing interest in this area within the AI industry [4] - Stanford's Fei-Fei Li has raised approximately $230 million for her startup World Labs, which aims to enhance AI's "spatial intelligence," further highlighting the competitive landscape [4]
李飞飞揭大模型“死穴”:不会空间智能,再能聊也是纸上谈兵
3 6 Ke· 2025-11-12 11:47
当科技界仍深陷于大模型"参数内卷"时,斯坦福大学教授、World Labs联合创始人李飞飞教授指向了一个更本质的瓶颈:当前AI被困在由文本和 二维图像构成的"扁平世界"里,它与我们生活其中的、立体的、受物理规律支配的现实严重脱节。 11月11日,在她刷屏的一篇长文中,李飞飞鲜明指出,空间智能,正是打破这层认知隔膜的关键。它不仅代表了人工智能演进的下一个前沿,更 是AI真正融入物理世界、从"对话工具"蜕变为"行动伙伴"的转折点。 本文梳理了李飞飞在这篇长文中对于空间智能的技术路径与应用前景系统阐述,并结合多位产业实践者的洞察,共同展望这一变革性力量将如何 重塑人机关系与产业生态。 从语言到世界,空间智能是AI的破晓之光 当前人工智能,特别是生成式AI已在创意、效率与沟通方面深刻改变了世界。 然而,李飞飞指出,当前AI在诸多关键领域应用的宏伟愿景还远未实现。自主机器人的发展尚未走出实验室与特定场景,其"融入日常生活"的愿 景仍停留于概念推演; 在科学研究中,AI虽展现出潜力,但距离真正实现疾病诊疗、新材料研发与基础物理探索的效率革命,仍有相当距离; 而在创意赋能方面,无论是辅助学生理解复杂抽象概念、支持建筑师进行 ...
雷军挖来前DeepSeek大将,大模型团队40人合影曝光,疑进军具身智能
3 6 Ke· 2025-11-12 08:31
Core Insights - The announcement of Luo Fuli joining Xiaomi MiMo team signifies Xiaomi's ambition towards AGI (Artificial General Intelligence) and highlights her focus on "world models" and "embodied intelligence" [1][10]. Group 1: Luo Fuli's Background and Transition - Luo Fuli, a prominent figure in AI research, has transitioned from DeepSeek to Xiaomi, confirming rumors of her high-profile recruitment with a reported annual salary in the millions [6][4]. - She has a strong academic background with a Bachelor's degree in Computer Science from Beijing Normal University and a Master's in Computational Linguistics from Peking University, contributing to significant projects like VECO and DeepSeek-V2 [4][6]. Group 2: Xiaomi MiMo and AGI Vision - Xiaomi MiMo, the company's first open-source inference model, was launched in April and has shown promising results in mathematical reasoning and coding competitions, outperforming models from OpenAI [7]. - The MiMo ecosystem is expanding with the introduction of multi-modal models, indicating progress towards a "world model" that integrates various forms of information [7]. - Xiaomi has been actively investing in the field of embodied intelligence, with recent investments in startups like DeepMind, totaling nearly 30 companies since 2014 [8]. Group 3: Future Implications - Luo Fuli's involvement is expected to accelerate Xiaomi's advancements in AGI, particularly in the areas of world models and embodied intelligence, raising industry expectations for future developments [10].
Meta首席AI科学家LeCun被曝将离职创业,与扎克伯格“超智能”路线理念分歧
硬AI· 2025-11-12 05:00
Core Viewpoint - Meta is undergoing a significant strategic shift in its AI approach, moving from long-term foundational research to rapid product iteration, highlighted by the departure of key AI figure Yann LeCun and the underperformance of its Llama 4 model [2][3][6]. Group 1: Strategic Divergence - Yann LeCun, a Turing Award winner and head of Meta's Fundamental AI Research Lab, advocates for a new generation AI system called "world model," which aims to understand the physical world through video and spatial data, aspiring to achieve human-level intelligence [5]. - LeCun believes that the current focus on large language models (LLMs) is useful but insufficient for human-like reasoning and planning, contrasting sharply with Zuckerberg's emphasis on rapid productization and the development of "superintelligent" teams [5][6]. Group 2: Leadership Changes and Cost Pressures - LeCun's planned departure from Meta, where he has been a pivotal figure since 2013, reflects a broader trend of executive turnover within the company, including the exit of AI research VP Joelle Pineau and layoffs of approximately 600 employees in the AI research department [11]. - In response to competitive pressures and the need to demonstrate returns on substantial investments in AI, Zuckerberg has hired Alexandr Wang for $14.3 billion to lead a new "superintelligent" team and acquired 49% of Wang's data annotation startup, Scale AI [7][11]. - The restructuring has resulted in LeCun reporting to Wang instead of the previous chief product officer, indicating a shift in focus towards immediate product development rather than foundational research [8].