“AI教母”李飞飞发布实时生成式世界模型!一张H100就能运行
Di Yi Cai Jing·2025-10-17 04:40

Core Insights - The new real-time generative world model RTFM developed by World Labs is designed to run on a single H100 GPU, emphasizing efficiency, scalability, and persistence [1][4][5] - The model is based on large-scale video data and is an autoregressive diffusion Transformer, capable of modeling 3D geometry, reflections, and shadows [4][5] - World Labs aims to create a virtual 3D space where users can control physical variables, with significant implications for various industries including gaming and film production [8][9] Group 1: Model Features - RTFM operates under three key principles: efficiency, scalability, and persistence, allowing it to run on minimal GPU resources while expanding with increased data and computational power [4][5] - The model's computational demands are expected to exceed those of current large language models, with the need to generate over 100,000 tokens per second for 4K interactive video streams [4][5] Group 2: Company Background - World Labs, founded by Fei-Fei Li in 2024, has raised approximately $230 million, achieving a valuation of over $1 billion, making it a new unicorn in the AI sector [8][9] - The company has received investments from prominent players in the tech and venture capital space, including a16z, NVIDIA NVentures, AMD Ventures, and Intel Capital [8] Group 3: Future Plans - World Labs plans to focus on building models with a deep understanding of 3D, physical, and spatial concepts, with future support for augmented reality (AR) and robotics [10]