Core Viewpoint
- Odyssey, a company founded by autonomous driving experts, has developed a world model that generates interactive video in real time. Each frame takes about 40 milliseconds to produce (roughly 25 frames per second), which is shorter than the duration of a human blink [1][5][6].

Company Highlights
- Odyssey has raised $27 million (approximately 190 million RMB) from investors including EQT Ventures, GV (Google Ventures), and Air Street Capital. Ed Catmull, a co-founder of Pixar and a Turing Award winner, sits on its board [5].
- The platform is currently free to use, and heavy user interest has congested its servers [6].

Technology Differentiation
- Odyssey distinguishes world models from video models: a world model responds to user input in real time, so its output can change as the user interacts, while a video model generates fixed content with no interactivity [8][10].
- The company believes that learning from real-world video can extend world models beyond traditional game environments [15].

Development Challenges
- Odyssey acknowledges that learning from open-ended real-world video is difficult because the footage is complex and unpredictable [16].
- The primary challenge lies in autoregressive modeling: each output is fed back as input to the next prediction, so errors can compound and destabilize generation [18][19].

Innovative Solutions
- To address these challenges, Odyssey has developed what it calls a narrow distribution model: it is pre-trained on broad video data and then fine-tuned on specific dense video data, which improves stability and persistence in autoregressive generation [20].

Future Prospects
- The company is working on its next-generation world model to improve generalization [21].
- The current version is a preview, and user feedback so far has been positive [23].
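
The autoregressive instability described under Development Challenges can be illustrated with a toy example. The sketch below is not Odyssey's model: it assumes a hypothetical scalar "world" that grows by 5% per frame and a hypothetical learned model that is off by one percentage point per step. It compares the per-frame error when the model is always given the true previous frame against the error when the model is fed its own previous output, over 25 frames (one second at 40 ms per frame).

```python
# Toy illustration only: scalar stand-ins for frames, with made-up dynamics.
FRAME_MS = 40
FRAMES_PER_SECOND = 1000 // FRAME_MS  # 25 frames at 40 ms per frame


def true_step(x: float) -> float:
    """Hypothetical ground-truth dynamics: the state grows 5% per frame."""
    return 1.05 * x


def model_step(x: float) -> float:
    """Hypothetical learned model with a small per-step bias (6% growth)."""
    return 1.06 * x


def teacher_forced_errors(x0: float, n_steps: int) -> list[float]:
    """One-step errors when the model always sees the true previous state."""
    errors = []
    x = x0
    for _ in range(n_steps):
        errors.append(abs(model_step(x) - true_step(x)))
        x = true_step(x)  # the next input is the ground truth
    return errors


def free_running_errors(x0: float, n_steps: int) -> list[float]:
    """Errors when the model's own output is fed back as its next input."""
    errors = []
    x_true = x_model = x0
    for _ in range(n_steps):
        x_true = true_step(x_true)
        x_model = model_step(x_model)  # the next input is the model's output
        errors.append(abs(x_model - x_true))
    return errors


if __name__ == "__main__":
    tf = teacher_forced_errors(1.0, FRAMES_PER_SECOND)
    fr = free_running_errors(1.0, FRAMES_PER_SECOND)
    print(f"error after 1 frame:   teacher-forced={tf[0]:.4f} free-running={fr[0]:.4f}")
    print(f"error after {FRAMES_PER_SECOND} frames: teacher-forced={tf[-1]:.4f} free-running={fr[-1]:.4f}")
```

Both errors start out identical on the first frame, but the free-running error grows much faster because each mistake becomes part of the next input. This is the general failure mode that stability-oriented training aims to contain.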

Industry Context
- More than 10 automotive and autonomous driving companies, including Tesla and NIO, are exploring world models, which points to a competitive landscape [38].
- The autonomous driving sector is seen as fertile ground for developing world models, suggesting significant future growth in this area [40].
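Interactive video generated in real time! Two autonomous driving veterans launch a world model startup: 40 milliseconds per frame, no game engine required, free for anyone to try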
QbitAI (量子位) · 2025-05-29 07:19