蚂蚁深夜开源比肩Genie 3的世界模型,我也看到了具身智能的未来。
数字生命卡兹克·2026-01-29 02:06

Core Viewpoint - The article discusses the recent release of LingBot-World, a groundbreaking world model developed by Ant Group's Lingbo Technology, which is comparable in quality to Google Genie 3 and is open-sourced, marking a significant advancement in interactive real-time world modeling [3][8][32]. Group 1: Model Features - LingBot-World allows for real-time generation of environments based on user input, creating a dynamic and interactive experience where the world evolves as the user navigates [12][30]. - The model exhibits strong long-term memory capabilities, maintaining consistency in the environment even as the user changes perspective, which is crucial for immersive experiences [48][55]. - It demonstrates exceptional style generalization, effectively blending realistic and non-realistic styles, which is a challenge for many existing models [62][68]. Group 2: Technical Specifications - The model has approximately 28 billion parameters, with inference capabilities around 14 billion [44]. - Three versions of the model are available: LingBot-World-Base (Cam), which focuses on camera control; LingBot-World-Base (Act), which emphasizes action control; and LingBot-World-Fast, designed for low latency and real-time interaction [39][41][43]. Group 3: Innovation and Impact - The article emphasizes the potential of LingBot-World to revolutionize various fields, including gaming, film, and embodied intelligence, by providing a low-cost, high-fidelity testing space for real-world understanding and long-term tasks [96][97]. - The open-source nature of the project is highlighted as a significant step forward, allowing broader access and innovation within the AI community [100][101].

蚂蚁深夜开源比肩Genie 3的世界模型,我也看到了具身智能的未来。 - Reportify