Workflow
训练闭环
icon
Search documents
理想下一步的重点:从数据闭环到训练闭环
自动驾驶之心· 2025-12-14 02:03
点击下方 卡片 ,关注" 自动驾驶之心 "公众号 戳我-> 领取 自动驾驶近30个 方向 学习 路线 >>自动驾驶前沿信息获取 → 自动驾驶之心知识星球 理想汽在ICCV'25期间也分享了些新东西!目前还没有视频对外。 VLA团队负责人詹锟老师做了一场世界模型的presentation,名为World Model: Evolving from Data Closed-loop to Training Closed-loop。自动驾驶之心第一时间做了解 读分享给大家~ 首先是介绍下理想VLA司机大模型: 回顾了理想汽车智能驾驶的发展路线,从规则时代的轻图和无图,再到基于AI的E2E+VLM快慢双系统和VLA, 这四个方案中Nav(导航)是重点突出的模块。 下面介绍的是数据闭环的价值。左上角这张图是一个完整的数据闭环流程: 影子模式验证→经由数据触发回传到云端进行数据挖掘→有效样本进行自动标注→生 成训练集训练模型→模型下发验证性能。 这个过程已经可以做到一分钟的数据回传。 目前已经有15亿公里的驾驶数据,200+的Trigger来生产15-45s的Clip数据。 目前理想的端到端量产版本MPI已经到了220+, ...
ICCV涌现自动驾驶新范式:统一世界模型VLA,用训练闭环迈向L4
量子位· 2025-11-08 04:10
Core Viewpoint - The article discusses the shift in the autonomous driving industry from a data-driven approach to a training-driven approach, emphasizing the importance of world models and reinforcement learning in achieving Level 4 (L4) autonomy [2][4][6]. Group 1: Transition from Data Loop to Training Loop - The current data loop is insufficient for advancing autonomous driving technology, necessitating a shift to a training loop that allows for continuous model iteration through environmental feedback [4][11]. - Ideal's approach involves building a world model training environment in the cloud, which integrates prior knowledge and driving capabilities into the vehicle's VLA model [11][30]. - The world model encompasses environment construction, agent modeling, feedback mechanisms, and various scenario simulations, which are crucial for the training loop [13][31]. Group 2: Simulation and Evaluation Techniques - Ideal employs a combination of reconstruction and generation techniques for simulation, allowing for both stable and dynamic outputs [14][15][16]. - The Hierarchy UGP model, developed in collaboration with academic institutions, achieves state-of-the-art results in large-scale dynamic scene reconstruction [21][19]. - The focus on synthetic data generation enhances the diversity and complexity of training scenarios, improving model performance [25][24]. Group 3: Reinforcement Learning and Challenges - The reinforcement learning world engine enables models to explore training environments and receive feedback, with five key factors influencing its effectiveness [25][27]. - The simulation of interactions between multiple agents poses significant challenges, with Ideal exploring self-play and reward function adjustments to enhance sample diversity [27][29]. Group 4: Commercialization and Technological Advancements - Ideal has successfully established a profitable business model, which supports its ongoing research and development efforts, with over 10 billion yuan invested in the self-developed Star Ring OS [32][33]. - The Star Ring OS enhances vehicle performance by streamlining communication between different control systems, significantly reducing braking distances [35][36]. - The open-source initiative of the Star Ring OS is expected to benefit the entire industry, reducing development costs for other automakers [39][40]. Group 5: Industry Position and Future Outlook - Ideal is positioning itself as a leading player in the AI-driven automotive sector, with a focus on becoming a "space robotics company" [48][50]. - The company has established a research-production closed loop, allowing for rapid application of research findings to production, exemplified by the DriveVLM project [52]. - The article concludes that while many companies are investing in AI and robotics, few have achieved the comprehensive capabilities demonstrated by Ideal and Tesla [53].
理想ICCV'25分享了世界模型:从数据闭环到训练闭环
自动驾驶之心· 2025-11-07 00:05
Core Insights - The article discusses the advancements in autonomous driving technology, particularly focusing on the transition from data closed-loop systems to training closed-loop systems, marking a new phase in autonomous driving development [18][21]. Group 1: Development of Ideal Auto's Intelligent Driving - Ideal Auto's intelligent driving has evolved through various stages, from rule-based systems to AI-driven E2E+VLM dual systems and VLA, with a strong emphasis on navigation as a key module [6]. - The current end-to-end mass production version of MPI has reached over 220, representing a 19-fold increase compared to the version from July 2024 [13]. Group 2: Data Closed-Loop Value - The data closed-loop process includes shadow mode validation, data feedback to the cloud for mining, automatic labeling of effective samples, and model training, with data return achievable in one minute [9][10]. - Ideal Auto has accumulated 1.5 billion kilometers of driving data, utilizing over 200 triggers to produce 15-45 second clip data [11]. Group 3: Transition to Training Closed-Loop - The core of the L4 training loop involves VLA, reinforcement learning (RL), and world models (WM), optimizing trajectories through diffusion and reinforcement learning [23]. - Key technologies for closed-loop autonomous driving training include regional simulation, synthetic data, and reinforcement learning [24]. Group 4: Reconstruction and Generation Work - Ideal Auto has made significant progress in reconstruction and generation, with multiple top conference papers published in the last two years [28][32][34]. - The generation applications range from scene editing to scene migration and scene generation [36]. Group 5: Interactive Agents and System Capabilities - The development of interactive agents is highlighted as a critical challenge in the training closed-loop [40]. - System capabilities are enhanced through world models providing simulation environments, diverse scene construction, and accurate feedback from reward models [41]. Group 6: Community and Collaboration - The article mentions the establishment of nearly a hundred technical communication groups related to various autonomous driving technologies, with a community of around 4,000 members and over 300 companies and research institutions involved [50][51].
理想ICCV'25分享了世界模型:从数据闭环到训练闭环
自动驾驶之心· 2025-10-30 00:56
Core Insights - The article discusses the advancements in autonomous driving technology, particularly focusing on the transition from data closed-loop systems to training closed-loop systems, marking a new phase in autonomous driving development [17][20]. Group 1: Development of Li Auto's VLA Model - Li Auto's VLA driver model has evolved through various stages, from rule-based systems to AI-driven E2E+VLM systems, with a strong emphasis on navigation as a key module [6]. - The end-to-end mass production version of MPI has reached over 220 units, representing a 19-fold increase compared to the version from July 2024 [12]. Group 2: Data Closed-Loop Value - The data closed-loop process includes shadow mode validation, data mining in the cloud, automatic labeling of effective samples, and model training, with a data return time of one minute [9][10]. - Li Auto has accumulated 1.5 billion kilometers of driving data, utilizing over 200 triggers to produce 15-45 second clip data [10]. Group 3: Transition to Training Closed-Loop - The core of the L4 training loop involves VLA, reinforcement learning (RL), and world models (WM), optimizing trajectories through diffusion and reinforcement learning [22]. - Key technologies for closed-loop autonomous driving training include regional simulation, synthetic data, and reinforcement learning [24]. Group 4: Simulation and Generation Techniques - Simulation relies on scene reconstruction, including visual and Lidar reconstruction, while synthetic data generation utilizes multimodal techniques [25]. - Li Auto's recent advancements in reconstruction and generation have led to significant improvements, with multiple top conference papers published in the last two years [26][29][31]. Group 5: Interactive Agents and System Capabilities - The development of interactive agents is highlighted as a critical challenge in the training closed-loop [37]. - System capabilities are enhanced through world models providing simulation environments, diverse scene construction, and accurate feedback from reward models [38]. Group 6: Community and Collaboration - The article mentions the establishment of nearly a hundred technical discussion groups related to various autonomous driving technologies, with a community of around 4,000 members and over 300 companies and research institutions involved [44][45].