训练闭环
Search documents
2025年几家自动驾驶公司的采访总结
自动驾驶之心· 2026-01-22 09:07
Core Algorithm - The industry has shifted towards end-to-end solutions, moving away from modular approaches, at least in public discourse [1] - The introduction of world models is prevalent, with some companies using them to generate training data, while others incorporate them into end-to-end models to enhance performance [1][8] - There is a divergence in opinions regarding the necessity of language models (VLA) in autonomous driving, with some companies arguing that language is not essential for driving tasks [1][11] Simulation and Infrastructure - The closed-loop systems have evolved from data-driven to simulation testing and training loops [2] - 3DGS is highlighted as a crucial technology for building simulation environments, as emphasized by Tesla at CVPR 2025 [5] - Infrastructure is critical, with companies like Xiaomi and Li Auto noting its benefits for development efficiency [3][14] Organizational Capability - Organizational ability is vital, as large autonomous driving teams face significant management challenges [4] - Team culture and collaboration are emphasized as essential for overcoming complex technical and management issues [5] Technical Choices Comparison - A comparison of various companies' technical choices reveals differing approaches to core technologies and the role of world models and simulation tools [9] - Companies like Li Auto advocate for a training loop that evolves from imitation to self-learning, while NVIDIA emphasizes interpretability and reasoning in AI [9] Key Non-Core Factors - R&D infrastructure and engineering efficiency are crucial for the success of autonomous driving technologies [14] - Simulation and synthetic data are becoming essential for addressing corner cases that real-world data cannot cover [14] - The scale of computing power and chip adaptation is critical, as autonomous driving is not just a software issue but also a hardware challenge [15] User Experience and Safety - User experience and safety are paramount, with companies like Xiaomi stressing the importance of balancing advanced technology with user concerns [17] - The need for a dual-stack safety mechanism is highlighted, ensuring that even aggressive end-to-end models have a fallback to traditional rule-based systems for safety [19]
理想下一步的重点:从数据闭环到训练闭环
自动驾驶之心· 2025-12-14 02:03
Core Insights - The article discusses the evolution of autonomous driving technology, highlighting the transition from data closed-loop systems to training closed-loop systems, marking a new phase in autonomous driving development [18][21]. Group 1: Development of Autonomous Driving Technology - The development trajectory of Li Auto's intelligent driving has evolved from rule-based systems to AI-driven E2E+VLM dual systems and VLA, with a focus on navigation as a key module [6]. - Li Auto has accumulated 1.5 billion kilometers of driving data, utilizing over 200 triggers to produce 15-45 second clip data [11]. - The end-to-end mass production version MPI has increased to over 220, representing a 19-fold increase compared to the version from July 2024 [13]. Group 2: Data Closed-Loop and Its Limitations - The data closed-loop process includes shadow mode validation, data mining in the cloud, automatic labeling of effective samples, and model training, with data return achievable in one minute [9][10]. - Despite the effectiveness of the data closed-loop, it cannot address all issues, particularly long-tail scenarios such as traffic control and sudden lane changes [16]. Group 3: Transition to Training Closed-Loop - The core of the L4 training loop involves VLA, reinforcement learning (RL), and world models (WM), optimizing trajectories through diffusion and reinforcement learning [23]. - Key technologies for closed-loop autonomous driving training include regional simulation, synthetic data, and reinforcement learning [24]. Group 4: Advances in Reconstruction and Generation - Li Auto has made significant advancements in reconstruction and generation, with multiple top conference papers published in the past two years [28][34]. - The company has developed a feedforward 3D generation system that eliminates the need for point cloud initialization, directly producing results from visual inputs [29]. Group 5: Challenges and System Capabilities - The interactive agent is identified as a key challenge in the training closed-loop [40]. - System capabilities are enhanced by the world model providing simulation environments, diverse scene construction, and accurate feedback from reward models [41].
ICCV涌现自动驾驶新范式:统一世界模型VLA,用训练闭环迈向L4
量子位· 2025-11-08 04:10
Core Viewpoint - The article discusses the shift in the autonomous driving industry from a data-driven approach to a training-driven approach, emphasizing the importance of world models and reinforcement learning in achieving Level 4 (L4) autonomy [2][4][6]. Group 1: Transition from Data Loop to Training Loop - The current data loop is insufficient for advancing autonomous driving technology, necessitating a shift to a training loop that allows for continuous model iteration through environmental feedback [4][11]. - Ideal's approach involves building a world model training environment in the cloud, which integrates prior knowledge and driving capabilities into the vehicle's VLA model [11][30]. - The world model encompasses environment construction, agent modeling, feedback mechanisms, and various scenario simulations, which are crucial for the training loop [13][31]. Group 2: Simulation and Evaluation Techniques - Ideal employs a combination of reconstruction and generation techniques for simulation, allowing for both stable and dynamic outputs [14][15][16]. - The Hierarchy UGP model, developed in collaboration with academic institutions, achieves state-of-the-art results in large-scale dynamic scene reconstruction [21][19]. - The focus on synthetic data generation enhances the diversity and complexity of training scenarios, improving model performance [25][24]. Group 3: Reinforcement Learning and Challenges - The reinforcement learning world engine enables models to explore training environments and receive feedback, with five key factors influencing its effectiveness [25][27]. - The simulation of interactions between multiple agents poses significant challenges, with Ideal exploring self-play and reward function adjustments to enhance sample diversity [27][29]. Group 4: Commercialization and Technological Advancements - Ideal has successfully established a profitable business model, which supports its ongoing research and development efforts, with over 10 billion yuan invested in the self-developed Star Ring OS [32][33]. - The Star Ring OS enhances vehicle performance by streamlining communication between different control systems, significantly reducing braking distances [35][36]. - The open-source initiative of the Star Ring OS is expected to benefit the entire industry, reducing development costs for other automakers [39][40]. Group 5: Industry Position and Future Outlook - Ideal is positioning itself as a leading player in the AI-driven automotive sector, with a focus on becoming a "space robotics company" [48][50]. - The company has established a research-production closed loop, allowing for rapid application of research findings to production, exemplified by the DriveVLM project [52]. - The article concludes that while many companies are investing in AI and robotics, few have achieved the comprehensive capabilities demonstrated by Ideal and Tesla [53].
理想ICCV'25分享了世界模型:从数据闭环到训练闭环
自动驾驶之心· 2025-11-07 00:05
Core Insights - The article discusses the advancements in autonomous driving technology, particularly focusing on the transition from data closed-loop systems to training closed-loop systems, marking a new phase in autonomous driving development [18][21]. Group 1: Development of Ideal Auto's Intelligent Driving - Ideal Auto's intelligent driving has evolved through various stages, from rule-based systems to AI-driven E2E+VLM dual systems and VLA, with a strong emphasis on navigation as a key module [6]. - The current end-to-end mass production version of MPI has reached over 220, representing a 19-fold increase compared to the version from July 2024 [13]. Group 2: Data Closed-Loop Value - The data closed-loop process includes shadow mode validation, data feedback to the cloud for mining, automatic labeling of effective samples, and model training, with data return achievable in one minute [9][10]. - Ideal Auto has accumulated 1.5 billion kilometers of driving data, utilizing over 200 triggers to produce 15-45 second clip data [11]. Group 3: Transition to Training Closed-Loop - The core of the L4 training loop involves VLA, reinforcement learning (RL), and world models (WM), optimizing trajectories through diffusion and reinforcement learning [23]. - Key technologies for closed-loop autonomous driving training include regional simulation, synthetic data, and reinforcement learning [24]. Group 4: Reconstruction and Generation Work - Ideal Auto has made significant progress in reconstruction and generation, with multiple top conference papers published in the last two years [28][32][34]. - The generation applications range from scene editing to scene migration and scene generation [36]. Group 5: Interactive Agents and System Capabilities - The development of interactive agents is highlighted as a critical challenge in the training closed-loop [40]. - System capabilities are enhanced through world models providing simulation environments, diverse scene construction, and accurate feedback from reward models [41]. Group 6: Community and Collaboration - The article mentions the establishment of nearly a hundred technical communication groups related to various autonomous driving technologies, with a community of around 4,000 members and over 300 companies and research institutions involved [50][51].
理想ICCV'25分享了世界模型:从数据闭环到训练闭环
自动驾驶之心· 2025-10-30 00:56
Core Insights - The article discusses the advancements in autonomous driving technology, particularly focusing on the transition from data closed-loop systems to training closed-loop systems, marking a new phase in autonomous driving development [17][20]. Group 1: Development of Li Auto's VLA Model - Li Auto's VLA driver model has evolved through various stages, from rule-based systems to AI-driven E2E+VLM systems, with a strong emphasis on navigation as a key module [6]. - The end-to-end mass production version of MPI has reached over 220 units, representing a 19-fold increase compared to the version from July 2024 [12]. Group 2: Data Closed-Loop Value - The data closed-loop process includes shadow mode validation, data mining in the cloud, automatic labeling of effective samples, and model training, with a data return time of one minute [9][10]. - Li Auto has accumulated 1.5 billion kilometers of driving data, utilizing over 200 triggers to produce 15-45 second clip data [10]. Group 3: Transition to Training Closed-Loop - The core of the L4 training loop involves VLA, reinforcement learning (RL), and world models (WM), optimizing trajectories through diffusion and reinforcement learning [22]. - Key technologies for closed-loop autonomous driving training include regional simulation, synthetic data, and reinforcement learning [24]. Group 4: Simulation and Generation Techniques - Simulation relies on scene reconstruction, including visual and Lidar reconstruction, while synthetic data generation utilizes multimodal techniques [25]. - Li Auto's recent advancements in reconstruction and generation have led to significant improvements, with multiple top conference papers published in the last two years [26][29][31]. Group 5: Interactive Agents and System Capabilities - The development of interactive agents is highlighted as a critical challenge in the training closed-loop [37]. - System capabilities are enhanced through world models providing simulation environments, diverse scene construction, and accurate feedback from reward models [38]. Group 6: Community and Collaboration - The article mentions the establishment of nearly a hundred technical discussion groups related to various autonomous driving technologies, with a community of around 4,000 members and over 300 companies and research institutions involved [44][45].