仿真数据
Search documents
搞自驾这七年,绝大多数的「数据闭环」都是伪闭环
自动驾驶之心· 2026-01-08 05:58
Core Viewpoint - The concept of "data closed loop" in the autonomous driving industry is still largely limited to small internal loops within algorithm teams, rather than achieving the grand vision of a comprehensive system that directly solves problems through data [1]. Group 1: Definition of "True Data Closed Loop" - A "true closed loop" must meet three levels: automated problem discovery, quantifiable and reviewable solution effects, and a comprehensive trigger system that integrates real-time and historical data [4][5]. - The ideal state involves a system that can automatically classify issues, route them to the appropriate teams, and assist in developing trigger rules, thereby reducing reliance on manual processes [5]. Group 2: Current Industry Practices - Many companies' so-called "data closed loops" are more accurately described as "data-driven development processes with some automation tools," primarily limited to the perspective of individual algorithm teams [8]. - Typical workflows are often module-level and algorithm-focused, lacking a system-wide perspective [9]. Group 3: Reasons for Lack of True Closed Loops - The starting point for many companies is a "passive closed loop," where problems are identified reactively rather than through automated data analysis [10]. - Attribution of issues is often difficult, as multiple interrelated factors contribute to the same phenomenon [12]. - The data-to-solution chain often stops at data-to-model, failing to address real-world problems effectively [16]. Group 4: Data Closed Loop Practices - The company has developed a more aggressive approach to data closed loops, treating data as a product and metrics as primary citizens [24]. - The overall strategy involves quantifying real-world pain points and using triggers to convert these into actionable data [25]. Group 5: Trigger Mechanism - The trigger mechanism is designed to be lightweight and high-recall, ensuring that significant events are captured without overwhelming the system [32]. - Once a trigger is activated, it generates a micro log that is uploaded for further analysis, leading to more detailed data collection if necessary [35]. Group 6: Unified Trigger Framework - A unified trigger framework using Python allows for consistent implementation across vehicle data mining, cloud data analysis, and simulation validation [50]. - This framework enables non-technical team members to participate in writing rules, thus democratizing the process of data analysis [54]. Group 7: Distinction Between World Labels and Algorithm Labels - The company maintains two types of labels: world-level labels that describe objective physical conditions and model-level labels that depend on algorithm performance [61]. - This distinction is crucial for effective data analysis and problem-solving in the autonomous driving context [61]. Group 8: Use of Generative and Simulation Data - Generative data is primarily used to address long-tail scenarios that are difficult to encounter in real life, but real data remains essential for evaluation and validation [67]. - The company emphasizes the importance of filtering data through structured labels before applying vector retrieval methods to ensure efficiency and accuracy [64].
理想VLA含金量分析与关键迭代方向预测
理想TOP2· 2025-08-09 06:18
Core Viewpoint - The article emphasizes the innovative capabilities of Li Auto's VLA (Vision Language Architecture) and its potential to significantly enhance autonomous driving technology through a combination of AI software and hardware integration, led by the company's founder, Li Xiang [2][3][4]. Group 1: Innovation and Technology - Li Auto's VLA represents a significant innovation at the MoE (Mixture of Experts) level, with a focus on original architecture and execution, drawing from contributions across the AI community [2]. - The integration of AI software with hardware has reached an industry-leading level, with a clear distinction between the rapid iteration capabilities of software and the slower evolution of hardware [3]. - The core of Li Auto's VLA is based on reinforcement learning, which allows for a more effective learning process compared to traditional imitation learning, enhancing the vehicle's decision-making capabilities [9][10]. Group 2: Leadership and Vision - Li Xiang plays a crucial role in the development of Li Auto's autonomous driving technology, similar to Elon Musk's influence at Tesla, ensuring the company remains adaptable to industry changes and resource allocation [4][5]. - The ability of Li Xiang to make key judgments regarding resource distribution and AI learning is vital for the company's long-term success and efficient resource utilization [4]. Group 3: Future Directions and Predictions - Key iterative directions for Li Auto's VLA include improving the speed, quality, and cost-effectiveness of simulation data, which is essential for reinforcement learning [8][12]. - The company aims to maximize the potential of existing vehicle hardware for autonomous driving while also exploring new chip technologies to enhance computational capabilities [13]. - Future advancements may involve online learning architectures that allow for real-time weight updates, significantly improving the model's adaptability and understanding of the physical world [13].
WAIC观察|仿真不稳、真机太贵?机器人数据最优解出现了吗
Di Yi Cai Jing· 2025-07-28 02:07
Core Viewpoint - The debate between the value of real-world data versus simulation data in robot training is intensifying, with industry leaders emphasizing the necessity of real data for complex tasks while acknowledging the cost-effectiveness of simulation data for simpler tasks [1][2][4]. Group 1: Importance of Real Data - Sergey Levine, co-founder of Physical Intelligence, argues that real-world data is essential for effective robot training, challenging the reliance on simulation data [1]. - Industry experts, such as Yao Maoqing from Zhiyuan Robotics, support Levine's view, stating that while some tasks can be trained using simulation, most complex tasks require real data [1][3]. - The CEO of Qingtong Intelligent, Li Tong, emphasizes that robots must be deployed in real environments to accumulate valuable training data, suggesting that a deployment scale of tens of thousands is necessary for effective data collection [3]. Group 2: Simulation Data Advantages - Companies like Galaxy General advocate for simulation data, claiming it allows for faster learning and lower costs, even enabling training without real data [2]. - The COO of Self-Variable Robotics, Yang Qian, acknowledges the role of simulation in training lower-body movements but stresses that real-world data is crucial for tasks involving complex interactions [10][12]. - The industry faces a dilemma in balancing the use of simulation and real data, with some companies using a 7:3 ratio of simulation to real data, while others prefer a 3:1 ratio favoring real data [9][10]. Group 3: Challenges and Future Directions - The industry is grappling with the technical challenge of integrating simulation and real data effectively, as highlighted by Chen Yuanpei from Lingchu Intelligent, who notes that data from different sources must be weighted differently [9]. - The consensus is that while simulation data is beneficial for initial training phases, real data is indispensable for achieving advanced capabilities in robots [10][12]. - Companies are increasingly focusing on building extensive real-world data sets to enhance their models, with Zhiyuan Robotics aiming to create a comprehensive dataset to support embodied intelligence [10][12].
英伟达对机器人下手了
远川研究所· 2025-03-20 12:35
Core Viewpoint - The article discusses the advancements in humanoid robotics and the role of NVIDIA in developing the necessary technologies, particularly focusing on the concept of "Physical AI" and the importance of simulation data for training robots [1][7][41]. Group 1: NVIDIA's Role in Robotics - NVIDIA is positioning itself as a key player in the humanoid robotics industry by developing a series of platforms and models, including the Cosmos training platform and the Isaac GR00T N1 humanoid robot model [3][4][19]. - The company has created a comprehensive ecosystem for humanoid robot development, including high-performance computing (DGX), simulation platforms (Omniverse), and inference chips (Jetson Thor) [19][31]. - NVIDIA's strategy involves not only selling hardware but also providing software tools and services to enhance the capabilities of humanoid robots [41][42]. Group 2: The Concept of Physical AI - The term "Physical AI" refers to the next wave of AI development, where robots are expected to understand physical laws and interact with the real world autonomously [8][41]. - Unlike traditional industrial robots that perform specific tasks, humanoid robots aim to understand and make decisions based on their environment, showcasing a significant leap in intelligence [10][13]. - The training of these robots requires vast amounts of simulation data that mimic real-world physics, filling the gap where real-world data is scarce [16][17][18]. Group 3: Simulation Data and Its Importance - Simulation data is crucial for training humanoid robots, as it allows for the creation of realistic scenarios that adhere to physical laws, which is essential for effective learning [16][18]. - The article compares real data to "real exam questions" and simulation data to "mock exams," emphasizing the need for high-quality simulation data to ensure effective training [18]. - NVIDIA's experience in gaming and simulation technologies positions it well to provide the necessary tools for creating this simulation data [23][30]. Group 4: Historical Context and Future Directions - NVIDIA's journey in high-performance computing has evolved from gaming to various high-value applications, including mobile devices, autonomous driving, and now humanoid robotics [32][39]. - The company has learned from past ventures, such as its experience with mobile processors, to focus on more promising markets like AI and robotics [36][38]. - As the demand for "Physical AI" grows, NVIDIA aims to solidify its position by offering integrated solutions that combine hardware and software for the robotics industry [41][43].