Workflow
数据闭环
icon
Search documents
搞自驾这七年,绝大多数的「数据闭环」都是伪闭环
自动驾驶之心· 2026-01-08 05:58
Core Viewpoint - The concept of "data closed loop" in the autonomous driving industry is still largely limited to small internal loops within algorithm teams, rather than achieving the grand vision of a comprehensive system that directly solves problems through data [1]. Group 1: Definition of "True Data Closed Loop" - A "true closed loop" must meet three levels: automated problem discovery, quantifiable and reviewable solution effects, and a comprehensive trigger system that integrates real-time and historical data [4][5]. - The ideal state involves a system that can automatically classify issues, route them to the appropriate teams, and assist in developing trigger rules, thereby reducing reliance on manual processes [5]. Group 2: Current Industry Practices - Many companies' so-called "data closed loops" are more accurately described as "data-driven development processes with some automation tools," primarily limited to the perspective of individual algorithm teams [8]. - Typical workflows are often module-level and algorithm-focused, lacking a system-wide perspective [9]. Group 3: Reasons for Lack of True Closed Loops - The starting point for many companies is a "passive closed loop," where problems are identified reactively rather than through automated data analysis [10]. - Attribution of issues is often difficult, as multiple interrelated factors contribute to the same phenomenon [12]. - The data-to-solution chain often stops at data-to-model, failing to address real-world problems effectively [16]. Group 4: Data Closed Loop Practices - The company has developed a more aggressive approach to data closed loops, treating data as a product and metrics as primary citizens [24]. - The overall strategy involves quantifying real-world pain points and using triggers to convert these into actionable data [25]. Group 5: Trigger Mechanism - The trigger mechanism is designed to be lightweight and high-recall, ensuring that significant events are captured without overwhelming the system [32]. - Once a trigger is activated, it generates a micro log that is uploaded for further analysis, leading to more detailed data collection if necessary [35]. Group 6: Unified Trigger Framework - A unified trigger framework using Python allows for consistent implementation across vehicle data mining, cloud data analysis, and simulation validation [50]. - This framework enables non-technical team members to participate in writing rules, thus democratizing the process of data analysis [54]. Group 7: Distinction Between World Labels and Algorithm Labels - The company maintains two types of labels: world-level labels that describe objective physical conditions and model-level labels that depend on algorithm performance [61]. - This distinction is crucial for effective data analysis and problem-solving in the autonomous driving context [61]. Group 8: Use of Generative and Simulation Data - Generative data is primarily used to address long-tail scenarios that are difficult to encounter in real life, but real data remains essential for evaluation and validation [67]. - The company emphasizes the importance of filtering data through structured labels before applying vector retrieval methods to ensure efficiency and accuracy [64].
智驾的2025:辞旧迎新的一年
自动驾驶之心· 2026-01-04 01:04
Core Viewpoint - The article discusses the evolution of the autonomous driving industry in 2025, highlighting the dual focus on technology proliferation and technical challenges, with traditional automakers pushing for accessibility and new players striving for technological advancements [4][5]. Group 1: Industry Trends - In 2025, traditional automakers like BYD, Geely, and Chery are leading the charge in making autonomous driving technology more accessible by integrating mid-level highway NOA features into vehicles priced over 100,000 yuan [4]. - New entrants and leading autonomous driving suppliers are focused on pushing the limits of technology, adhering to a model of annual technological iteration [4][5]. - The industry is witnessing a bifurcation, with one camp focused on accessibility and the other on technological challenges, particularly in the realm of algorithm development [4]. Group 2: Technological Advancements - The transition from "passive perception" to "active cognition" is marked by the introduction of world models, which represent a significant paradigm shift in autonomous driving technology [5][6]. - 2025 is characterized as a year of significant technological transition, with the widespread adoption of end-to-end systems and the emergence of world models and VLA (Vision-Language-Action) technologies [6][9]. - NIO is highlighted as a pioneer in the world model space, having launched its world model in 2024, transitioning from "perception-driven" to "cognition-driven" systems [5][6]. Group 3: Data Infrastructure and Chip Development - The importance of data infrastructure is emphasized, with companies like NIO benefiting from early investments in data collection and model training capabilities [7][8]. - The year 2025 is noted as a pivotal year for integrated hardware and software solutions, with companies like NIO and XPeng achieving self-developed chip integration [7][8]. - The article warns of the risks associated with outsourced chip development, contrasting it with NIO's genuine self-development efforts, which involve significant technical team investments [8]. Group 4: Regulatory and Market Dynamics - The issuance of L3 licenses is seen as a significant step towards the next phase of autonomous driving, indicating a shift from L2+ mass production to L3 and L4 capabilities [8][9]. - While traditional automakers have secured initial L3 licenses, their capabilities are questioned, suggesting that true advancements will come from new players and those with strong model capabilities [9][10]. - The ultimate value of autonomous driving technology is framed around enhancing driver convenience and significantly reducing traffic accidents, with a focus on safety as a primary goal [9].
搞自驾这七年,绝大多数的「数据闭环」都是伪闭环
自动驾驶之心· 2025-12-29 09:17
Core Viewpoint - The concept of a "true data closed loop" in the autonomous driving industry is still far from realization, with most current implementations being limited to small, internal loops within individual algorithm teams rather than the comprehensive systems envisioned in early presentations [1]. Group 1: Definition of a True Data Closed Loop - A true data closed loop should automate problem discovery, allowing systems to identify anomalies from vast operational data without relying on manual feedback [4]. - The effectiveness of solutions must be quantifiable and reviewable, requiring a comprehensive trigger system that integrates real-time and historical data analysis [5]. - The system should continuously assess whether the investments in data, computing power, and development yield satisfactory results [5]. Group 2: Current Industry Practices - Many companies currently operate under a "data-driven development process with some automation tools," which are often limited to the perspectives of individual algorithm teams [8]. - Typical workflows are more about module-level, algorithmic closed loops rather than a holistic system-level approach [9]. Group 3: Challenges in Achieving True Data Closed Loops - Many existing systems are reactive rather than proactive, relying on manual identification of issues rather than automated detection [10]. - Attribution of problems is often difficult, as multiple interrelated factors contribute to issues, making it hard to pinpoint the source of a problem [12]. - The transition from data to actionable solutions often halts at the model training stage, lacking a clear connection to real-world problems [16]. - The degree of "self-healing" in current systems is limited, with many platforms resembling automated production lines rather than self-correcting systems [17]. - Organizational structures often fragment the closed loop, leading to communication issues between teams [18]. Group 4: Practical Implementation of Data Closed Loops - The company has developed a more aggressive approach to data closed loops, treating data as a product and metrics as primary citizens [24]. - The methodology emphasizes quantifying real-world pain points and ensuring all critical incidents are recorded accurately [26]. - A micro log and mini log mechanism is employed to capture high-recall, low-overhead data from vehicles, focusing on significant driving events [30]. - The system allows for dynamic control of data mining tasks based on real-time needs, ensuring flexibility in data collection [59]. Group 5: Distinction Between World Labels and Algorithm Labels - The company maintains two types of labels: world-level labels that describe the physical environment and model-level labels that reflect algorithm performance [61]. - This distinction is crucial for effective data analysis and problem-solving, ensuring that the focus remains on real-world scenarios rather than solely on algorithmic outputs [61]. Group 6: Use of Generative and Simulation Data - Generative data is utilized to address long-tail scenarios that are difficult to encounter in reality, but it is not a substitute for real-world evaluation [67]. - The company emphasizes that while recall rates may improve with generative data, the potential for increased false positives must be carefully monitored [70].
汽车帮热评:工信部发放L3准入资格意味着什么
Group 1 - The core viewpoint of the article highlights a significant shift in the competitive landscape of the Chinese automotive market, driven by the issuance of L3-level conditional autonomous driving licenses and Tesla's plans for L4 Robotaxi operations [1][2][3] Group 2 - The focus of competition is shifting from hardware capabilities to a dual-track ability involving regulations and operations, as traditional automakers can only sell L3 vehicles in limited scenarios, while Tesla aims to monetize its vehicles directly through L4 operations [1][2] - The technological competition is evolving into a race for data closure, where L3 vehicles are limited by the data they can collect, whereas Tesla's Robotaxi can generate vast amounts of real-world data for rapid model iteration [2] - The industry is transitioning from price wars to a dual screening process based on qualifications and funding, with L3 licenses only granted to companies that meet stringent safety and operational criteria, effectively sidelining smaller manufacturers [3]
理想下一步的重点:从数据闭环到训练闭环
自动驾驶之心· 2025-12-14 02:03
Core Insights - The article discusses the evolution of autonomous driving technology, highlighting the transition from data closed-loop systems to training closed-loop systems, marking a new phase in autonomous driving development [18][21]. Group 1: Development of Autonomous Driving Technology - The development trajectory of Li Auto's intelligent driving has evolved from rule-based systems to AI-driven E2E+VLM dual systems and VLA, with a focus on navigation as a key module [6]. - Li Auto has accumulated 1.5 billion kilometers of driving data, utilizing over 200 triggers to produce 15-45 second clip data [11]. - The end-to-end mass production version MPI has increased to over 220, representing a 19-fold increase compared to the version from July 2024 [13]. Group 2: Data Closed-Loop and Its Limitations - The data closed-loop process includes shadow mode validation, data mining in the cloud, automatic labeling of effective samples, and model training, with data return achievable in one minute [9][10]. - Despite the effectiveness of the data closed-loop, it cannot address all issues, particularly long-tail scenarios such as traffic control and sudden lane changes [16]. Group 3: Transition to Training Closed-Loop - The core of the L4 training loop involves VLA, reinforcement learning (RL), and world models (WM), optimizing trajectories through diffusion and reinforcement learning [23]. - Key technologies for closed-loop autonomous driving training include regional simulation, synthetic data, and reinforcement learning [24]. Group 4: Advances in Reconstruction and Generation - Li Auto has made significant advancements in reconstruction and generation, with multiple top conference papers published in the past two years [28][34]. - The company has developed a feedforward 3D generation system that eliminates the need for point cloud initialization, directly producing results from visual inputs [29]. Group 5: Challenges and System Capabilities - The interactive agent is identified as a key challenge in the training closed-loop [40]. - System capabilities are enhanced by the world model providing simulation environments, diverse scene construction, and accurate feedback from reward models [41].
“智驾普及元年”年终大考:奇瑞猎鹰智驾的承诺兑现了吗?
Tai Mei Ti A P P· 2025-11-28 14:16
Core Insights - The article highlights the transition of China's intelligent driving industry from concept to practical application, with Chery's commitment to its intelligent driving strategy serving as a milestone [1][3]. Industry Overview - By 2025, the Chinese intelligent driving industry is expected to shift from "parameter competition" to "real-world validation," with consumer expectations evolving from "availability" to "usability" and "reliability" [3]. - The current stage of the industry is characterized by both technological breakthroughs and challenges in implementation [4]. Chery's Commitment - Chery's chairman publicly committed to equipping all models with the Falcon intelligent driving assistance system within the year, a move that sparked industry discussions due to the previous trend of high-level intelligent driving features being limited to premium models [3][6]. - As of the end of the year, Chery successfully integrated the Falcon system across all models, demonstrating its technical capabilities through real-world testing in complex driving conditions [3][6]. Challenges in Intelligent Driving - Many automakers face issues such as "feature reduction," "delayed functionality," and limitations to high-end models when delivering intelligent driving features [5]. - Current intelligent driving systems exhibit significantly higher error rates on unstructured roads compared to structured ones, with failure rates being 3-5 times higher [5]. Technical Foundation of Falcon Intelligent Driving - The Falcon system's success is attributed to a collaborative foundation of data, algorithms, and hardware, creating a "data loop - algorithm breakthrough - hardware redundancy" structure [7]. - Chery's Tianqiong Intelligent Computing Center has accumulated over 24 billion kilometers of driving assistance data, enhancing the system's adaptability across various road conditions [7][10]. Algorithm and Hardware Integration - The Falcon system utilizes the Momenta R6 reinforcement learning model, which allows for rapid decision-making in unforeseen scenarios, enhancing its performance in complex environments [10][11]. - The hardware setup includes a combination of sensors, ensuring reliable perception in challenging conditions, while the system's computational power is optimized for efficient data processing [12][14]. Long-term Strategy and Collaboration - Chery's approach to intelligent driving is rooted in a long-term commitment to technology development, having invested in intelligent technology since 2010 [17][19]. - The company employs a collaborative ecosystem model, partnering with various tech firms to enhance its capabilities while maintaining core technology independence [19]. Future Outlook - Chery aims to achieve end-to-end integration of its intelligent driving system by 2026, with ongoing updates to enhance functionality [21]. - The intelligent driving industry is moving towards a phase of "refined cultivation," focusing on real-world validation and user-centric solutions [22].
中国智驾打响残酷突围战
Hua Er Jie Jian Wen· 2025-11-27 12:17
Core Insights - The Chinese intelligent driving industry is undergoing a significant reshuffle, highlighted by the suspension of the once-prominent unicorn, Haomo Zhixing, while competitors like Yuanrong Qixing and Zhuoyu are gaining market share and investment support [1][2][5] Company Analysis - Haomo Zhixing, originally a spin-off from Great Wall Motors, received substantial early-stage funding but has struggled to maintain momentum, with its last financing round occurring in early 2024 without support from its former backer [2][3] - The company's choice of Qualcomm Snapdragon Ride chips over the industry-standard NVIDIA Orin has hindered its ability to adapt to new technological trends, leading to operational inefficiencies [3][4] - Great Wall Motors has shifted its focus to other suppliers, notably investing $100 million in Yuanrong Qixing, indicating a loss of confidence in Haomo Zhixing's capabilities [5][6] Industry Trends - The competitive landscape has evolved, with a focus on achieving a scale of one million vehicles to generate valuable data for algorithm development, moving beyond flashy demonstrations to practical data-driven solutions [7][10] - Companies like Yuanrong Qixing and Horizon Robotics are positioning themselves as strategic partners rather than mere component suppliers, emphasizing the importance of data access and integration [8][9] - The industry is witnessing a consolidation of market share among leading players, with predictions that only a few companies will dominate the market by 2025 [14][15] Future Outlook - The intelligent driving sector is transitioning from an optional feature to a core asset for automotive companies, with a clear divide emerging between those who can leverage large-scale data and those who cannot [14][15] - The ultimate goal for many companies is to develop systems that not only enhance vehicle performance but also contribute to broader applications in robotics and artificial intelligence [12][13]
一周一刻钟,大事快评(W130):数据闭环
Investment Rating - The industry investment rating is "Overweight" indicating a positive outlook for the sector compared to the overall market performance [8]. Core Insights - The report emphasizes that intelligence will be a key theme in the market for 2026, with investment opportunities extending beyond smart driving to areas like Robotaxi. A data closed loop is identified as the core starting point for achieving full-stack self-research, which differs fundamentally from mere data collection [1][3]. - The establishment of a data closed loop is crucial for filtering effective information from massive data, enabling machines to understand data, feedback to correct models, and perform OTA updates for secondary verification. This requires not only data ownership but also the ability to identify data gaps and utilize data to enhance models [1][3]. - The report suggests that the scale of the data closed loop team (e.g., whether it reaches a hundred members) and related investments should be key indicators for assessing a company's commitment and capability for self-research [1][3]. Summary by Sections Data Closed Loop - The report highlights that when algorithm models are truly driven by PB-level data, it will create a competitive barrier that is difficult to replicate. Even if competitors acquire model architectures or poach key personnel, lacking a substantial underlying data accumulation will hinder their ability to replicate similar algorithm capabilities in the short term [2][4]. - Building a solid data closed loop is expected to provide companies with a certainty of competitive advantage for six months to a year. Companies like Xiaopeng, Li Auto, and Huawei are noted to have established a leading advantage in the smart driving sector, with a high degree of technical moat [2][4]. Investment Recommendations - The report recommends focusing on domestic strong alpha manufacturers such as BYD, Geely, and Xiaopeng, as well as companies that represent the trend of intelligence like Huawei's HarmonyOS. Attention is also drawn to companies like JAC Motors and Seres, with specific recommendations for Li Auto, Kobot, Desay SV, and Jingwei Hengrun [2]. - For state-owned enterprise integration, the report suggests monitoring SAIC Motor, Dongfeng Motor Group, and Changan Automobile. Additionally, it highlights component companies with strong performance growth and capabilities for overseas expansion, recommending Fuyao Glass, New Spring, Fuda, Shuanghuan Transmission, and Yinlun [2].
南开-镁信健康精算科技实验室发布mind42.ins
Bei Jing Shang Bao· 2025-11-17 01:52
Core Insights - The health insurance industry is undergoing a significant transformation driven by AI technology, which is reshaping decision-making processes and reconstructing the entire value chain of health insurance [1] Data Complexity and Decision Challenges - The health insurance sector faces structural data complexity and a lack of standardization, leading to challenges such as product innovation stagnation, rough risk pricing, and low operational efficiency [2] - Traditional product design relies on limited epidemiological data and reinsurer rate tables, making it difficult for insurers to achieve precise risk differentiation [2] - Underwriting processes are hindered by cumbersome health disclosure requirements and inconsistent evaluation standards across companies, leading to subjective decision-making that affects efficiency and accuracy [2] Claims Operations Challenges - Claims operations are complicated by intricate insurance liability clauses and drug indication restrictions, resulting in low efficiency in manual reviews and increased operational costs [3] From Concept Validation to Industry Application - The launch of the mind42.ins commercial health insurance decision support model marks a shift from concept validation to industrial application of AI technology in the health insurance field [4] Product Development Innovations - The mind42.ins model enables a paradigm shift from "experience-based" to "data-driven" approaches in product development, allowing for precise market analysis and user demand identification [6] - The system can automatically generate sales support tools, streamlining the creation of marketing materials and enhancing the connection from product concept to sales [8] Efficiency and Data Loop Enhancements - The mind42.ins model significantly improves efficiency in underwriting and claims processes, with some routine claims review times reduced from days to seconds [8] - It establishes a complete data loop that tracks product market performance and claims data, providing valuable insights for product optimization and contributing to the industry's knowledge base [9] Systemic Transformation in the Industry - The anticipated deep application of mind42.ins is expected to trigger systemic changes in the health insurance industry, shortening product innovation cycles from months to weeks and increasing the success rate of innovations [10] - The model redefines the competitive landscape by offering standardized intelligent decision-making capabilities, benefiting insurance companies of all sizes, particularly small and medium-sized firms [10] - It fosters innovation in health insurance business models, enabling the development of personalized products based on precise risk identification and pricing capabilities [10] Future Developments - As technology evolves, mind42.ins is transitioning from a decision support tool to an industry-level infrastructure, integrating diverse data sources for a more comprehensive user health profile [11] - The system is expected to enhance reasoning capabilities and domain knowledge, allowing it to handle more complex decision scenarios, ultimately leading to more efficient, diverse, and personalized insurance products [11]
理想ICCV'25分享了世界模型:从数据闭环到训练闭环
自动驾驶之心· 2025-11-07 00:05
Core Insights - The article discusses the advancements in autonomous driving technology, particularly focusing on the transition from data closed-loop systems to training closed-loop systems, marking a new phase in autonomous driving development [18][21]. Group 1: Development of Ideal Auto's Intelligent Driving - Ideal Auto's intelligent driving has evolved through various stages, from rule-based systems to AI-driven E2E+VLM dual systems and VLA, with a strong emphasis on navigation as a key module [6]. - The current end-to-end mass production version of MPI has reached over 220, representing a 19-fold increase compared to the version from July 2024 [13]. Group 2: Data Closed-Loop Value - The data closed-loop process includes shadow mode validation, data feedback to the cloud for mining, automatic labeling of effective samples, and model training, with data return achievable in one minute [9][10]. - Ideal Auto has accumulated 1.5 billion kilometers of driving data, utilizing over 200 triggers to produce 15-45 second clip data [11]. Group 3: Transition to Training Closed-Loop - The core of the L4 training loop involves VLA, reinforcement learning (RL), and world models (WM), optimizing trajectories through diffusion and reinforcement learning [23]. - Key technologies for closed-loop autonomous driving training include regional simulation, synthetic data, and reinforcement learning [24]. Group 4: Reconstruction and Generation Work - Ideal Auto has made significant progress in reconstruction and generation, with multiple top conference papers published in the last two years [28][32][34]. - The generation applications range from scene editing to scene migration and scene generation [36]. Group 5: Interactive Agents and System Capabilities - The development of interactive agents is highlighted as a critical challenge in the training closed-loop [40]. - System capabilities are enhanced through world models providing simulation environments, diverse scene construction, and accurate feedback from reward models [41]. Group 6: Community and Collaboration - The article mentions the establishment of nearly a hundred technical communication groups related to various autonomous driving technologies, with a community of around 4,000 members and over 300 companies and research institutions involved [50][51].