VLA路线
Search documents
华为吃高端,Momenta占中端:智驾的“圈地运动”谁能终结?
3 6 Ke· 2026-01-22 09:39
Core Insights - In 2025, the adoption of intelligent driving in China is expected to experience explosive growth, with L2 level vehicles' sales projected to reach a penetration rate of 66.1% by the end of the year, indicating that intelligent driving has become a standard feature in vehicles [1][2][3] Group 1: Market Trends - The intelligent driving industry is facing a significant downturn despite the growth in adoption, leading to a "survival of the fittest" scenario [2][3] - The competition is shifting focus from high-speed NOA (Navigation on Autopilot) to urban NOA, with over 3.129 million vehicles equipped with urban NOA sold from January to November 2025 [12][13] - Mainstream models priced below 300,000 yuan contributed 68.9% of urban NOA sales, indicating a move towards mass-market adoption [14][15] Group 2: Technological Pathways - Two main technological pathways are emerging: the "Vision-Language-Action" (VLA) route, which emphasizes rapid iteration and compatibility with existing hardware, and the "World Model" route, which focuses on deeper cognitive paradigms [5][7][10] - Companies like XPeng and Li Auto are strong proponents of the VLA route, while Huawei represents the World Model approach [6][9] Group 3: Competitive Landscape - The market is characterized by a trend of "self-research dominance" with a high concentration of third-party suppliers, where domestic brands accounted for 81.1% of urban NOA vehicle sales [18][19] - The collapse of companies like Haomo and the shift towards third-party suppliers highlight the challenges faced by automakers in self-research capabilities [20][21] - Leading third-party suppliers, such as Huawei and Momenta, dominate the market, with Momenta holding approximately 61.06% market share [25][26] Group 4: Future Outlook - The competition is expected to intensify, with predictions that only two or three intelligent driving companies may survive by 2026 [32] - The integration of software and hardware is becoming crucial for companies to build competitive advantages, with a focus on deep collaboration between chip design and software development [35][39] - Companies like Horizon Robotics are positioning themselves as challengers to the dominant players by targeting cost-sensitive markets and offering integrated solutions [44][47]
2025智驾“大逃杀”,谁能解决“长尾问题”?
Hu Xiu· 2025-09-05 07:25
Core Insights - The rapid commercialization of Vision-Language-Action (VLA) models is redefining the technical threshold for advanced intelligent driving [1] - The competition surrounding VLA will significantly influence the future competitive landscape of Chinese automotive companies and may lead to a reshuffling of the entire intelligent driving industry [7] Group 1: VLA Model Development - Li Auto has launched its VLA driver model with the flagship electric vehicle i8, while Yuanrong Qixing released its self-developed VLA model, DeepRoute IO 2.0, on August 26, covering approximately 200,000 vehicles [2] - XPeng Motors introduced a new generation VLA architecture at the P7 launch on August 27, claiming a latency of less than 100 ms and a planning frame rate of 20 Hz, setting a new benchmark for mass production [3] - The VLA model's ability to abstract and categorize real-world scenarios through language enhances its generalization capabilities compared to traditional end-to-end models [8][12] Group 2: Technical and Economic Challenges - The VLA model requires significant computational power, with Li Auto and XPeng utilizing cloud clusters of 13 EFLOPS and 8 EFLOPS, respectively, while many smaller companies are limited to 0.2-0.6 EFLOPS [14] - The data requirements for VLA are substantial, necessitating the collection and annotation of visual-language-action triplets, with 90% of the training data sourced from 2.93 billion kilometers of real vehicle logs [13] - The cost of training a VLA model can reach 12-15 million RMB per session, which is a significant portion of the annual R&D budget for smaller companies [15] Group 3: Industry Restructuring - The high costs associated with VLA models create survival challenges for smaller automotive companies, which may struggle to compete against larger firms with established technological advantages [19][29] - The transition from rule-based algorithms to VLA models requires a gradual and systematic approach, making it difficult for many second-tier brands to replicate the success of leading companies [21][23] - The VLA model's emergence may lead to a significant industry reshuffle, with many mid-tier companies potentially becoming "outsourcing providers" or low-end manufacturers [24][30] Group 4: Competitive Landscape - The VLA model's introduction is expected to alter the competitive dynamics, with companies like Huawei and Momenta currently holding a dominant market share in intelligent driving [45] - The VLA model's multi-modal learning and reasoning capabilities allow companies like Li Auto and XPeng to achieve performance levels close to those of larger competitors in long-tail scenarios [48] - The year 2025 could mark a pivotal moment for both leading companies and VLA practitioners, potentially leading to a reversal of fortunes in the intelligent driving market [51]
智元机器人首席科学家罗剑岚老师专访!具身智能的数采、仿真、场景与工程化
具身智能之心· 2025-07-30 00:02
Core Viewpoint - The interview with Dr. Luo Jianlan emphasizes the importance of real-world data in the development of embodied intelligence, highlighting the challenges and strategies in data collection, model training, and application deployment. Data Discussion - The company collaborates with multiple sensor suppliers focusing on the joint development of visual, tactile, and high-density sensors, while building a cross-platform data collection API for standardized data input [2] - Achieving a high performance rate of 95% for robots in real-world applications remains a significant challenge, particularly in household tasks [2] - The company uses 100% real machine data for training multimodal large models, agreeing with the notion that simulation environments have scalability limitations [2][3] - The cost of collecting real-world data is not the main issue; rather, the lack of standardized mechanisms for data collection is a core challenge [6] - The company acknowledges the data scarcity and performance optimization difficulties in both autonomous driving and robotics, emphasizing the need for high success rates in open environments [7] Evaluation of Embodied Large Models - There is currently no universal benchmark for evaluating embodied intelligence models due to significant differences in software and hardware environments across companies [9] - The evaluation of different large models is primarily based on their technical routes and the challenges they face in the current landscape [9][10] - The company aims to establish a unified real-machine testing platform to facilitate model evaluation across different scenarios [9] Embodied Intelligence Applications and Implementation - The deployment process for robots involves four steps: task modeling, scene migration, scene adaptation, and safety verification, emphasizing the importance of hardware-software collaboration [18] - High success rates are crucial, but challenges in generalization, robustness, and real-time performance must also be addressed [20] - Industrial environments are seen as the most promising for the initial large-scale deployment of embodied intelligence due to their structured nature and clear commercial demands [21] Future Outlook for Embodied Intelligence - The company aims for a "DeepSeek moment," focusing on achieving near 100% success rates and high-speed execution capabilities in future models [24] - The transition to a data-driven paradigm is recognized as a significant shift in the field, moving away from traditional hypothesis-driven approaches [25] - The potential of brain-like architectures is acknowledged, with ongoing exploration to combine computation with physical capabilities for future intelligent systems [26]
融资5亿,90后清华博导做机器人,「外界对我们有不少误解」
36氪· 2025-07-07 11:02
Core Viewpoint - The article discusses the innovative approach of "Xingdong Jiyuan" in developing both the hardware and software aspects of humanoid robots, emphasizing the importance of a comprehensive system that integrates both components for achieving general-purpose robotics [3][4][8]. Company Overview - "Xingdong Jiyuan" was founded in August 2023 by Chen Jianyu, an assistant professor at Tsinghua University, and has raised nearly 500 million yuan in Series A funding by July 2025 [3][4]. - The company has rapidly developed several robotic products, including dexterous hands, wheeled robots, and full-sized humanoid robots, which has led to some misconceptions about its focus being solely on hardware [3][9]. Technology and Research - Chen Jianyu has a unique interdisciplinary background, having studied both hardware and algorithmic aspects of robotics, which is rare among founders in the field [6][8]. - The company has developed a software model called VLA (Vision-Language-Action) that integrates understanding and generative capabilities, allowing robots to deeply comprehend and predict their environment [8][10]. - The hardware development focuses on modular and universal robot products, enabling flexibility in design and application [8][9]. Business Strategy - The company adopts a "laying eggs along the way" strategy, selling individual components like dexterous hands to reduce costs and gather data for further development [10][30]. - As of June 2025, "Xingdong Jiyuan" has delivered over 200 products and has numerous orders in production, with clients including major tech companies [10][48]. Market Position and Future Outlook - The company aims to target high-value, reusable scenarios in both industrial and service sectors, with a focus on humanoid robots that can perform complex tasks [49][50]. - The future of robotics is seen as a gradual integration into everyday life, with expectations for household robots to emerge within the next 3-5 years [63][64]. - The competitive landscape in robotics is expected to be diverse, with multiple players due to the varied applications and hardware requirements [68].