Workflow
VLA架构
icon
Search documents
揭秘长城智驾自研:元戎、Momenta「抬轿」与千人自研团队
雷峰网· 2026-03-02 00:43
Core Viewpoint - Great Wall Motors is adopting a dual strategy in the autonomous driving sector, focusing on both self-research and collaboration with external suppliers to enhance its technological capabilities and market position [2][5][11]. Group 1: Autonomous Driving Strategy - The competition in autonomous driving is intensifying, with Great Wall Motors planning to release a VLA (Vision-Language-Action) model by the end of 2025, positioning itself as one of the few companies to implement this technology [2][5]. - Great Wall Motors is expanding its supplier network to cover vehicle price ranges from 100,000 to 400,000 yuan, with plans to implement three major computing platforms starting in 2025 [5][9]. - The ADC 2.0 platform targets mainstream models like Haval, utilizing TI chips and Qualcomm platforms, and aims to enhance capabilities in high-speed scenarios [6][7]. Group 2: Research and Development - Great Wall Motors has established a self-research team of over 1,000 people, focusing on both autonomous driving and cockpit technologies, with a significant emphasis on self-research despite ongoing collaborations [10][11]. - The company has been investing heavily in R&D, with autonomous driving research expenses accounting for 50% of total R&D investments, amounting to 1 billion yuan annually [12][13]. - The establishment of the 九州超算中心 (Jiuzhou Supercomputing Center) aims to provide substantial computational support for the development of large models, with a total computing power of 5 EFLOPS [15]. Group 3: Market Performance - The sales of the Haval Shuanglong Max, which features autonomous driving capabilities, show a balanced order structure between smart and non-smart versions, indicating a growing consumer interest in smart features [9]. - The sales of the Wey brand reached 102,000 units in 2025, marking an 86% year-on-year increase, attributed to the successful implementation of end-to-end models and VLA technology [9].
智驾圈都在等何小鹏
3 6 Ke· 2026-02-26 03:03
Core Insights - The article discusses the strategic shift in Xiaopeng Motors' autonomous driving technology, emphasizing the need to innovate and adapt to stay ahead of competitors like Li Auto and Huawei [7][16][29] - The transition from a modular approach to a more integrated Vision-Language-Action (VLA) architecture is highlighted as a critical move for Xiaopeng to enhance its autonomous driving capabilities [17][19][22] Group 1: Leadership and Organizational Change - Xiaopeng Motors has undergone multiple leadership changes, reflecting the need for organizational evolution in response to the fast-paced autonomous driving landscape [5][56] - The current leadership under Liu Xianming is focused on dismantling previous frameworks to foster innovation and agility within the company [46][57] - The article suggests that true progress in autonomous driving requires a willingness to challenge existing structures and embrace self-revolution [6][58] Group 2: Technological Evolution - The VLA architecture represents a significant advancement, moving from a two-step language translation process to a more direct integration of visual and language inputs for decision-making [17][19][22] - This new approach aims to reduce information loss and improve model efficiency, allowing for better performance in complex driving scenarios [22][25] - Xiaopeng's VLA 2.0 model is designed to enhance the vehicle's ability to operate without traditional navigation or textual instructions, showcasing a leap in autonomous capabilities [22][28] Group 3: Competitive Landscape - The article notes that Xiaopeng's challenges are reflective of broader trends in the autonomous driving industry, where many players are racing to innovate and improve their technologies [3][16] - The competitive environment is characterized by rapid advancements, with companies like Tesla and others also pushing the boundaries of autonomous driving capabilities [28][56] - Xiaopeng's ability to leverage its U.S. research center and retain talent is seen as a crucial factor in maintaining its competitive edge [30][35][39]
AI重塑汽车产业:从CES 2026看智能化革命的深度演进
Core Insights - The CES 2026 event highlighted the accelerated transformation of the automotive industry towards smart technology, with AI becoming a core element in redefining automotive products [1] - The concept of "smart driving equity" emerged as a key theme in 2025, with advanced driving systems becoming accessible across a wider range of vehicle price points [2] - The rapid proliferation of smart driving technology necessitates new requirements for vehicle after-sales service systems, leveraging AI for efficient diagnostics [3] - The commercialization of Robotaxi services is gaining momentum, with significant growth in revenue and operational scale in major Chinese cities [5] - The evolution of smart cockpits is transforming vehicles into personalized assistants, enhancing user interaction and experience [6] - AI-driven marketing services are reshaping automotive marketing from broad outreach to precision targeting [7] - The competitive landscape is shifting towards ecosystem collaboration among automotive companies and technology suppliers, fostering innovation and efficiency [9] - The future of the automotive industry will be defined by AI, with a focus on collaborative ecosystems and comprehensive digital transformation [10] Group 1: Smart Driving Technology - The introduction of advanced driving systems like BYD's "Heavenly Eye" and Xiaopeng's XNGP is making smart driving features available in lower-priced vehicles [2] - The VLA architecture enables a more human-like decision-making process in driving, significantly reducing reliance on real vehicle data for testing [2] - The cost reduction in high-level driving systems, such as the integration of solid-state LiDAR, is facilitating their adoption in more affordable models [2] Group 2: After-Sales Service Innovations - The establishment of an AI-driven diagnostic system has achieved a 98.1% accuracy rate in diagnosing electric vehicle systems, significantly improving service efficiency [3] - The AI system reduces the time to address individual faults to approximately 20 seconds, enhancing the overall after-sales service experience [3] Group 3: Commercialization of Robotaxi - Major cities in China are rapidly expanding Robotaxi operations, with Wuhan leading in testing mileage and population coverage [5] - Companies like Pony.ai and WeRide are reporting substantial revenue growth in their Robotaxi services, indicating a shift towards profitability [5] - The market for Robotaxi services is projected to grow significantly, with estimates reaching $4.7 billion by 2035 [5] Group 4: Smart Cockpit Developments - The integration of advanced AI models in vehicle cockpits is transforming them into intelligent assistants capable of understanding user emotions and preferences [6] - The trend towards end-to-end AI solutions in cockpits is expected to enhance user experience and interaction [6] Group 5: Marketing and Service Automation - AI-driven marketing tools are enabling automotive brands to identify high-intent customers and streamline communication processes [7] - The use of AI in customer outreach has led to improved engagement rates and more efficient service delivery [8] Group 6: Ecosystem Collaboration - Automotive companies are increasingly forming strategic partnerships with technology suppliers to enhance their capabilities and reduce R&D costs [9] - The collaborative model is leading to faster product iteration cycles and improved technology deployment [9] Group 7: Future Outlook - The automotive industry is entering a new era defined by AI, with a focus on collaborative ecosystems and comprehensive digital transformation [10] - The competition will center around the efficiency of ecosystem collaboration and the ability to innovate rapidly [10]
从“地大华魔”掉队,卓驭科技在智驾平权浪潮下另觅出路
第一财经网· 2026-01-12 10:24
Core Insights - The competitive landscape in China's intelligent driving sector is undergoing significant changes, with a clear polarization among suppliers as cost competition intensifies [1][2] Group 1: Market Dynamics - Momenta and Huawei HI together hold over 80% market share in the urban NOA third-party supplier segment for passenger cars in China by October 2025, leaving only 19.2% for other suppliers, including Zhuoyue Technology [1] - The penetration rate of intelligent driving in China's passenger vehicles has exceeded 68%, with high-level driving solutions being pushed down to the 100,000 to 150,000 yuan market segment [2] Group 2: Zhuoyue Technology's Position - Zhuoyue Technology, originally benefiting from low-cost advantages, is now showing signs of lagging behind competitors, with its main deployment still relying on fuel vehicles from Volkswagen [1][3] - The company has announced over 50 mass-produced cooperative models, but market performance varies significantly, with some models failing to boost sales [3] Group 3: Competitive Pressures - The competition in the low-cost intelligent driving sector is intensifying, with new entrants like BYD and Horizon aiming to offer high-level driving solutions at lower price points [4] - Zhuoyue Technology's reliance on Volkswagen is seen as a potential weakness, as the company needs to diversify its partnerships to scale its intelligent driving solutions [3] Group 4: Future Strategies - Zhuoyue Technology is exploring new business avenues, including heavy-duty trucks and unmanned logistics vehicles, to seek new growth points [6] - The company plans to launch heavy-duty trucks equipped with its high-speed NOA by mid-2026, collaborating with firms like XCMG and Shaanxi Automobile [6]
卓驭创始人沈劭劼:2026,智驾要从“端到端” 到“端到所有地方”
Xin Lang Cai Jing· 2026-01-11 05:53
Core Insights - The autonomous driving industry is experiencing significant turbulence, with companies like Maomao Zhixing facing collapse despite strong backing and funding, while others like Zhuoyu Technology secure substantial investments [2] - The competitive landscape has shifted from rule-driven to data-driven models, emphasizing the importance of rapid iteration and efficiency in development cycles [3][4] Company Developments - Zhuoyu Technology announced a strategic investment exceeding 3.6 billion yuan from China FAW, highlighting its growth amidst industry challenges [2] - The founder of Zhuoyu, Shen Shaojie, noted that the company's model iteration cycle has been reduced to weekly updates, significantly improving project delivery times from six months to just over one month [3] Industry Trends - Companies that fail to transition to a data-driven development paradigm are at risk of being eliminated from the market [4][5] - The core competitive factor in the intelligent driving sector is the ability to integrate data-driven approaches with traditional manufacturing processes [5] Transformation Challenges - Transitioning to a data-driven model has been challenging for teams traditionally focused on rule-based systems, as exemplified by Zhuoyu's decision to delete its original codebase [6] - The company has shifted its safety protocols from relying on numerous rules to a comprehensive evaluation system, emphasizing data quality over quantity [6] Engineering and Operational Changes - The integration of data-driven methodologies into all aspects of operations is crucial for the success of intelligent driving solutions [7] - Zhuoyu's engineering processes have evolved, with a focus on maintaining a disciplined approach to problem-solving without adding rules that could complicate models [10] Future Outlook - The competition in the intelligent driving industry is expected to intensify, with significant breakthroughs anticipated in 2026 [10][11] - Zhuoyu aims to expand its technology across various vehicle models and scenarios, leveraging a "base model" strategy that allows for customization by automotive manufacturers [13]
硬科技冲高,机器人行情火热,昊志机电涨超6%,机器人ETF基金(159213)冲击五连阳,连续3日强势吸金超6300万元!人形机器人"黄金十年"启幕?
Sou Hu Cai Jing· 2025-12-30 03:42
Core Viewpoint - The human-shaped robot and embodied intelligence industry is experiencing rapid growth, with the establishment of a standardization committee aimed at addressing the lagging standards and high collaboration costs in the sector [3]. Group 1: Market Performance - The Shanghai Composite Index opened lower but showed signs of recovery, with the Robot ETF Fund (159213) rising by 0.67%, marking a potential five-day winning streak and attracting a net subscription of 20 million yuan [1]. - The Robot ETF Fund has seen strong inflows, accumulating over 63 million yuan in the last three trading days [1]. - The index's constituent stocks exhibited mixed performance, with notable gains from companies like New Times reaching the daily limit and Haoshi Electric rising over 6% [6]. Group 2: Industry Developments - The establishment of the standardization committee for human-shaped robots and embodied intelligence is a significant step towards enhancing high-quality standard supply and promoting the maturation and application of related technologies [3]. - The committee will focus on developing industry standards across various domains, including common foundational technologies, components, systems, and safety, to guide healthy industry development [3]. Group 3: Future Outlook - The industry is expected to transition from "0-1" to "1-10" by 2025, focusing on technology convergence, with a shift towards mass production and commercialization anticipated in 2026 [4]. - Key milestones for 2026 include the completion of hardware platform design for Tesla's Gen2.5 robot and the initiation of large-scale manufacturing by August [8]. - The human-shaped robot sector is projected to experience a significant upward trend, driven by policy support and industry advancements, with potential IPOs for leading domestic companies in the first half of 2026 [8][10]. Group 4: Technological and Policy Insights - The evolution of models and hardware in the robotics sector is crucial, with real data becoming a core productivity driver and the VLA architecture expected to dominate applications by 2025 [9]. - The transition from industrial robots to general-purpose robots is underway, with applications expanding beyond data collection and education to include industrial and logistics sectors [9]. - Global policies are increasingly recognizing the importance of general-purpose robots, with major economies elevating the sector to a national strategic level, providing a clear development outlook and long-term certainty for the industry [10].
FSD v14很有可能是VLA!ICCV'25 Ashok技术分享解析......
自动驾驶之心· 2025-10-24 00:04
Core Insights - Tesla's FSD V14 series has shown rapid evolution with four updates in two weeks, indicating a new phase of accelerated development in autonomous driving technology [4][5] - The transition to an end-to-end architecture from version 12 has sparked industry interest in similar technologies, emphasizing the importance of a unified neural network model for driving control [7][9] Technical Advancements - The end-to-end system reduces intermediate processing steps, allowing for seamless gradient backpropagation from output to perception, enhancing overall model optimization [7] - Ashok highlighted the complexity of encoding human value judgments in autonomous driving scenarios, showcasing the system's ability to learn from human driving data to make nuanced decisions [9] - Traditional modular systems face challenges in defining interfaces for perception and decision-making, while end-to-end models minimize information loss and improve decision-making in rare scenarios [11][13] Data Utilization - Tesla's data engine collects vast amounts of driving data, generating the equivalent of 500 years of driving data daily, which is crucial for training the FSD model [18][19] - The company employs complex mechanisms to gather data from rare scenarios, ensuring the model can generalize effectively [19] Model Structure and Challenges - The ideal end-to-end model structure involves high-dimensional input data (e.g., 7 channels of 5 million pixel camera video) mapped to low-dimensional output signals, presenting significant training challenges [16] - The end-to-end system's architecture is designed to ensure interpretability and safety, avoiding the pitfalls of being a "black box" [20][22] Evaluation Framework - A robust evaluation framework is essential for end-to-end systems, focusing on closed-loop performance and the ability to assess diverse driving behaviors [32][34] - Tesla's closed-loop simulation system plays a critical role in validating the correctness of the end-to-end policy and generating adversarial samples for model testing [36][38] Future Implications - The integration of Tesla's simulation capabilities into robotics suggests potential advancements in embodied AI, enhancing the versatility of AI applications across different domains [40][42]
FSD V14深度解析!自动驾驶AI的觉醒时刻?
自动驾驶之心· 2025-10-17 16:04
Core Insights - The article discusses the advancements and features of Tesla's Full Self-Driving (FSD) version 14.1, highlighting its potential to achieve a level of "unsupervised" driving experience, surpassing previous versions in terms of safety and functionality [9]. Group 1: FSD V14.1 Features - FSD V14.1 introduces new arrival options for parking, allowing users to select various parking locations such as parking lots, streets, driveways, garages, or curbside [7]. - The update enhances the system's ability to yield for emergency vehicles and improves navigation by integrating routing into the vision-based neural network for real-time handling of blocked roads [7][8]. - Additional features include improved handling of static and dynamic gates, better management of road debris, and enhanced performance in various driving scenarios such as unprotected turns and lane changes [7][8]. Group 2: Technical Advancements - FSD V14.1 aims to cover a broader range of driving scenarios, optimizing performance in parking situations and simplifying user interface design for better efficiency [8]. - The update introduces a "most conservative" driving mode and offers more parking options upon arrival, catering to personalized user preferences [8]. - Significant improvements have been made in handling long-tail scenarios, including navigating around road debris, yielding to special vehicles, and managing system faults [8]. Group 3: Real-World Testing and Performance - Real-world testing of FSD V14.1 has demonstrated its ability to navigate complex environments, such as underground parking lots and construction zones, showcasing its advanced text recognition capabilities [12][15]. - The system has shown improved understanding of traffic signs and hand signals, indicating a significant leap in its contextual awareness and decision-making abilities [18]. - FSD V14.1 has also integrated audio signals into its control model, allowing it to detect emergency vehicles based on sirens, enhancing its situational awareness [21][28]. Group 4: Future Developments - The article mentions that FSD V14.1 is just the beginning, with future updates (V14.2 and V14.3) expected to further enhance the system's capabilities [27]. - There is speculation that the architecture of FSD V14 may incorporate a Vision-Language-Action (VLA) model, which could significantly improve its performance across various driving scenarios [25][28]. - The potential increase in model parameters and context length is anticipated to enhance the system's understanding and decision-making processes, bringing it closer to achieving a level of "awakening" in AI capabilities [28].
千寻智能解浚源:展望迈向通用人形机器人的曙光时刻
Xin Lang Cai Jing· 2025-06-30 08:22
Core Insights - The event "Empowering New Energy, Driving the Future" focused on the transformation of achievements by young scientists and the high-quality development of embodied intelligence, gathering over a hundred young scientists and renowned company entrepreneurs [1] Group 1: Technological Innovations - Dr. Jiyuan Jie from Qianxun Intelligent shared a solution that employs a three-stage learning path similar to large models, which includes pre-training with internet images, imitation learning data from real robots, and reinforcement learning to enhance performance [3] - This architecture addresses the multimodal challenges in traditional imitation learning, allowing models to flexibly choose various paths to achieve the same task rather than just replicating average actions [3] Group 2: Engineering and Commercialization - The true breakthrough in embodied intelligence lies not only in the choice of technological paths but also in the engineering capabilities that enable practical applications, with Qianxun Intelligent possessing top-tier hardware manufacturing capabilities and a pioneering software team [5] - The company's mission is to enable 10% of the global population to own their robots within ten years, showcasing technology maturity through specific industrial applications [5]
自动驾驶未来技术趋势怎样?李想:现阶段VLA是能力最强的架构
news flash· 2025-05-07 13:27
Core Viewpoint - The CEO of Li Auto, Li Xiang, discussed the transition of the auxiliary driving system to the VLA architecture, questioning its efficiency compared to potential future architectures [1] Group 1 - VLA architecture is capable of addressing full autonomous driving, but its efficiency as the optimal solution is uncertain [1] - Li Xiang highlighted that VLA is still based on the transformer architecture, which raises questions about whether transformer is the most efficient architecture available [1] - Currently, VLA is considered the most powerful architecture in terms of capabilities [1]