VLM
Search documents
何小鹏:未来最好的AI公司,都会自研芯片
3 6 Ke· 2026-01-12 07:10
Core Viewpoint - Xiaopeng Motors is increasingly positioning itself as an AI company, focusing on software and AI capabilities rather than just hardware in its new vehicle releases [6][8]. Group 1: New Vehicle Launches - On January 8, 2026, Xiaopeng launched four new models: P7+, G7 extended range version, 2026 G6, and 2026 G9, all featuring self-developed Turing AI chips [6][7]. - The new models will utilize Xiaopeng's second-generation VLA model, enabling basic Level 4 (L4) assisted driving capabilities [6][7]. - The Turing chip in the MAX version has an effective computing power of 750 TOPS, while the Ultra SE version uses two Turing chips for intelligent driving, and the Ultra version uses three chips [7]. Group 2: Strategic Focus on AI - Xiaopeng's CEO emphasizes that the value of AI will surpass traditional performance upgrades, with software's contribution to vehicle value expected to rise from 10% to 50% over the next decade [8][14]. - The company is committed to developing both the models and the software for chips, believing that the best AI companies will choose to customize their chips [17]. Group 3: Market Position and Sales Strategy - Xiaopeng has sold over 400,000 vehicles in the past year, showing significant growth, and plans to expand its product lineup with multiple new models in 2026 [10][24]. - The company aims to enhance its global supply chain and channel management, with plans to expand into more countries and regions [9][10]. Group 4: Future Outlook and Challenges - The CEO predicts that 2026 will be a pivotal year for the automotive industry, with a significant shift towards AI integration in vehicles [14][20]. - Xiaopeng is aware of the competitive landscape in the Chinese automotive market and acknowledges the uncertainties associated with its AI strategy [7][8].
何小鹏谈行业销售承压:最坏的时候也是最好的时候
Xin Lang Cai Jing· 2026-01-08 10:04
Group 1 - The core viewpoint expressed by the CEO of Xiaopeng Motors is that the current challenges in the electric vehicle (EV) sales are temporary and the industry will recover in due time, indicating a positive outlook for the future [1] - The CEO emphasized that the worst times can also present the best opportunities, suggesting a resilient approach to the current market conditions [1] - Xiaopeng Motors is focusing on the production of its VLA, VLM, and humanoid robots, indicating a strategic shift towards innovation and new product lines to capitalize on future opportunities [1]
从小切口透视大行业 ——2025年汽车供应链变革“风暴眼”
Zhong Guo Qi Che Bao Wang· 2026-01-06 02:18
Core Insights - The automotive industry's core competitiveness is shifting from traditional mechanical performance to smart technology, safety, and integration with energy networks [3] - Eight key component areas have emerged as focal points for change in the automotive supply chain by 2025 [3] Group 1: AI and Smart Technology - AI large models, including VLA and VLM, are reshaping the perception, decision-making, and interaction systems in smart vehicles [4] - Companies like Li Auto and XPeng are actively developing and deploying VLA-based autonomous driving systems, with plans for mass production by 2026 [4] - The competition in AI models is intensifying, with a focus on the underlying support systems like computing power and data [4] Group 2: Vehicle-to-Grid (V2G) Interaction - V2G is becoming a hot topic as electric vehicles can act as distributed energy storage units within new energy systems [5] - Government policies are driving the adoption of V2G, with pilot projects and plans to expand the scope of V2G applications by 2027 [5][6] - Companies like GAC Group are implementing V2G functionalities in their models and developing charging infrastructure to support this transition [6] Group 3: Battery Safety Standards - The new national standard for electric vehicle batteries, effective July 2026, emphasizes safety by requiring batteries to be "non-flammable and non-explosive" [7] - The updated standards will compel battery manufacturers to innovate in materials, design, and production processes to meet stricter safety requirements [7] - Leading battery companies like BYD are already adapting to these new standards, which will enhance safety and consumer trust in electric vehicles [7] Group 4: Door Handle Innovations - Electric hidden door handles are becoming a focal point due to safety concerns arising from their failure in collision scenarios [8][9] - New regulations are being proposed to ensure that all door handles, including electronic ones, have a mechanical release function for emergency situations [9] Group 5: Solid-State Batteries - Solid-state batteries are gaining traction due to their advantages in energy density and safety, with several companies planning to launch new products or production lines [10] - The development of solid-state batteries is seen as a key competitive factor for companies in the next generation of electric vehicles [10][11] Group 6: Human-Car-Home Ecosystem - The "Human-Car-Home" ecosystem is emerging, integrating automotive, home, and personal devices into a cohesive smart system [12] - Companies like Haier and Midea are collaborating with automotive brands to create interconnected systems that enhance user experience [12][13] Group 7: Humanoid Robots - The automotive industry is increasingly intersecting with humanoid robotics, with companies exploring the integration of robotic technology into manufacturing processes [14][15] - The demand for precision and adaptability in manufacturing is driving the development of humanoid robots tailored for automotive applications [14] Group 8: Zero-Gravity Seats - Zero-gravity seats are becoming a key feature in mid to high-end vehicles, enhancing passenger comfort and experience [16] - The lack of standardized regulations for these seats poses challenges, particularly regarding safety during vehicle operation and collisions [16]
英伟达主管!具身智能机器人年度总结
具身智能之心· 2025-12-29 12:50
Core Insights - The robotics field is still in its early stages, as highlighted by Jim Fan, NVIDIA's robotics head, indicating a lack of standardized evaluation metrics and the disparity between hardware advancements and software reliability [1][8][11]. Group 1: Hardware and Software Disparity - Current advancements in robotics hardware, such as Optimus and e-Atlas, outpace software development, leading to underutilization of hardware capabilities [14][15]. - The need for extensive operational teams to manage robots is emphasized, as they do not self-repair and face frequent issues like overheating and motor failures [16][17]. - The reliability of hardware is crucial, as errors can lead to irreversible consequences, impacting the overall patience and scalability of the robotics field [18][19]. Group 2: Benchmarking Challenges - The lack of consensus on benchmarking in robotics is a significant issue, with no standardized hardware platforms or task definitions, leading to everyone claiming to achieve state-of-the-art (SOTA) results [20][21]. - The field must improve reproducibility and scientific standards to avoid treating them as secondary concerns [23]. Group 3: VLA Model Insights - The Vision-Language-Action (VLA) model is currently the dominant paradigm in robotics, but its reliance on pre-trained Vision-Language Models (VLM) presents challenges due to misalignment with physical world tasks [25][49]. - The VLA model's performance does not scale linearly with VLM parameters, as the pre-training objectives do not align with the requirements for physical interactions [26][51]. - Future VLA models should integrate physical-driven world models to enhance their ability to understand and interact with the physical environment [50]. Group 4: Data Importance - Data plays a critical role in shaping model capabilities, with the need for diverse data sources and collection methods being highlighted [31][43]. - The emergence of new hardware and data collection methods, such as Generalist and Egocentric-10K, demonstrates the growing importance of data in the robotics field [36][42]. - The current data collection strategies remain open-ended, with various approaches still being explored [43]. Group 5: Industry Trends - The robotics industry is projected to grow significantly, from $91 billion currently to $25 trillion by 2050, indicating a strong future potential [57]. - Major tech companies, excluding Microsoft and Anthropic, are increasingly investing in robotics software and hardware, reflecting the sector's attractiveness [59].
最近做 VLA 的一些心得体会
自动驾驶之心· 2025-12-11 00:05
Core Insights - The article discusses the challenges and advancements in Vision-Language Models (VLM) for autonomous driving, highlighting issues such as hallucination, 3D spatial understanding, and processing speed [3]. Group 1: Challenges in VLM - Hallucination issues manifest as generating non-existent information and failing to perceive relevant data, which can be mitigated through dynamic perception techniques [3]. - Insufficient 3D spatial understanding is attributed to pre-training tasks being predominantly 2D, suggesting the incorporation of spatial localization tasks during training [3]. - Processing speed is a concern, with potential solutions including KV Cache, visual token compression, and mixed data training to enhance model efficiency [3]. Group 2: Learning Paradigms and Model Improvements - The learning paradigm should shift from imitation learning (SFT) to preference learning (DPO, GRPO), with simultaneous multi-task training yielding better results than sequential single-task training [3]. - To prevent catastrophic forgetting in foundation models, adding pre-training data is a simple and effective method [3]. - Enhanced supervisory signals can lead to better model representations, achieved by adding auxiliary task heads to the VLM model [3]. Group 3: Interaction and Evaluation - Current VLMs exhibit insufficient interaction between vision and language, limiting their effectiveness as base models; improving this interaction is crucial [3]. - The output method for trajectories is flexible, with various approaches yielding satisfactory results, though diffusion heads are preferred in industry for speed [3]. - Evaluation remains challenging due to inconsistencies between training and testing conditions, necessitating better alignment of objectives and data distributions [3].
一场关于自动驾驶VLA和世界模型的深度讨论!下周一不见不散~
自动驾驶之心· 2025-11-11 00:00
Core Insights - The article discusses advancements in autonomous driving technology, particularly focusing on the development of the Visual-Language-Action (VLA) framework and world models, highlighting the contributions of various experts in the field [1][2][3][4][5]. Group 1: Key Contributors - Jian Kun, a senior director at Li Auto, has built the autonomous driving technology stack from scratch since 2021, achieving milestones such as Highway NoA in 2022 and City NoA in 2023 [1]. - Xu Lingyun, a PhD from the Chinese Academy of Sciences, leads the parking team at Changan Automobile, focusing on autonomous driving perception and end-to-end system research [2]. - Jiang Anqing, a senior algorithm scientist at Bosch, leads research on VLA and closed-loop algorithms [3]. Group 2: Technological Focus - The discussion includes the potential integration of world models and VLA, questioning whether a unified approach is feasible [8]. - The high demand for data and computing power is making it increasingly difficult for academia to participate in intelligent driving, raising questions about future opportunities in the academic sector [8]. Group 3: Event Highlights - A live discussion on the future of autonomous driving technologies, including insights on Tesla's FSD v14 and its implications for domestic technology [4][5]. - The event featured a deep dive into the reliability of VLM in autonomous driving, with expert opinions on data closed-loop engineering [12].
理想VLM/VLA盲区减速差异
理想TOP2· 2025-10-18 08:44
Core Insights - The article discusses the differences between VLM (Visual Language Model) and VLA (Visual Language Action) in the context of autonomous driving, particularly focusing on scenarios like blind spot deceleration [1][2]. Group 1: VLM and VLA Differences - VLM operates by perceiving scenarios such as uncontrolled intersections and outputs a deceleration request to the E2E (End-to-End) model, which then reduces speed to 8-12 km/h, creating a sense of disconnection in the response [2]. - VLA, on the other hand, utilizes a self-developed base model to understand the scene directly, allowing for a more nuanced approach to blind spot deceleration, resulting in a smoother and more contextually appropriate response based on various road conditions [2]. Group 2: Action Mechanism - The action generated by VLA is described as a more native deceleration action rather than a dual-system command, indicating a more integrated approach to scene understanding and response [3]. - There are concerns raised in the comments regarding VLM's reliability as an external module, questioning its ability to accurately interpret 3D space and the stability of its triggering mechanisms [3].
【汽车智能化10月投资策略】先发优势稳固,后发发力追赶,继续看好智能化主线!
东吴汽车黄细里团队· 2025-10-17 09:20
Core Viewpoint - The market is expected to refocus on investment opportunities in smart technology in Q4, driven by the ongoing AI trend and advancements in autonomous driving capabilities, particularly in Robotaxi applications [2][8]. Group 1: Q4 Smart Technology Outlook - The Q4 market will see a renewed emphasis on smart technology investment opportunities, as AI applications in the physical world are anticipated to exceed expectations in the next 3-5 years [2][8]. - Key catalysts for smart technology in Q4 include the release of Tesla's V14 version, Xiaopeng's upcoming technology day, and the introduction of new autonomous vehicles by various companies [2][8]. Group 2: Comparison with Last Year - Similarities with last year's Q4 include the expansion of AI applications, but this year emphasizes the evolution of AI logic rather than the resonance between automotive and AI logic [3][9]. - The focus has shifted from hardware opportunities and consumer sales to software opportunities and breakthroughs in B2B applications [3][9]. Group 3: Investment Strategy - The preferred investment strategy favors Hong Kong stocks over A-shares, prioritizing software over hardware and B2B over B2C applications, with recommended stocks including Xiaopeng Motors, Horizon Robotics, and Cao Cao Mobility [4][9]. - Key investment targets include integrated models for Robotaxi, technology providers, and the transformation of ride-hailing services [4][9]. Group 4: Smart Technology Market Dynamics - The price war among passenger car manufacturers is more intense than expected, which could significantly impact profitability across the supply chain [5]. - The recovery of terminal demand is below expectations, which may affect sales growth for car manufacturers [5]. Group 5: Smart Technology Development Review - In August, the penetration rate of smart technology reached 23.3%, with significant advancements in autonomous driving capabilities among leading players [10]. - By October, the focus will be on the iterative development of next-generation driving architectures and the sales performance of key smart vehicles [10]. Group 6: Consumer Willingness to Pay - The consumer willingness to pay for smart technology is expected to evolve in two phases, with the first phase focusing on helping car manufacturers sell vehicles and the second phase aiming for software monetization [20][18]. Group 7: Future Projections - By 2025-2027, the core task of automotive smart technology will be to achieve a penetration rate of 50%-80% for new energy vehicles, while the period from 2028-2030 is expected to see the large-scale commercialization of Robotaxi services [20][19]. Group 8: Smart Technology Supply Chain Tracking - The supply chain for smart technology is being closely monitored, with various companies contributing to different aspects of the technology, including perception, decision-making, and execution [14][13]. Group 9: Key Metrics and Trends - The penetration rates for smart driving capabilities among different brands show significant variation, with Xiaopeng at 76.1% and Wey at 95.6% [25][26]. - The overall market dynamics indicate a competitive landscape with rapid advancements in technology and varying consumer adoption rates [24][23].
Waymo自动驾驶最新探索:世界模型、长尾问题、最重要的东西
自动驾驶之心· 2025-10-10 23:32
Core Insights - Waymo has developed a large-scale AI model called the Waymo Foundation Model, which supports vehicle perception, behavior prediction, scene simulation, and driving decision-making [5][11] - The model integrates data from multiple sensors to understand the environment, similar to how large language models operate [5][11] - The focus on data quality and selection is crucial for ensuring that the model addresses the right problems effectively [25][30] Group 1: World Model Development - Waymo's world model encodes all sensor data and incorporates world knowledge, enabling it to decode driving-related tasks [11] - The model allows for real-time perception and decision-making on the vehicle while simulating real driving environments in the cloud for testing [7][11] - The long-tail problem in autonomous driving, which includes complex scenarios like adverse weather and construction, remains a significant challenge [11][12] Group 2: Addressing Long-Tail Problems - Weather conditions such as rain and snow present unique challenges for autonomous driving, requiring high precision in judgment [12][14] - Low visibility scenarios necessitate the use of multi-modal sensors to detect objects effectively [15] - Occlusion reasoning is critical for understanding hidden objects and ensuring driving safety [18][21] Group 3: Complex Scene Understanding - Understanding complex scenes like construction zones and dynamic environments requires advanced reasoning capabilities [24] - Real-time responses to dynamic signals, such as traffic officer gestures, are essential for safe navigation [24] - The use of large language models is being explored to enhance scene understanding and decision-making [24] Group 4: Importance of Data, Algorithms, and Computing Power - The three critical components for successful autonomous driving are data, algorithms, and computing power, with a strong emphasis on data quality [25][30] - Efficient data mining from vast video datasets is vital for understanding driving events [30] - Quick decision-making is essential for safety and smooth operation, with a focus on reducing response times across the algorithmic chain [30][31] Group 5: Operational Infrastructure - Waymo's operational facilities, including depots and modification workshops, are crucial for the efficient deployment of Level 4 autonomous vehicles [33] - Vehicles can autonomously navigate to charging stations and begin operations after sensor installation [33] - The engineering challenges of scaling autonomous driving technology require collaboration with traditional automotive engineers [34] Group 6: Sensor and Algorithm Response - The responsiveness of sensors, such as camera frame rates, is critical for effective autonomous driving [36] - Algorithms must process data at high frequencies to ensure timely execution of driving commands [36] - The evolution of vehicle control systems is moving towards higher frequency responses, particularly in electric and electronically controlled systems [36]
李想目前对AI兴趣远大于汽车硬件维度产品细节打磨
理想TOP2· 2025-09-01 07:50
Core Viewpoints - Li Xiang's personal interest in AI currently outweighs the focus on the incremental details of automotive hardware products [1][4] - Discussing the short-term market, Li Xiang's preference for AI over hardware may pose a potential risk to short-term sales, as many consumers prefer hardware-defined products [1] - The foundational anchor for both short-term and long-term commercial value is the product's utility, supported by varying levels of emotional value; in the AI era, models are products [1] - Within a three-month timeframe, AI-related product utility is unlikely to reach early mainstream adoption, remaining in the early adopter phase, with low emotional value among the general public [1] Detailed Analysis - The head of the first product line, Lao Tang, actively shares the product development process online, while the heads of the second and third product lines, Zhang Xiao and Li Xinyang, are less inclined to do so [3] - The MEGA Home was developed based on user feedback regarding accessibility for the elderly, with differing opinions between Li Xiang and Lao Tang on design solutions [3] - Li Xiang has been the primary decision-maker for many product details in the Li ONE, while there is speculation that the i8 may shift to a configuration with fewer options, likely influenced by Li Xiang [3] - There is no evidence from public information that Li Xiang has strongly insisted on hardware dimension enhancements for the new product lines [3] - Li Xiang's strong insistence on running VLA on dual Orin chips led to significant technical challenges being overcome, showcasing his first-principles thinking [5] - All vehicles equipped with the Thor chip are expected to be able to switch to Li Auto's own autonomous driving chip in the future, although it is uncertain if the Orin chip will also be replaceable [5]