End-to-End Autonomous Driving

End-to-End and VLA Are Drawing Attention from More Intelligent-Driving Companies......

自动驾驶之心· 2025-10-23 00:04

Core Insights
- Demand for end-to-end and VLA (Vision-Language-Action) technical talent is high in the automotive industry, particularly among major manufacturers and suppliers [1][3]
- The industry is evolving from modular production algorithms to end-to-end solutions and now to VLA, with core algorithms involving BEV perception, VLM, diffusion models, reinforcement learning, and world models [3]

Group 1: Industry Demand and Trends
- Demand for end-to-end and VLA talent is high, with inquiries from multiple companies, including three major manufacturers and several suppliers [1]
- End-to-end systems in industry follow two main paradigms, single-stage and two-stage, with UniAD being a representative of the single-stage paradigm [1]
- The end-to-end approach has diversified into various subfields, especially those based on VLA, with a surge in related academic publications and industrial applications in recent years [1]

Group 2: Educational Initiatives
- The company has launched courses on end-to-end and VLA autonomous driving, aimed at helping individuals enter these fields quickly and efficiently [3][12]
- The "VLA and Large Model Practical Course" covers VLA from VLM as an autonomous driving interpreter to modular and integrated VLA, including detailed theoretical foundations and practical assignments [3][12]
- The "End-to-End and VLA Autonomous Driving Course" focuses on key algorithms and theoretical foundations, including BEV perception, large language models, diffusion models, and reinforcement learning [12][14]

Group 3: Instructor Expertise
- The courses are led by experts from both academia and industry, with backgrounds in multimodal perception, autonomous driving VLA, and large model frameworks [8][11][14]
- Instructors have published numerous papers at top-tier conferences and have extensive research and practical experience in autonomous driving and large models [8][11][14]

Group 4: Target Audience
- The courses are designed for individuals with foundational knowledge of autonomous driving who are familiar with its basic modules and with concepts such as transformer models, reinforcement learning, and BEV perception [15][16]
- Participants are expected to have a background in probability theory and linear algebra, plus programming skills in Python and PyTorch [15][16]

End-to-end autonomous driving

VLA autonomous driving

Large models for autonomous driving

End-to-End and VLA Autonomous Driving Course

Autonomous Driving VLA and Large Model Practical Course

From Horizon's 2025 Autonomous Driving Work, We See the Ambition of HSD......

自动驾驶之心· 2025-10-22 00:03

Core Insights
- Horizon is advancing in autonomous driving on two fronts: large-scale production of the new HSD system, and research papers that rethink the foundational logic of autonomous driving [2][3]
- The company is transitioning from a technology supplier to a standard-setter in the industry, supported by the capital raised through its Hong Kong listing [2]

Group 1: End-to-End Autonomous Driving
- ResAD introduces a normalized residual trajectory modeling framework that simplifies the learning task and improves model performance, achieving a PDMS of 88.6 on the NAVSIM benchmark [8]
- CorDriver improves safety in end-to-end autonomous driving by explicitly defining safe passage areas, resulting in a 66.7% reduction in the collision rate with traffic participants [11]
- TTOG unifies the motion prediction and path planning tasks, demonstrating a 36.06% reduction in average L2 error on the nuScenes dataset [15]
- MomAD addresses the consistency and stability of trajectory prediction by introducing momentum mechanisms, showing significant improvements in collision rate and trajectory smoothness [19]
- GoalFlow generates high-quality multimodal trajectories using precise goal-point guidance, achieving a PDMS of 90.3 on the NAVSIM benchmark [22]
- RAD employs a large-scale 3DGS-based reinforcement learning framework to improve safety, achieving a collision rate three times lower than pure imitation learning methods [26]
- DiffusionDrive uses a truncated diffusion model for real-time end-to-end autonomous driving, achieving a PDMS of 88.1 and significantly improving planning quality [30]

Group 2: Autonomous Driving Scene Generation & World Models
- Epona is an autoregressive diffusion world model that achieves high-resolution, long-term future scene generation and trajectory planning, outperforming existing methods on the nuScenes dataset [33]
- UMGen generates diverse, multimodal driving scenes, supporting user-controlled scenario generation and demonstrating better realism and controllability than existing methods [38]
- DrivingWorld constructs a world model for autonomous driving via a video GPT framework, generating high-fidelity videos with strong temporal consistency and structural integrity [41]

Group 3: Autonomous Driving VLM & VLA
- AlphaDrive integrates reinforcement learning and reasoning into vision-language models for high-level planning in autonomous driving, improving planning accuracy by 25.52% compared with standard fine-tuned models [45]
- The publisher's community has nearly 4,000 members and includes over 300 autonomous driving companies and research institutions, covering a range of autonomous driving technology stacks [49]

End-to-end autonomous driving

Sharing the Experience of Switching Careers into Several Major Autonomous Driving Companies

自动驾驶之心· 2025-10-21 00:06

Core Insights
- The article stresses seizing opportunities and learning continuously in the fast-moving field of autonomous driving, illustrated by the experience of a professional who moved from banking into the autonomous driving industry [1][2].

Group 1: Career Development in Autonomous Driving
- The move from a traditional banking career into autonomous driving was made possible by the industry's growing demand for talent, particularly in 2020 [1].
- The individual started in algorithm evaluation and gradually moved into more advanced roles in perception and safety algorithms, which highlights the value of building foundational skills and adapting to industry trends [1].

Group 2: Community and Learning Resources
- The "Autonomous Driving Heart Knowledge Planet" community has over 4,000 members and aims to grow to nearly 10,000 in the next two years, providing a platform for knowledge sharing and technical discussion [4][5].
- The community offers video content, written materials, learning pathways, and job exchange opportunities, catering to both beginners and advanced learners [7][11].

Group 3: Technical Learning and Support
- The community has organized resources covering over 40 technical pathways in autonomous driving, on topics such as end-to-end learning, multi-modal models, and data annotation practices [19][21].
- Members can access practical guidance on entering the field, including specific learning routes for different aspects of autonomous driving technology [8][13].

Group 4: Industry Engagement and Networking
- The community collaborates with industry leaders and academic experts to provide insights into the latest trends and challenges in autonomous driving, fostering a network for professional growth [9][18].
- Members are encouraged to engage with industry professionals for job referrals and to stay updated on academic advances and industrial applications [21][23].

End-to-end autonomous driving

Multimodal large models for autonomous driving

Autonomous Driving Heart Knowledge Planet

Execution Is the Primary Lifeblood of Autonomous Driving Today

自动驾驶之心· 2025-10-17 16:04

Core Viewpoint
- The article discusses the evolving landscape of the autonomous driving industry in China, highlighting the shift in competitive dynamics and the increasing investment in autonomous driving technologies as a core focus of AI development [1][2].

Industry Trends
- The autonomous driving sector has changed significantly over the past two years, with new players entering the market and existing companies focusing on improving execution capabilities [1].
- Before 2022 the industry went through a flourishing period in which a company with one standout technology could thrive; it has since become a more competitive environment in which companies must fix their weaknesses [1].
- Companies that remain active in the market are progressively strengthening their hardware, software, AI capabilities, and engineering implementation in order to survive and excel [1].

Future Outlook
- By 2025, the industry is expected to enter a "calm period," in which unresolved technical challenges in areas such as L3, L4, and Robotaxi will continue to present opportunities for professionals in the field [2].
- The article emphasizes the importance of a comprehensive skill set for individuals in the autonomous driving sector, suggesting that those with a short-term profit mindset may not last [2].

Community and Learning Resources
- The "Autonomous Driving Heart Knowledge Planet" community has been established as a platform for learning and sharing knowledge in the autonomous driving field, with over 4,000 members and a goal of growing to nearly 10,000 in the next two years [4][17].
- The community offers video content, learning pathways, Q&A sessions, and job exchange opportunities, catering to both beginners and advanced learners [4][6][18].
- Members can access detailed technical routes and practical solutions for various autonomous driving challenges, significantly reducing the time needed for research and learning [6][18].

Technical Focus Areas
- The community has compiled over 40 technical routes related to autonomous driving, covering areas such as end-to-end learning, multi-modal models, and various simulation platforms [18][39].
- There is a strong emphasis on practical applications, with resources available for data processing, 4D labeling, and engineering practices in autonomous driving [12][18].

Job Opportunities
- The community connects members with openings at leading autonomous driving companies, providing a platform for resume submissions and internal referrals [13][22].

End-to-end autonomous driving

Autonomous driving world models

Vision-language models (VLM)

Autonomous driving VLA

Withstanding Involution: A 4,000-Member Autonomous Driving Community with Real Substance

自动驾驶之心· 2025-10-15 23:33

Core Viewpoint
- The autonomous driving industry is entering a period of consolidation and technological convergence, which increases competition and makes the field harder for individuals. The focus is shifting toward well-rounded talent with diverse skill sets as the market becomes more competitive and the need for innovation grows [2][4][14].

Group 1: Industry Trends
- The autonomous driving sector is experiencing a "cooling period" in which many professionals are considering moving to other fields because of intense competition and a lack of internship opportunities [2].
- The ongoing debate between VLA (Vision-Language-Action) and WA (World Action) approaches reflects a larger industry transformation and highlights the need for professionals to adapt [2][4].
- The community aims to create a knowledge-sharing platform that helps individuals navigate the complexities of the autonomous driving landscape, fostering collaboration and innovation [4][14].

Group 2: Community and Resources
- The "Autonomous Driving Heart Knowledge Planet" has been established as a community for learning and sharing knowledge, currently hosting over 4,000 members with a goal of reaching 10,000 in two years [4][14].
- The community provides video tutorials, learning pathways, and Q&A sessions to assist both beginners and advanced learners in the field [6][10].
- Members have access to over 40 technical routes and numerous industry insights, which can significantly reduce the time needed for research and learning [6][15].

Group 3: Learning and Development
- The community offers structured learning paths for newcomers, covering essential topics such as multi-sensor fusion, end-to-end autonomous driving, and various algorithms [15][36].
- Regular discussions with industry experts explore trends, challenges, and practical applications in autonomous driving [7][19].
- The platform also connects members with potential employers and provides resume submission services [10][19].

End-to-end autonomous driving

Multimodal large models for autonomous driving

Autonomous Driving Heart Knowledge Planet

Fudan's SeerDrive: An End-to-End Framework with Bidirectional Modeling of Trajectory Planning and Scene Evolution

自动驾驶之心· 2025-10-14 23:33

Core Insights
- The article discusses advances in end-to-end autonomous driving, focusing on the SeerDrive model, which aims to improve trajectory planning through bidirectional modeling of trajectory planning and scene evolution [1][3][4].

Group 1: SeerDrive Overview
- SeerDrive introduces a bidirectional modeling paradigm that captures scene dynamics while allowing planning results to refine scene predictions, creating a closed-loop iteration [3][4].
- The overall pipeline of SeerDrive consists of four main modules: feature encoding, future BEV world modeling, future-aware planning, and iterative optimization [4].

Group 2: Challenges in Current Systems
- Current one-shot paradigms in autonomous driving overlook dynamic scene evolution, leading to inaccurate planning in complex interactions [5].
- Existing systems fail to model the impact of the ego vehicle's behavior on the surrounding environment, which is crucial for accurate trajectory planning [5].

Group 3: Technical Components
- Feature encoding transforms multimodal sensor inputs and vehicle states into structured features, laying the groundwork for subsequent modeling [8][9].
- Future BEV world modeling predicts scene dynamics by generating future BEV features, balancing efficiency and structured representation [10][13].

Group 4: Planning and Optimization
- SeerDrive employs a decoupled planning strategy in which the current and future scenes guide planning separately, avoiding representation entanglement [15].
- The iterative optimization process strengthens the bidirectional dependency between trajectory planning and scene evolution, leading to improved performance [17].

Group 5: Experimental Results
- SeerDrive achieved a PDMS of 88.9 on the NAVSIM test set, outperforming several state-of-the-art methods [23].
- On the nuScenes validation set, SeerDrive demonstrated an average L2 displacement error of 0.43 m and a collision rate of 0.06%, significantly better than competing methods [24].

Group 6: Component Effectiveness
- Removing future-aware planning or iterative optimization lowered the PDMS, indicating that both components contribute to performance [26].
- Design choices such as the decoupled strategy and the use of anchored endpoints for future ego feature initialization proved critical for achieving the best results [30].

Group 7: Limitations and Future Directions
- The BEV world model does not leverage the generalization capabilities of foundation models, which could improve performance in complex scenarios [41].
- Future research may explore integrating foundation models with planning to improve generalization while maintaining efficiency [41].
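The two open-loop metrics quoted above, average L2 displacement error and collision rate, can be illustrated with a small sketch. This is a generic illustration only, not the evaluation code used by SeerDrive or nuScenes: the function names, the flat averaging over waypoints, and the pre-computed boolean collision flags are all assumptions, and published protocols differ in how they average over prediction horizons.

```python
import math
from typing import List, Sequence, Tuple

Point = Tuple[float, float]  # (x, y) position in metres


def average_l2_error(pred: Sequence[Point], gt: Sequence[Point]) -> float:
    """Mean Euclidean distance between predicted and ground-truth waypoints."""
    if len(pred) != len(gt) or not pred:
        raise ValueError("trajectories must be non-empty and of equal length")
    distances = [math.hypot(px - gx, py - gy) for (px, py), (gx, gy) in zip(pred, gt)]
    return sum(distances) / len(distances)


def collision_rate_percent(collided: List[bool]) -> float:
    """Share of evaluated samples flagged as colliding, in percent.

    The flags are assumed to come from a separate (hypothetical) collision check.
    """
    if not collided:
        raise ValueError("need at least one evaluated sample")
    return 100.0 * sum(1 for flag in collided if flag) / len(collided)


if __name__ == "__main__":
    predicted = [(0.0, 0.0), (3.0, 4.0)]
    ground_truth = [(0.0, 0.0), (0.0, 0.0)]
    print(average_l2_error(predicted, ground_truth))             # 2.5
    print(collision_rate_percent([True, False, False, False]))   # 25.0
```

A lower value is better for both metrics; a collision rate of 0.06% corresponds to roughly 6 flagged samples in 10,000.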

End-to-end autonomous driving

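The Divide Between Academia and Mass Production, the Ongoing Contest of Technical Routes! A Look at a Decade of Intelligent Driving from the Perspective of Technical Leaders....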

自动驾驶之心· 2025-10-14 23:33

Core Insights
- The article discusses the major technological advances in autonomous driving over the past decade, highlighting key innovations such as Vision Transformers, BEV perception, multi-sensor fusion, end-to-end autonomous driving, large models, VLA, and world models [3][4].

Group 1: Technological Milestones
- The past ten years have seen remarkable technological development in autonomous driving, with various solutions emerging from the collision and fusion of different technologies [3].
- A roundtable discussion will reflect on the industry's technological milestones, focusing on the debate between world models and VLA [4][13].

Group 2: Industry Perspectives
- The roundtable will feature insights from top industry leaders, who will discuss the evolution of autonomous driving technology and offer career advice for newcomers to the field [4][5].
- The discussion will also cover how academia and industry view L3 autonomous driving, emphasizing the convergence of research directions and practical engineering implementation [13].

Group 3: Future Directions
- The article raises questions about the future direction of autonomous driving technology, particularly the role of end-to-end systems as a foundation of intelligent driving technology [13].
- It highlights the ongoing competition between academic research and engineering practice in the field, suggesting that new entrants need to adapt and innovate [13].

Vision Transformer

Multi-sensor fusion

End-to-end autonomous driving

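How Is Horizon's Residual End-to-End Approach Implemented? ResAD: Residual Learning Brings Autonomous Driving Decisions Closer to Human Logic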

自动驾驶之心· 2025-10-13 23:33

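Paper authors | Zhiyu Zheng et al. Editor | 自动驾驶之心

To make a car drive itself, the traditional approach works like stacking building blocks: first "see" (perception), then "guess" (prediction), and finally "decide" (planning). The stages are tightly chained, so an error in one stage carries through everything after it, which is neither efficient nor safe. End-to-end autonomous driving emerged as a new path. It aims to let the AI act like a veteran driver and turn what it sees (sensor data) directly into the route to take (the future trajectory). The idea is appealing, but reality is harsher: most existing end-to-end models are stuck on a single question: "What does the future trajectory look like?" To address these problems, a team from Horizon, Huazhong University of Science and Technology, and Wuhan University proposed the ResAD framework. The core idea is simple: instead of predicting the whole trajectory directly, first provide an "inertial reference line", that is, the route the car would follow if the steering wheel were left untouched. The model then learns only an "adjustment" (the residual): how far the car needs to deviate from this reference line in order to drive safely. The learning target thus changes from "What is the trajectory?" to "Why should the direction be adjusted?". The model is forced to attend to the real causes of an adjustment, such as obstacles and traffic rules, rather than memorizing coincidences in the data. We ...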
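The residual idea described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the ResAD implementation: the constant-velocity extrapolation used as the inertial reference, the per-step `scale` values used to normalize the residual, and all function names are hypothetical choices made for this example.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]  # (x, y) position in metres


def inertial_reference(pos: Point, vel: Point, steps: int, dt: float) -> List[Point]:
    """Extrapolate the current velocity: the path taken if nothing is adjusted."""
    return [(pos[0] + vel[0] * dt * k, pos[1] + vel[1] * dt * k)
            for k in range(1, steps + 1)]


def to_residual(traj: Sequence[Point], ref: Sequence[Point],
                scale: Sequence[float]) -> List[Point]:
    """Learning target: deviation from the reference, divided by a per-step scale."""
    return [((tx - rx) / s, (ty - ry) / s)
            for (tx, ty), (rx, ry), s in zip(traj, ref, scale)]


def from_residual(residual: Sequence[Point], ref: Sequence[Point],
                  scale: Sequence[float]) -> List[Point]:
    """Recover the final trajectory from a (predicted) normalized residual."""
    return [(rx + dx * s, ry + dy * s)
            for (dx, dy), (rx, ry), s in zip(residual, ref, scale)]


if __name__ == "__main__":
    reference = inertial_reference(pos=(0.0, 0.0), vel=(10.0, 0.0), steps=3, dt=0.5)
    expert = [(5.0, 0.25), (10.0, 1.0), (15.0, 2.25)]  # gentle drift to the left
    scale = [1.0, 2.0, 3.0]  # assumed: deviations grow with the horizon
    residual = to_residual(expert, reference, scale)
    print(residual)  # [(0.0, 0.25), (0.0, 0.5), (0.0, 0.75)]
    print(from_residual(residual, reference, scale) == expert)  # True
```

In this toy case the raw lateral deviation grows from 0.25 m to 2.25 m over the horizon, while the normalized residual stays within a narrow range, which is one plausible reason for normalizing the target.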

End-to-end autonomous driving

Normalized residual trajectory modeling

End-to-End and VLA Have Become the Mainstream of Frontier Autonomous Driving Research...

自动驾驶之心· 2025-10-13 04:00

Core Insights
- The article discusses the evolution of end-to-end algorithms in autonomous driving, highlighting the transition from modular production algorithms to end-to-end approaches and the recent focus on Vision-Language-Action (VLA) models [1][3].

Group 1: End-to-End Algorithms
- End-to-end algorithms are central to current mass-production autonomous driving and involve a rich technology stack [1].
- There are two main paradigms in the industry: single-stage and two-stage approaches, with UniAD being a representative of the single-stage paradigm [1].
- The single-stage approach can be further divided into several subfields, including perception-based, diffusion model-based, world model-based, and VLA-based end-to-end algorithms [1].

Group 2: VLA and Course Offerings
- The article notes a recent surge of interest in how to learn end-to-end and VLA technologies efficiently, which led to the creation of specialized courses [3].
- The "End-to-End and VLA Autonomous Driving Course" focuses on VLA, covering topics from VLM as an autonomous driving interpreter to modular and integrated VLA approaches [3].
- The course includes a detailed theoretical foundation and practical assignments to help participants build their own VLA models and datasets [3].

Group 3: Course Instructors
- The course features a team of instructors with significant academic and practical experience in multi-modal perception, autonomous driving VLA, and large model frameworks [7][9].
- Instructors have published numerous papers at top international conferences and have hands-on experience in developing and implementing cutting-edge algorithms in the field [7][9][10].

Group 4: Target Audience and Requirements
- The courses are designed for individuals with a foundational understanding of autonomous driving and familiarity with key technologies such as transformer models, reinforcement learning, and BEV perception [13].
- Participants are expected to have a basic knowledge of probability theory and linear algebra, and proficiency in Python and PyTorch [13].

End-to-end autonomous driving

VLA autonomous driving

End-to-End and VLA Autonomous Driving small-group course

Autonomous Driving VLA and Large Model Practical Course

Waymo Proposes Drive&Gen: Evaluating End-to-End Autonomous Driving with Generated Videos (IROS'25)

自动驾驶之心· 2025-10-12 23:33

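Author | Jiahao Wang Source | 我爱计算机视觉

A traditional autonomous driving system resembles a large company with many departments: the perception, prediction, planning, and other modules each do their own job. This is stable, but the process is cumbersome, and a mistake in one stage can affect the whole system. An E2E model is more like a versatile startup team: it goes from raw inputs such as camera images to driving decisions in a single step, which is simple, efficient, and full of potential. But questions follow: is AI-generated video really "realistic" enough to fool an autonomous driving system and to be used for serious evaluation? And how can we understand an E2E driving model's "temperament" in depth, fix its weaknesses, and let it cope calmly with new scenes it has never encountered (such as a sudden rainstorm)? To answer these questions, researchers from Johns Hopkins University, Waymo, and Google DeepMind teamed up and, in a paper to be presented at IROS 2025, proposed a new framework named Drive&Gen. The name is straightforward: it combines driving (Drive) with generation (Gen), aiming to connect E2E driving models with generative world models so that each can be used to evaluate and improve the other.

Background: When E2E Driving Meets Generative AI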

End-to-end autonomous driving