自动驾驶世界模型
Search documents
答应大家的《自动驾驶世界模型》课程终于开课了!
自动驾驶之心· 2026-01-06 06:52
正式开课!添加助理领取优惠~ 讲师介绍 Jason:C9本科+QS50 PhD,已发表CCF-A论文2篇,CCF-B论文若干。现任国内TOP主机厂算法专家,目前从事端到端、大模型、世界模型等前沿算法的预研和量 产,并已主持和完成多项自动驾驶感知和端到端算法的产品量产交付,拥有丰富的端到端算法研发和实战经验。 课程大纲 点击下方 卡片 ,关注" 自动驾驶之心 "公众号 戳我-> 领取 自动驾驶近30个 方向 学习 路线 Jason老师新课《世界模型与自动驾驶小班课》正式开课啦! 自动驾驶之心联合 工业界大佬 共同开展,先前的《端到端与VLA自动驾驶小班课》备受大家好评,因 此我们进一步推出这门世界模型小班课, 课程聚焦于通用世界模型、视频生成、OCC生成等世界模型算法,涵盖特斯拉世界模型、李飞飞团队Marble等。欢迎大 家加入学习~ 这门课程讲如何展开 第一章:世界模型介绍 第一章主要针对自动驾驶世界模型概括性的内容讲解。 这一章老师会先复盘世界模型和端到端自动驾驶的联系,接着讲解世界模型的发展历史以及当下的应用案 例。然后介绍世界模型有哪些流派:纯仿真的世界模型、仿真+Planning、生成传感器输入、生成 ...
双SOTA!GenieDrive:物理一致的自动驾驶世界模型(港大&华为诺亚)
自动驾驶之心· 2025-12-24 00:58
点击下方 卡片 ,关注" 自动驾驶之心 "公众号 戳我-> 领取 自动驾驶近30个 方向 学习 路线 >>自动驾驶前沿信息获取 → 自动驾驶之心知识星球 论文作者 | Zhenya Yang等 编辑 | 自动驾驶之心 来自香港大学、华为以及华中科技大学的最新工作 GenieDrive,提出了一种以 4D Occupancy 作为中间表示的自动驾驶世界模型框架,在 4D 占据预测、轨迹可控性 以及长时序视频生成能力等方面均显著优于现有自动驾驶世界模型,为该领域提供了一条"先生成 4D 占据、再生成视频"的全新研究路径。 项目简介 GenieDrive 是一种新型的以 4D Occupancy 作为中间表示的自动驾驶世界模型框架,能够实现高度可控、多视角一致且符合物理规律的自动驾驶视频生成。得益于上述 设计,GenieDrive 在仅使用 3.47M 参数的情况下实现了 41 FPS 的推理速度,并在 4D 占据预测任务上取得了 7.2% 的 mIoU 提升;同时,通过在视频生成阶段引入归 一化多视角注意力机制,在 4D Occupancy 的引导下显著提升了多视角视频生成质量,将 FVD 指标降低了 20.7% ...
理想一篇OCC世界模型:全新轨迹条件稀疏占用世界模型SparseWorld-TC
自动驾驶之心· 2025-12-16 03:16
点击下方 卡片 ,关注" 自动驾驶之心 "公众号 戳我-> 领取 自动驾驶近30个 方向 学习 路线 >>自动驾驶前沿信息获取 → 自动驾驶之心知识星球 论文作者 | Jiayuan Du等 编辑 | 自动驾驶之心 端到端自动驾驶预测技术正迎来革命性突破!传统方法依赖鸟瞰图(BEV)表示或离散令牌化,这些技术虽然在一定程度上推动了行业发 展,但本质上限制了模型对复杂时空依赖的捕捉能力。特别是在长时预测和动态场景中,如何平衡精度与效率成为制约技术落地的核心挑 战。如果仅靠VAE编码或BEV先验,往往难以兼顾细粒度细节和实时性需求。近年来,基于注意力的Transformer架构在语言和视觉领域的 成功,为自动驾驶世界模型提供了全新思路——能否直接利用稀疏表示绕过中间表示,实现更灵活的时空建模?这是一个值得深入探索的 方向。 现有4D占用预测方法大多面临两大本质局限:一是过度依赖VAE等离散令牌化技术,导致表征容量受限,细节信息丢失;二是BEV表示引 入的显式几何先验,虽然简化了问题形式化,但严重限制了特征交互的灵活性。例如,OccWorld和RenderWorld采用VQ-VAE编码场景,虽 然简化了生成过程,却 ...
工业界大佬带队!彻底搞懂自动驾驶世界模型...
自动驾驶之心· 2025-12-11 03:35
Core Viewpoint - The article introduces a new course titled "World Models and Autonomous Driving Small Class," focusing on advanced algorithms in the field of autonomous driving, including general world models, video generation, and OCC generation [1][3]. Course Overview - The course is developed in collaboration with industry leaders and follows the success of a previous course on end-to-end and VLA autonomous driving [1]. - The course aims to enhance understanding and practical skills in world models, targeting individuals interested in the autonomous driving industry [11]. Course Structure - **Chapter 1: Introduction to World Models** - Discusses the relationship between world models and end-to-end autonomous driving, including historical development and current applications [6]. - Covers various types of world models, such as pure simulation, simulation + planning, and generation of sensor inputs and perception results [6]. - **Chapter 2: Background Knowledge of World Models** - Focuses on foundational knowledge, including scene representation, Transformer, and BEV perception [6][12]. - Highlights key technical terms frequently encountered in job interviews related to world models [7]. - **Chapter 3: General World Model Exploration** - Examines popular models like Marble from Li Fei-Fei's team, DeepMind's Genie 3, and Meta's JEPA, along with recent discussions on VLA + world model algorithms [7]. - **Chapter 4: Video Generation-Based World Models** - Concentrates on video generation algorithms, starting with Wayve's GAIA-1 & GAIA-2 and extending to recent works like UniScene and OpenDWM [8]. - **Chapter 5: OCC-Based World Models** - Focuses on OCC generation methods, discussing three major papers and a practical project that extends to vehicle trajectory planning [9]. - **Chapter 6: World Model Job Specialization** - Provides insights into the application of world models in the industry, addressing pain points and interview preparation for relevant positions [10]. Learning Outcomes - The course aims to equip participants with the skills to reach a level equivalent to one year of experience as a world model autonomous driving algorithm engineer [14]. - Participants will gain a comprehensive understanding of world model technologies, including video generation and OCC generation methods, and will be able to apply their knowledge in practical projects [14].
自驾世界模型剩下的论文窗口期没多久了......
自动驾驶之心· 2025-12-11 00:05
Core Insights - The article highlights the recent surge in research papers related to world models in autonomous driving, indicating a trend towards localized breakthroughs and verifiable improvements in the field [1] - It emphasizes the importance of refining submissions to top conferences, suggesting that the final 10% of polishing can significantly impact the overall quality and acceptance of the paper [2] - The platform "Autonomous Driving Heart" is presented as a leading AI technology media outlet in China, with a strong focus on autonomous driving and related interdisciplinary fields [3] Summary by Sections Research Trends - Numerous recent works in autonomous driving, such as MindDrive and SparseWorld-TC, reflect a focus on world models, which are expected to dominate upcoming conferences [1] - The article suggests that the main themes for the end of this year and the first half of next year will likely revolve around world models, indicating a strategic direction for researchers [1] Guidance and Support - The platform offers personalized guidance for students, helping them navigate the complexities of research and paper submission processes [7][13] - It claims a high success rate, with a 96% acceptance rate for students who have received guidance over the past three years [5] Faculty and Resources - The platform boasts over 300 dedicated instructors from top global universities, ensuring high-quality mentorship for students [5] - The instructors have extensive experience in publishing at top-tier conferences and journals, providing students with valuable insights and support [5] Services Offered - The article outlines various services, including personalized paper guidance, real-time interaction with mentors, and comprehensive support throughout the research process [13] - It also mentions the potential for students to receive recommendations from prestigious institutions and direct job placements in leading tech companies [19]
和港校自驾博士交流后的一些分享......
自动驾驶之心· 2025-11-20 00:05
Core Viewpoint - The article emphasizes the importance of building a comprehensive community for autonomous driving, providing resources, networking opportunities, and guidance for both newcomers and experienced professionals in the field [6][16][19]. Group 1: Community and Networking - The "Autonomous Driving Heart Knowledge Planet" community aims to create a platform for technical exchange and collaboration among members from renowned universities and leading companies in the autonomous driving sector [16][19]. - The community has grown to over 4,000 members and aims to reach nearly 10,000 within two years, facilitating discussions on technology trends and industry developments [6][7]. - Members can freely ask questions regarding career choices and research directions, receiving insights from industry experts [89][92]. Group 2: Learning Resources - The community offers a variety of learning materials, including video tutorials, technical routes, and Q&A sessions, covering over 40 technical directions in autonomous driving [9][11][16]. - Specific learning paths are provided for newcomers, including foundational courses and advanced topics in areas such as end-to-end driving, multi-sensor fusion, and 3D target detection [11][17][36]. - The community has compiled a comprehensive list of open-source projects and datasets relevant to autonomous driving, aiding members in their research and development efforts [32][34][36]. Group 3: Career Development - The community facilitates job referrals and connections with various autonomous driving companies, enhancing members' employment opportunities [11][19]. - Regular discussions with industry leaders are organized to explore career paths, job openings, and the latest trends in the autonomous driving field [8][19][92]. - Members are encouraged to engage in research collaborations and internships, particularly for those pursuing advanced degrees in related fields [3][6][16].
跨行转入自动驾驶大厂的经验分享
自动驾驶之心· 2025-11-04 00:03
Core Insights - The article emphasizes the importance of seizing opportunities and continuous learning in the rapidly evolving field of autonomous driving [1][4] - It highlights the creation of a comprehensive community platform, "Autonomous Driving Heart Knowledge Planet," aimed at facilitating knowledge sharing and career development in the autonomous driving sector [4][16] Group 1: Career Development - Transitioning to the autonomous driving industry can be successful through dedication and preparation, as illustrated by the experience of a professional who switched careers and excelled in various roles [1] - Continuous learning and adapting to industry trends are crucial for career advancement, as demonstrated by the professional's progression from algorithm evaluation to advanced safety algorithms [1] Group 2: Community and Resources - The "Autonomous Driving Heart Knowledge Planet" has over 4,000 members and aims to grow to nearly 10,000 in two years, providing a platform for discussion, technical sharing, and job opportunities [4][16] - The community offers a variety of resources, including video content, learning pathways, and Q&A sessions, to support both beginners and advanced learners in the autonomous driving field [7][10] Group 3: Technical Learning and Networking - The community organizes discussions with industry experts on various topics, including entry points for end-to-end autonomous driving and the integration of multi-sensor fusion [8][20] - Members have access to a wealth of technical routes and resources, including over 40 technical pathways and numerous datasets relevant to autonomous driving [10][36] Group 4: Job Opportunities - The community facilitates job referrals and connections with leading companies in the autonomous driving sector, enhancing members' chances of securing positions in the industry [11][12] - Regular updates on job openings and industry trends are provided, helping members stay informed about potential career advancements [21][93]
Dream4Drive:一个能够提升下游感知性能的世界模型生成框架
自动驾驶之心· 2025-10-29 00:04
Core Insights - The article discusses the development of Dream4Drive, a new synthetic data generation framework aimed at enhancing downstream perception tasks in autonomous driving, emphasizing the importance of high-quality, controllable multimodal video generation [1][2][5]. Group 1: Background and Motivation - 3D perception tasks like object detection and tracking are critical for decision-making in autonomous driving, but their performance heavily relies on large-scale, manually annotated datasets [4]. - Existing methods for synthetic data generation often overlook the evaluation of downstream perception tasks, leading to a misrepresentation of the effectiveness of synthetic data [5][6]. - The need for diverse and extreme scenario data is highlighted, as current data collection methods are time-consuming and labor-intensive [4]. Group 2: Dream4Drive Framework - Dream4Drive decomposes input videos into multiple 3D-aware guidance maps, rendering 3D assets onto these maps to generate edited, multi-view realistic videos for training perception models [1][9]. - The framework utilizes a large-scale 3D asset dataset, DriveObj3D, which includes typical categories from driving scenarios, supporting diverse 3D perception video editing [2][9]. - Experiments show that Dream4Drive can significantly enhance perception model performance with only 420 synthetic samples, which is less than 2% of the real sample size [6][27]. Group 3: Experimental Results - The article presents comparative results demonstrating that Dream4Drive outperforms existing models in various training epochs, achieving higher mean Average Precision (mAP) and nuScenes Detection Score (NDS) [27][28]. - High-resolution synthetic data (512×768) leads to significant performance improvements, with mAP increasing by 4.6 percentage points (12.7%) and NDS by 4.1 percentage points (8.6%) [29][30]. - The findings indicate that the position of inserted assets affects performance, with distant insertions generally yielding better results due to reduced occlusion issues [37][38]. Group 4: Conclusions and Implications - The study concludes that existing evaluations of synthetic data in autonomous driving are biased, and Dream4Drive provides a more effective approach for generating high-quality synthetic data for perception tasks [40][42]. - The results emphasize the importance of using assets that match the style of the dataset to minimize the domain gap between synthetic and real data, enhancing model training [42].
做了几期线上交流,我发现大家还是太迷茫
自动驾驶之心· 2025-10-24 00:04
Core Viewpoint - The article emphasizes the establishment of a comprehensive community called "Autonomous Driving Heart Knowledge Planet," aimed at providing a platform for knowledge sharing and networking in the autonomous driving industry, addressing the challenges faced by newcomers in the field [1][3][14]. Group 1: Community Development - The community has grown to over 4,000 members and aims to reach nearly 10,000 within two years, providing a space for technical sharing and communication among beginners and advanced learners [3][14]. - The community integrates various resources including videos, articles, learning paths, Q&A, and job exchange, making it a comprehensive hub for autonomous driving enthusiasts [3][5]. Group 2: Learning Resources - The community has organized over 40 technical learning paths, covering topics such as end-to-end autonomous driving, multi-modal large models, and data annotation practices, significantly reducing the time needed for research [5][14]. - Members can access a variety of video tutorials and courses tailored for beginners, covering essential topics in autonomous driving technology [9][15]. Group 3: Industry Insights - The community regularly invites industry experts to discuss trends, technological advancements, and production challenges in autonomous driving, fostering a serious content-driven environment [6][14]. - Members are encouraged to engage with industry leaders for insights on job opportunities and career development within the autonomous driving sector [10][18]. Group 4: Networking Opportunities - The community facilitates connections between members and various autonomous driving companies, offering resume forwarding services to help members secure job placements [10][12]. - Members can freely ask questions regarding career choices and research directions, receiving guidance from experienced professionals in the field [87][89].
执行力是当下自动驾驶的第一生命力
自动驾驶之心· 2025-10-17 16:04
Core Viewpoint - The article discusses the evolving landscape of the autonomous driving industry in China, highlighting the shift in competitive dynamics and the increasing investment in autonomous driving technologies as a core focus of AI development [1][2]. Industry Trends - The autonomous driving sector has undergone significant changes over the past two years, with new players entering the market and existing companies focusing on improving execution capabilities [1]. - The industry experienced a flourishing period before 2022, where companies with standout technologies could thrive, but has since transitioned into a more competitive environment that emphasizes addressing weaknesses [1]. - Companies that remain active in the market are progressively enhancing their hardware, software, AI capabilities, and engineering implementation to survive and excel [1]. Future Outlook - By 2025, the industry is expected to enter a "calm period," where unresolved technical challenges in areas like L3, L4, and Robotaxi will continue to present opportunities for professionals in the field [2]. - The article emphasizes the importance of comprehensive skill sets for individuals in the autonomous driving sector, suggesting that those with a short-term profit mindset may not endure in the long run [2]. Community and Learning Resources - The "Autonomous Driving Heart Knowledge Planet" community has been established to provide a comprehensive platform for learning and sharing knowledge in the autonomous driving field, featuring over 4,000 members and aiming for a growth to nearly 10,000 in the next two years [4][17]. - The community offers a variety of resources, including video content, learning pathways, Q&A sessions, and job exchange opportunities, catering to both beginners and advanced learners [4][6][18]. - Members can access detailed technical routes and practical solutions for various autonomous driving challenges, significantly reducing the time needed for research and learning [6][18]. Technical Focus Areas - The community has compiled over 40 technical routes related to autonomous driving, covering areas such as end-to-end learning, multi-modal models, and various simulation platforms [18][39]. - There is a strong emphasis on practical applications, with resources available for data processing, 4D labeling, and engineering practices in autonomous driving [12][18]. Job Opportunities - The community facilitates job opportunities by connecting members with openings in leading autonomous driving companies, providing a platform for resume submissions and internal referrals [13][22].