自动驾驶世界模型 - filings, earnings calls, financial reports, news - Reportify

自动驾驶世界模型

Search documents

答应大家的《自动驾驶世界模型》课程终于开课了！

自动驾驶之心· 2026-01-06 06:52

正式开课！添加助理领取优惠~ 讲师介绍 Jason：C9本科+QS50 PhD，已发表CCF-A论文2篇，CCF-B论文若干。现任国内TOP主机厂算法专家，目前从事端到端、大模型、世界模型等前沿算法的预研和量产，并已主持和完成多项自动驾驶感知和端到端算法的产品量产交付，拥有丰富的端到端算法研发和实战经验。课程大纲点击下方卡片，关注" 自动驾驶之心 "公众号戳我-> 领取自动驾驶近30个方向学习路线 Jason老师新课《世界模型与自动驾驶小班课》正式开课啦！自动驾驶之心联合工业界大佬共同开展，先前的《端到端与VLA自动驾驶小班课》备受大家好评，因此我们进一步推出这门世界模型小班课，课程聚焦于通用世界模型、视频生成、OCC生成等世界模型算法，涵盖特斯拉世界模型、李飞飞团队Marble等。欢迎大家加入学习~ 这门课程讲如何展开第一章：世界模型介绍第一章主要针对自动驾驶世界模型概括性的内容讲解。这一章老师会先复盘世界模型和端到端自动驾驶的联系，接着讲解世界模型的发展历史以及当下的应用案例。然后介绍世界模型有哪些流派：纯仿真的世界模型、仿真+Planning、生成传感器输入、生成 ...

自动驾驶世界模型

OCC生成类世界模型

通用世界模型

视频生成类世界模型

《世界模型与自动驾驶小班课》

自动驾驶世界模型

OCC生成类世界模型

通用世界模型

视频生成类世界模型

《世界模型与自动驾驶小班课》

双SOTA！GenieDrive：物理一致的自动驾驶世界模型（港大&华为诺亚）

自动驾驶之心· 2025-12-24 00:58

点击下方卡片，关注" 自动驾驶之心 "公众号戳我-> 领取自动驾驶近30个方向学习路线 >>自动驾驶前沿信息获取 → 自动驾驶之心知识星球论文作者 | Zhenya Yang等编辑 | 自动驾驶之心来自香港大学、华为以及华中科技大学的最新工作 GenieDrive，提出了一种以 4D Occupancy 作为中间表示的自动驾驶世界模型框架，在 4D 占据预测、轨迹可控性以及长时序视频生成能力等方面均显著优于现有自动驾驶世界模型，为该领域提供了一条"先生成 4D 占据、再生成视频"的全新研究路径。项目简介 GenieDrive 是一种新型的以 4D Occupancy 作为中间表示的自动驾驶世界模型框架，能够实现高度可控、多视角一致且符合物理规律的自动驾驶视频生成。得益于上述设计，GenieDrive 在仅使用 3.47M 参数的情况下实现了 41 FPS 的推理速度，并在 4D 占据预测任务上取得了 7.2% 的 mIoU 提升；同时，通过在视频生成阶段引入归一化多视角注意力机制，在 4D Occupancy 的引导下显著提升了多视角视频生成质量，将 FVD 指标降低了 20.7% ...

自动驾驶世界模型

自动驾驶世界模型

理想一篇OCC世界模型：全新轨迹条件稀疏占用世界模型SparseWorld-TC

自动驾驶之心· 2025-12-16 03:16

点击下方卡片，关注" 自动驾驶之心 "公众号戳我-> 领取自动驾驶近30个方向学习路线 >>自动驾驶前沿信息获取 → 自动驾驶之心知识星球论文作者 | Jiayuan Du等编辑 | 自动驾驶之心端到端自动驾驶预测技术正迎来革命性突破！传统方法依赖鸟瞰图（BEV）表示或离散令牌化，这些技术虽然在一定程度上推动了行业发展，但本质上限制了模型对复杂时空依赖的捕捉能力。特别是在长时预测和动态场景中，如何平衡精度与效率成为制约技术落地的核心挑战。如果仅靠VAE编码或BEV先验，往往难以兼顾细粒度细节和实时性需求。近年来，基于注意力的Transformer架构在语言和视觉领域的成功，为自动驾驶世界模型提供了全新思路——能否直接利用稀疏表示绕过中间表示，实现更灵活的时空建模？这是一个值得深入探索的方向。现有4D占用预测方法大多面临两大本质局限：一是过度依赖VAE等离散令牌化技术，导致表征容量受限，细节信息丢失；二是BEV表示引入的显式几何先验，虽然简化了问题形式化，但严重限制了特征交互的灵活性。例如，OccWorld和RenderWorld采用VQ-VAE编码场景，虽然简化了生成过程，却 ...

自动驾驶世界模型

自动驾驶世界模型

工业界大佬带队！彻底搞懂自动驾驶世界模型...

自动驾驶之心· 2025-12-11 03:35

Core Viewpoint - The article introduces a new course titled "World Models and Autonomous Driving Small Class," focusing on advanced algorithms in the field of autonomous driving, including general world models, video generation, and OCC generation [1][3]. Course Overview - The course is developed in collaboration with industry leaders and follows the success of a previous course on end-to-end and VLA autonomous driving [1]. - The course aims to enhance understanding and practical skills in world models, targeting individuals interested in the autonomous driving industry [11]. Course Structure - **Chapter 1: Introduction to World Models** - Discusses the relationship between world models and end-to-end autonomous driving, including historical development and current applications [6]. - Covers various types of world models, such as pure simulation, simulation + planning, and generation of sensor inputs and perception results [6]. - **Chapter 2: Background Knowledge of World Models** - Focuses on foundational knowledge, including scene representation, Transformer, and BEV perception [6][12]. - Highlights key technical terms frequently encountered in job interviews related to world models [7]. - **Chapter 3: General World Model Exploration** - Examines popular models like Marble from Li Fei-Fei's team, DeepMind's Genie 3, and Meta's JEPA, along with recent discussions on VLA + world model algorithms [7]. - **Chapter 4: Video Generation-Based World Models** - Concentrates on video generation algorithms, starting with Wayve's GAIA-1 & GAIA-2 and extending to recent works like UniScene and OpenDWM [8]. - **Chapter 5: OCC-Based World Models** - Focuses on OCC generation methods, discussing three major papers and a practical project that extends to vehicle trajectory planning [9]. - **Chapter 6: World Model Job Specialization** - Provides insights into the application of world models in the industry, addressing pain points and interview preparation for relevant positions [10]. Learning Outcomes - The course aims to equip participants with the skills to reach a level equivalent to one year of experience as a world model autonomous driving algorithm engineer [14]. - Participants will gain a comprehensive understanding of world model technologies, including video generation and OCC generation methods, and will be able to apply their knowledge in practical projects [14].

自动驾驶世界模型

通用世界模型

视频生成世界模型

OCC生成世界模型

《世界模型与自动驾驶小班课》

自动驾驶世界模型

通用世界模型

视频生成世界模型

OCC生成世界模型

《世界模型与自动驾驶小班课》

自驾世界模型剩下的论文窗口期没多久了......

自动驾驶之心· 2025-12-11 00:05

Core Insights - The article highlights the recent surge in research papers related to world models in autonomous driving, indicating a trend towards localized breakthroughs and verifiable improvements in the field [1] - It emphasizes the importance of refining submissions to top conferences, suggesting that the final 10% of polishing can significantly impact the overall quality and acceptance of the paper [2] - The platform "Autonomous Driving Heart" is presented as a leading AI technology media outlet in China, with a strong focus on autonomous driving and related interdisciplinary fields [3] Summary by Sections Research Trends - Numerous recent works in autonomous driving, such as MindDrive and SparseWorld-TC, reflect a focus on world models, which are expected to dominate upcoming conferences [1] - The article suggests that the main themes for the end of this year and the first half of next year will likely revolve around world models, indicating a strategic direction for researchers [1] Guidance and Support - The platform offers personalized guidance for students, helping them navigate the complexities of research and paper submission processes [7][13] - It claims a high success rate, with a 96% acceptance rate for students who have received guidance over the past three years [5] Faculty and Resources - The platform boasts over 300 dedicated instructors from top global universities, ensuring high-quality mentorship for students [5] - The instructors have extensive experience in publishing at top-tier conferences and journals, providing students with valuable insights and support [5] Services Offered - The article outlines various services, including personalized paper guidance, real-time interaction with mentors, and comprehensive support throughout the research process [13] - It also mentions the potential for students to receive recommendations from prestigious institutions and direct job placements in leading tech companies [19]

自动驾驶世界模型

自动驾驶之心论文辅导

自动驾驶世界模型

自动驾驶之心论文辅导

和港校自驾博士交流后的一些分享......

自动驾驶之心· 2025-11-20 00:05

Core Viewpoint - The article emphasizes the importance of building a comprehensive community for autonomous driving, providing resources, networking opportunities, and guidance for both newcomers and experienced professionals in the field [6][16][19]. Group 1: Community and Networking - The "Autonomous Driving Heart Knowledge Planet" community aims to create a platform for technical exchange and collaboration among members from renowned universities and leading companies in the autonomous driving sector [16][19]. - The community has grown to over 4,000 members and aims to reach nearly 10,000 within two years, facilitating discussions on technology trends and industry developments [6][7]. - Members can freely ask questions regarding career choices and research directions, receiving insights from industry experts [89][92]. Group 2: Learning Resources - The community offers a variety of learning materials, including video tutorials, technical routes, and Q&A sessions, covering over 40 technical directions in autonomous driving [9][11][16]. - Specific learning paths are provided for newcomers, including foundational courses and advanced topics in areas such as end-to-end driving, multi-sensor fusion, and 3D target detection [11][17][36]. - The community has compiled a comprehensive list of open-source projects and datasets relevant to autonomous driving, aiding members in their research and development efforts [32][34][36]. Group 3: Career Development - The community facilitates job referrals and connections with various autonomous driving companies, enhancing members' employment opportunities [11][19]. - Regular discussions with industry leaders are organized to explore career paths, job openings, and the latest trends in the autonomous driving field [8][19][92]. - Members are encouraged to engage in research collaborations and internships, particularly for those pursuing advanced degrees in related fields [3][6][16].

端到端自动驾驶

自动驾驶世界模型

自动驾驶之心知识星球

端到端自动驾驶

自动驾驶世界模型

自动驾驶之心知识星球

跨行转入自动驾驶大厂的经验分享

自动驾驶之心· 2025-11-04 00:03

Core Insights - The article emphasizes the importance of seizing opportunities and continuous learning in the rapidly evolving field of autonomous driving [1][4] - It highlights the creation of a comprehensive community platform, "Autonomous Driving Heart Knowledge Planet," aimed at facilitating knowledge sharing and career development in the autonomous driving sector [4][16] Group 1: Career Development - Transitioning to the autonomous driving industry can be successful through dedication and preparation, as illustrated by the experience of a professional who switched careers and excelled in various roles [1] - Continuous learning and adapting to industry trends are crucial for career advancement, as demonstrated by the professional's progression from algorithm evaluation to advanced safety algorithms [1] Group 2: Community and Resources - The "Autonomous Driving Heart Knowledge Planet" has over 4,000 members and aims to grow to nearly 10,000 in two years, providing a platform for discussion, technical sharing, and job opportunities [4][16] - The community offers a variety of resources, including video content, learning pathways, and Q&A sessions, to support both beginners and advanced learners in the autonomous driving field [7][10] Group 3: Technical Learning and Networking - The community organizes discussions with industry experts on various topics, including entry points for end-to-end autonomous driving and the integration of multi-sensor fusion [8][20] - Members have access to a wealth of technical routes and resources, including over 40 technical pathways and numerous datasets relevant to autonomous driving [10][36] Group 4: Job Opportunities - The community facilitates job referrals and connections with leading companies in the autonomous driving sector, enhancing members' chances of securing positions in the industry [11][12] - Regular updates on job openings and industry trends are provided, helping members stay informed about potential career advancements [21][93]

自动驾驶世界模型

视觉语言模型（VLM）

自动驾驶VLA

端到端自动驾驶

自动驾驶世界模型

视觉语言模型（VLM）

自动驾驶VLA

端到端自动驾驶

Dream4Drive：一个能够提升下游感知性能的世界模型生成框架

自动驾驶之心· 2025-10-29 00:04

Core Insights - The article discusses the development of Dream4Drive, a new synthetic data generation framework aimed at enhancing downstream perception tasks in autonomous driving, emphasizing the importance of high-quality, controllable multimodal video generation [1][2][5]. Group 1: Background and Motivation - 3D perception tasks like object detection and tracking are critical for decision-making in autonomous driving, but their performance heavily relies on large-scale, manually annotated datasets [4]. - Existing methods for synthetic data generation often overlook the evaluation of downstream perception tasks, leading to a misrepresentation of the effectiveness of synthetic data [5][6]. - The need for diverse and extreme scenario data is highlighted, as current data collection methods are time-consuming and labor-intensive [4]. Group 2: Dream4Drive Framework - Dream4Drive decomposes input videos into multiple 3D-aware guidance maps, rendering 3D assets onto these maps to generate edited, multi-view realistic videos for training perception models [1][9]. - The framework utilizes a large-scale 3D asset dataset, DriveObj3D, which includes typical categories from driving scenarios, supporting diverse 3D perception video editing [2][9]. - Experiments show that Dream4Drive can significantly enhance perception model performance with only 420 synthetic samples, which is less than 2% of the real sample size [6][27]. Group 3: Experimental Results - The article presents comparative results demonstrating that Dream4Drive outperforms existing models in various training epochs, achieving higher mean Average Precision (mAP) and nuScenes Detection Score (NDS) [27][28]. - High-resolution synthetic data (512×768) leads to significant performance improvements, with mAP increasing by 4.6 percentage points (12.7%) and NDS by 4.1 percentage points (8.6%) [29][30]. - The findings indicate that the position of inserted assets affects performance, with distant insertions generally yielding better results due to reduced occlusion issues [37][38]. Group 4: Conclusions and Implications - The study concludes that existing evaluations of synthetic data in autonomous driving are biased, and Dream4Drive provides a more effective approach for generating high-quality synthetic data for perception tasks [40][42]. - The results emphasize the importance of using assets that match the style of the dataset to minimize the domain gap between synthetic and real data, enhancing model training [42].

合成数据生成

自动驾驶世界模型

合成数据生成

自动驾驶世界模型

做了几期线上交流，我发现大家还是太迷茫

自动驾驶之心· 2025-10-24 00:04

Core Viewpoint - The article emphasizes the establishment of a comprehensive community called "Autonomous Driving Heart Knowledge Planet," aimed at providing a platform for knowledge sharing and networking in the autonomous driving industry, addressing the challenges faced by newcomers in the field [1][3][14]. Group 1: Community Development - The community has grown to over 4,000 members and aims to reach nearly 10,000 within two years, providing a space for technical sharing and communication among beginners and advanced learners [3][14]. - The community integrates various resources including videos, articles, learning paths, Q&A, and job exchange, making it a comprehensive hub for autonomous driving enthusiasts [3][5]. Group 2: Learning Resources - The community has organized over 40 technical learning paths, covering topics such as end-to-end autonomous driving, multi-modal large models, and data annotation practices, significantly reducing the time needed for research [5][14]. - Members can access a variety of video tutorials and courses tailored for beginners, covering essential topics in autonomous driving technology [9][15]. Group 3: Industry Insights - The community regularly invites industry experts to discuss trends, technological advancements, and production challenges in autonomous driving, fostering a serious content-driven environment [6][14]. - Members are encouraged to engage with industry leaders for insights on job opportunities and career development within the autonomous driving sector [10][18]. Group 4: Networking Opportunities - The community facilitates connections between members and various autonomous driving companies, offering resume forwarding services to help members secure job placements [10][12]. - Members can freely ask questions regarding career choices and research directions, receiving guidance from experienced professionals in the field [87][89].

端到端自动驾驶

自动驾驶世界模型

视觉语言模型（VLM）

自动驾驶VLA

端到端自动驾驶

自动驾驶世界模型

视觉语言模型（VLM）

自动驾驶VLA

执行力是当下自动驾驶的第一生命力

自动驾驶之心· 2025-10-17 16:04

Core Viewpoint - The article discusses the evolving landscape of the autonomous driving industry in China, highlighting the shift in competitive dynamics and the increasing investment in autonomous driving technologies as a core focus of AI development [1][2]. Industry Trends - The autonomous driving sector has undergone significant changes over the past two years, with new players entering the market and existing companies focusing on improving execution capabilities [1]. - The industry experienced a flourishing period before 2022, where companies with standout technologies could thrive, but has since transitioned into a more competitive environment that emphasizes addressing weaknesses [1]. - Companies that remain active in the market are progressively enhancing their hardware, software, AI capabilities, and engineering implementation to survive and excel [1]. Future Outlook - By 2025, the industry is expected to enter a "calm period," where unresolved technical challenges in areas like L3, L4, and Robotaxi will continue to present opportunities for professionals in the field [2]. - The article emphasizes the importance of comprehensive skill sets for individuals in the autonomous driving sector, suggesting that those with a short-term profit mindset may not endure in the long run [2]. Community and Learning Resources - The "Autonomous Driving Heart Knowledge Planet" community has been established to provide a comprehensive platform for learning and sharing knowledge in the autonomous driving field, featuring over 4,000 members and aiming for a growth to nearly 10,000 in the next two years [4][17]. - The community offers a variety of resources, including video content, learning pathways, Q&A sessions, and job exchange opportunities, catering to both beginners and advanced learners [4][6][18]. - Members can access detailed technical routes and practical solutions for various autonomous driving challenges, significantly reducing the time needed for research and learning [6][18]. Technical Focus Areas - The community has compiled over 40 technical routes related to autonomous driving, covering areas such as end-to-end learning, multi-modal models, and various simulation platforms [18][39]. - There is a strong emphasis on practical applications, with resources available for data processing, 4D labeling, and engineering practices in autonomous driving [12][18]. Job Opportunities - The community facilitates job opportunities by connecting members with openings in leading autonomous driving companies, providing a platform for resume submissions and internal referrals [13][22].

端到端自动驾驶

自动驾驶世界模型

视觉语言模型（VLM）

自动驾驶VLA

端到端自动驾驶

自动驾驶世界模型

视觉语言模型（VLM）

自动驾驶VLA