Workflow
占用网络
icon
Search documents
通用障碍物漏检,得升级下Occ自动标注模型了。。。
自动驾驶之心· 2025-08-11 23:33
Core Viewpoint - The article discusses the challenges and methodologies related to the automation of occupancy network (OCC) data labeling in the context of autonomous driving, emphasizing the need for high-quality training data to improve model generalization and safety. Group 1: OCC Data Labeling Challenges - The need for high-quality training data is highlighted due to incidents caused by undetected obstacles, such as fallen tree branches during adverse weather conditions [2]. - The OCC network is essential for modeling irregular obstacles and background elements, which increases the demand for accurate data labeling [5]. - The automation of OCC data labeling is being pursued by many companies to enhance model performance and reduce costs associated with manual labeling [2][10]. Group 2: Automation Techniques - The common process for generating OCC training ground truth involves three main methods: 2D-3D object detection consistency, comparison with edge models, and manual intervention for quality control [9]. - High-quality automated labeling data can be used for both vehicle model training and cloud model optimization, facilitating continuous iteration [10]. Group 3: 4D Automated Labeling Course - A course is introduced that covers the entire process of 4D automated labeling, including dynamic and static object detection, and the challenges faced in real-world applications [10][12]. - The course aims to address the difficulties in learning and advancing in the field of automated driving data labeling, providing a comprehensive understanding of core algorithms and practical applications [10][11]. Group 4: Key Learning Outcomes - Participants will gain knowledge of the entire 4D automated labeling process, including dynamic obstacle detection, SLAM reconstruction, and the generation of end-to-end ground truth [12][20]. - The course also focuses on the practical implementation of algorithms and the resolution of common issues encountered in the industry [15][22]. Group 5: Target Audience - The course is designed for various groups, including researchers, students, and professionals looking to transition into the field of data closure in autonomous driving [26][31].
出现断层了?ICCV2025的自动驾驶方向演变...
自动驾驶之心· 2025-07-24 09:42
Core Insights - The article highlights the latest advancements in autonomous driving technologies, focusing on various research papers and frameworks that contribute to the field [2][3]. Multimodal Models & VLA - ORION presents a holistic end-to-end framework for autonomous driving, utilizing vision-language instructed action generation [5]. - An all-in-one large multimodal model for autonomous driving is introduced, showcasing its potential applications [6][7]. - MCAM focuses on multimodal causal analysis for ego-vehicle-level driving video understanding [9]. - AdaDrive and VLDrive emphasize self-adaptive systems and lightweight models for efficient language-grounded autonomous driving [10]. Simulation & Reconstruction - ETA proposes a dual approach to self-driving with large models, enhancing efficiency through forward-thinking [13]. - InvRGB+L introduces inverse rendering techniques for complex scene modeling [14]. - AD-GS and BézierGS focus on object-aware scene reconstruction and dynamic urban scene reconstruction, respectively [18][19]. End-to-End & Trajectory Prediction - Epona presents an autoregressive diffusion world model for autonomous driving, enhancing trajectory prediction capabilities [25]. - World4Drive introduces an intention-aware physical latent world model for end-to-end autonomous driving [30]. - MagicDrive-V2 focuses on high-resolution long video generation for autonomous driving with adaptive control [35]. Occupancy Networks - The article discusses advancements in 3D semantic occupancy prediction, highlighting the transition from binary to semantic data [44]. - GaussRender and GaussianOcc focus on learning 3D occupancy with Gaussian rendering techniques [52][54]. Object Detection - Several papers address 3D object detection, including MambaFusion, which emphasizes height-fidelity dense global fusion for multi-modal detection [64]. - OcRFDet explores object-centric radiance fields for multi-view 3D object detection in autonomous driving [69]. Datasets - The ROADWork Dataset aims to improve recognition and analysis of work zones in driving scenarios [73]. - Research on driver attention prediction and motion planning is also highlighted, showcasing the importance of understanding driver behavior in autonomous systems [74][75].