Workflow
自动驾驶之心
icon
Search documents
执行力是当下自动驾驶的第一生命力
自动驾驶之心· 2025-10-17 16:04
Core Viewpoint - The article discusses the evolving landscape of the autonomous driving industry in China, highlighting the shift in competitive dynamics and the increasing investment in autonomous driving technologies as a core focus of AI development [1][2]. Industry Trends - The autonomous driving sector has undergone significant changes over the past two years, with new players entering the market and existing companies focusing on improving execution capabilities [1]. - The industry experienced a flourishing period before 2022, where companies with standout technologies could thrive, but has since transitioned into a more competitive environment that emphasizes addressing weaknesses [1]. - Companies that remain active in the market are progressively enhancing their hardware, software, AI capabilities, and engineering implementation to survive and excel [1]. Future Outlook - By 2025, the industry is expected to enter a "calm period," where unresolved technical challenges in areas like L3, L4, and Robotaxi will continue to present opportunities for professionals in the field [2]. - The article emphasizes the importance of comprehensive skill sets for individuals in the autonomous driving sector, suggesting that those with a short-term profit mindset may not endure in the long run [2]. Community and Learning Resources - The "Autonomous Driving Heart Knowledge Planet" community has been established to provide a comprehensive platform for learning and sharing knowledge in the autonomous driving field, featuring over 4,000 members and aiming for a growth to nearly 10,000 in the next two years [4][17]. - The community offers a variety of resources, including video content, learning pathways, Q&A sessions, and job exchange opportunities, catering to both beginners and advanced learners [4][6][18]. - Members can access detailed technical routes and practical solutions for various autonomous driving challenges, significantly reducing the time needed for research and learning [6][18]. Technical Focus Areas - The community has compiled over 40 technical routes related to autonomous driving, covering areas such as end-to-end learning, multi-modal models, and various simulation platforms [18][39]. - There is a strong emphasis on practical applications, with resources available for data processing, 4D labeling, and engineering practices in autonomous driving [12][18]. Job Opportunities - The community facilitates job opportunities by connecting members with openings in leading autonomous driving companies, providing a platform for resume submissions and internal referrals [13][22].
自驾行业完整的基建,更值得毕业的同学做探索!
自动驾驶之心· 2025-10-17 00:03
Core Viewpoint - The autonomous driving industry is maturing in terms of infrastructure and investment, making it a suitable field for students and professionals to explore and develop their skills [1][16]. Group 1: Industry Insights - The technology landscape in autonomous driving is consolidating, but there are still many product forms to refine, indicating ongoing opportunities for innovation [1]. - The industry is currently debating the technical routes of world models and VLA, suggesting that while theoretical aspects may be solidifying, practical implementation remains a challenge [1]. - The focus on L2 functionality and the regulatory progress for L3 indicates a gradual evolution towards more advanced levels of automation, with L4 still facing unresolved issues [1]. Group 2: Community and Learning Resources - A community called "Autonomous Driving Heart Knowledge Sphere" has been established, which integrates various resources such as videos, articles, learning paths, and job exchange, aimed at fostering collaboration and knowledge sharing [4][5]. - The community has grown to over 4,000 members, with a goal to reach nearly 10,000 in the next two years, providing a platform for both beginners and advanced learners [5]. - The community offers practical guidance on various topics, including entry points for end-to-end learning, multi-modal large models, and data annotation practices [7][8]. Group 3: Career Opportunities - The community actively shares job openings and facilitates connections between members and companies in the autonomous driving sector, enhancing employment opportunities [12][21]. - There is a focus on developing comprehensive learning paths for newcomers, ensuring they have access to a well-rounded education in autonomous driving technologies [17][38]. Group 4: Technical Development - The community has compiled over 40 technical routes and resources related to autonomous driving, covering areas such as perception, simulation, planning, and control [17][34]. - Regular discussions and live sessions with industry experts are held to explore trends, technical directions, and production challenges in autonomous driving [8][90].
工业界和学术界都在怎么搞端到端和VLA?
自动驾驶之心· 2025-10-17 00:03
Core Insights - The article discusses the evolution of end-to-end algorithms in autonomous driving, highlighting the transition from modular production algorithms to end-to-end and now to Vision-Language Alignment (VLA) models [1][3] - It emphasizes the rich technology stack involved in end-to-end algorithms, including BEV perception, visual language models (VLM), diffusion models, reinforcement learning, and world models [3] Summary by Sections End-to-End Algorithms - End-to-end algorithms are categorized into two main paradigms: single-stage and two-stage, with UniAD being a representative of the single-stage approach [1] - Single-stage can further branch into various subfields, particularly those based on VLA, which have seen a surge in related publications and industrial applications in recent years [1] Courses Offered - The article promotes two courses: "End-to-End and VLA Autonomous Driving Small Class" and "Practical Course on Autonomous Driving VLA and Large Models," aimed at helping individuals quickly and efficiently enter the field [3] - The "Practical Course" focuses on VLA, covering topics from VLM as an autonomous driving interpreter to modular and integrated VLA, along with detailed theoretical foundations [3][12] Instructor Team - The instructor team includes experts from both academia and industry, with backgrounds in multi-modal perception, autonomous driving VLA, and large model frameworks [8][11][14] - Notable instructors have published numerous papers in top-tier conferences and have extensive experience in research and practical applications in autonomous driving and large models [8][11][14] Target Audience - The courses are designed for individuals with a foundational understanding of autonomous driving, familiar with basic modules, and have knowledge of transformer models, reinforcement learning, and BEV perception [15][17]
世界模型VLA!DriveVLA-W0:7000万数据解锁自动驾驶VLA Scaling(中科院&引望)
自动驾驶之心· 2025-10-17 00:03
Core Insights - The article discusses the introduction of the DriveVLA-W0 training paradigm by the Chinese Academy of Sciences and Huawei, which addresses the "supervision deficit" issue in VLA models for autonomous driving [2][5][30] - The proposed method enhances the model's ability to learn from sparse action signals by incorporating world modeling tasks to generate dense self-supervised signals, thereby improving the model's performance as the training dataset scales [4][30][31] Summary by Sections Background - Scaling laws present an attractive path for achieving more generalizable driving intelligence, with expectations to utilize PB-level driving data for training robust foundational models [5] - The current challenge lies in the mismatch between the large scale of VLA models and the sparse supervision signals, leading to a "supervision deficit" that limits the model's ability to learn rich world representations [5][30] DriveVLA-W0 Paradigm - The DriveVLA-W0 paradigm introduces world modeling as a strong self-supervised approach to supplement sparse action signals, allowing the model to learn the underlying dynamics of driving environments [5][30] - The method has been validated on two mainstream VLA architectures, demonstrating significant improvements over baseline models [4][6] Experimental Validation - Extensive experiments on various datasets, including a large internal dataset of 70 million frames, confirm that the world modeling approach amplifies data scaling laws, leading to enhanced model performance [11][30] - The introduction of a lightweight action expert based on a mixture-of-experts (MoE) architecture reduces inference latency to 63.1% of the baseline model while maintaining strong performance [11][20] Key Contributions - The article identifies "supervision deficit" as a critical bottleneck in VLA scaling and proposes the DriveVLA-W0 paradigm to address this issue [11][30] - The findings reveal that as data scales up, the performance trend of action decoders reverses, with simpler autoregressive models outperforming more complex flow-matching models in large datasets [30][31] Conclusion - The research emphasizes that adopting predictive world modeling is crucial for unlocking the potential of large-scale data and achieving more generalizable driving intelligence [30][31]
千里智驾的软硬一体
自动驾驶之心· 2025-10-17 00:03
Core Insights - The article discusses the collaboration between Qianli Zhijia and Aixin Yuanzhi in the autonomous driving chip sector, highlighting the importance of integrating hardware and software for advanced driving algorithms [7][9] - It emphasizes the shift from L2+ to L3 and Robotaxi as the main battleground for autonomous driving companies, with L3 and Robotaxi expected to unlock significant commercial value [8] - The need for higher computing power in the next generation of autonomous driving chips is noted, with expectations of reaching several thousand TOPS, which will increase costs and necessitate cost-reduction strategies [8] Group 1 - Qianli Zhijia is merging its algorithm capabilities with Aixin Yuanzhi's chip technology to enhance its autonomous driving solutions [7] - The collaboration is seen as crucial for Qianli Zhijia to achieve its ambitious technical goals, which include L2+, L3, and Robotaxi [7] - The article mentions that major players in the industry, including Tesla and domestic new forces, are preparing for a significant hardware and software iteration [7][8] Group 2 - L2+ is described as a preliminary stage, while L3 and Robotaxi represent the future of autonomous driving, with the potential for companies to achieve valuations in the billions [8] - The article suggests that the next generation of chips will require tighter collaboration between algorithms and chip manufacturers, moving away from reliance on generic chips [8][9] - Aixin Yuanzhi has established a foothold in the mid-to-low tier autonomous driving market and stands to gain significantly from this partnership with Qianli Zhijia [9]
最新自进化综述!从静态模型到终身进化...
自动驾驶之心· 2025-10-17 00:03
Core Viewpoint - The article discusses the limitations of current AI agents, which rely heavily on static configurations and struggle to adapt to dynamic environments. It introduces the concept of "self-evolving AI agents" as a solution to these challenges, providing a systematic framework for their development and implementation [1][5][6]. Summary by Sections Need for Self-Evolving AI Agents - The rapid development of large language models (LLMs) has shown the potential of AI agents in various fields, but they are fundamentally limited by their dependence on manually designed static configurations [5][6]. Definition and Goals - Self-evolving AI agents are defined as autonomous systems that continuously and systematically optimize their internal components through interaction with their environment, adapting to changes in tasks, context, and resources while ensuring safety and performance [6][12]. Three Laws and Evolution Stages - The article outlines three laws for self-evolving AI agents, inspired by Asimov's laws, which serve as constraints during the design process [8][12]. It also describes a four-stage evolution process for LLM-driven agents, transitioning from static models to self-evolving systems [9]. Four-Component Feedback Loop - A unified technical framework is proposed, consisting of four components: system inputs, agent systems, environments, and optimizers, which work together in a feedback loop to facilitate the evolution of AI agents [10][11]. Technical Framework and Optimization - The article categorizes the optimization of self-evolving AI into three main directions: single-agent optimization, multi-agent optimization, and domain-specific optimization, detailing various techniques and methodologies for each [20][21][30]. Domain-Specific Applications - The paper highlights the application of self-evolving AI in specific fields such as biomedicine, programming, finance, and law, emphasizing the need for tailored approaches to meet the unique challenges of each domain [30][31][33]. Evaluation and Safety - The article discusses the importance of establishing evaluation methods to measure the effectiveness of self-evolving AI and addresses safety concerns associated with their evolution, proposing continuous monitoring and auditing mechanisms [34][40]. Future Challenges and Directions - The article identifies key challenges in the development of self-evolving AI, including balancing safety with evolution efficiency, improving evaluation systems, and enabling cross-domain adaptability [41][42]. Conclusion - The ultimate goal of self-evolving AI agents is to create systems that can collaborate with humans as partners rather than merely executing commands, marking a significant shift in the understanding and application of AI technology [42].
刚刚,一家车企的具身团队原地解散了......
自动驾驶之心· 2025-10-16 08:06
Core Viewpoint - A prominent embodied intelligence company, OneStar Robotics, has unexpectedly disbanded shortly after its establishment, raising questions about the reasons behind this sudden decision [3][5][20]. Company Overview - OneStar Robotics was founded on May 9, 2025, by Li Xingxing, the son of Geely's founder Li Shufu, and was positioned as a key player in the robotics sector for Geely [6][10]. - The company aimed to develop embodied intelligence, focusing on practical applications rather than just technological demonstrations [11][13]. Recent Developments - Just over a month prior to the disbandment, OneStar Robotics announced new financing and the recruitment of notable AI expert Ding Yan as CTO and co-founder [4][18]. - The company had secured several rounds of funding, including a multi-million yuan "friends and family" round and a seed round involving various investors from the Geely ecosystem [16][20]. Disbandment Details - The disbandment occurred without prior notice, and it is reported that the company did not have to compensate employees due to its short operational period [3][8][22]. - There are indications that the existing Geely-related foundational platform and business may revert to Geely Auto Group, while the technology team led by Ding Yan might pursue independent ventures [9][20]. Research and Development Strategy - OneStar Robotics distinguished itself by adopting a "scene-first" approach, focusing on real-world tasks and production scenarios to drive algorithm design and operational processes [13][14]. - The company collaborated with prestigious academic institutions, including Fudan University and Tsinghua University, to build a robust research framework [12].
果然!秋招会惩罚每一个本末倒置的研究生!
自动驾驶之心· 2025-10-16 04:00
随着秋招开始,有不少同学又开始了内耗和焦虑。就业形势一直在变,大家对于工作的渴望却日渐攀升,纷纷 感慨: 如果能重来,我一定多发论文, 多积攒项 目实践! 对于毕业的小伙伴们, 我能给到的建议就是 抓紧时间、注重复盘、注重复盘、整合资源。 简单来说就是校 招、社招两手抓,有的放矢进行查漏补缺,要善于利用一切资源来帮助自己决策。 对于刚刚开始科研生涯或者在读的同学们, 我的建议是被动等待难以突破,多点成果就业深造都有底气! 对 于寻求系统性科研能力提升与成果产出的朋友,自动驾驶之心推出了一站式科研辅导课程。 ★ 有位研二学员,毕业要求发小论文,但自己导师散养,找到了我们指导,3个月顺利完成一篇SCI论文。 为什么选我们? 自动驾驶之心作为国内最大的AI类技术自媒体平台,IP包含自动驾驶之心/具身智能之心/3D视觉之心等平 台, 拥有国内最顶尖的学术资源。 深耕自动驾驶、具身智能、机器人 方向多年。我们深刻理解这些交叉学科 的挑战与机遇,更明白一篇高质量论文对于学生(尤其是硕博生)学业和未来发展的重要性。 我们300+专职于自动驾驶/具身智能方向的老师。来自于全球QS排名前100的老师,发表过多篇顶会/子刊/A ...
新势力不再只是 “蔚小理”,“BIG 6+1” 挑战比亚迪
自动驾驶之心· 2025-10-16 04:00
Core Viewpoint - The article discusses the evolution of the new energy vehicle market in China, highlighting the shift from the "Wei Xiaoli" (NIO, Xpeng, Li Auto) representation of new car manufacturers to a broader classification of seven key players, termed "BIG 6+1," which includes Tesla, Leap Motor, AITO, Xiaomi, Xpeng, Li Auto, and NIO. This shift reflects the changing market dynamics as new entrants gain significant market share and challenge established brands like BYD [1][15]. Group 1: Market Dynamics - By 2025, the penetration rate of new energy vehicles in China is expected to exceed 50%, leading to the market's accelerated elimination of some new car manufacturers [1]. - In August 2025, the total insurance volume of seven new energy vehicle manufacturers approached or briefly surpassed that of BYD, the market leader [1][13]. - The "BIG 6+1" collectively accounted for approximately 30% of the entire market, with a significant share in the new energy segment [15]. Group 2: Classification of New Energy Manufacturers - A clear distinction is made between manufacturers with fuel vehicle production qualifications and those without, with only seven companies in the top 40 insurance volume rankings lacking such qualifications [2]. - The seven new energy vehicle manufacturers identified are Tesla, Leap Motor, AITO, Xiaomi, Xpeng, Li Auto, and NIO, with their respective market shares in August 2025 being 2.81%, 2.52%, 2.19%, 1.79%, 1.71%, 1.53%, and 1.40% [4][14]. Group 3: Sales and Market Share - The sales rankings for August 2025 show BYD leading with 284,005 units sold, followed by other brands, with the "BIG 6+1" collectively nearing BYD's sales figures [3][14]. - The average selling prices of the "BIG 6+1" brands vary, with Tesla at 29.67 million yuan, Li Auto at 34.90 million yuan, and Leap Motor at 12.98 million yuan, indicating a diverse pricing strategy among these manufacturers [9][11]. Group 4: Product Strategy and Offerings - The "BIG 6+1" brands have a varied product lineup, with most brands offering around seven models, while Xiaomi has the least with three models [5]. - The product pricing strategy shows a concentration in the 20,000 to 40,000 yuan range, with the cheapest model from Leap Motor priced at around 50,000 yuan [7][12]. Group 5: Future Outlook - The article suggests that as the "BIG 6+1" brands stabilize their sales figures, they will likely lead the new energy vehicle market, marking a new phase in the industry's development [15]. - Upcoming product launches from these brands, such as the AITO M7 and NIO ES8, are expected to further enhance their market positions and sales potential [15].
NeurIPS'25高分论文!华科、浙大&小米提出深度估计新范式
自动驾驶之心· 2025-10-15 23:33
Research Motivation and Contribution - The core issue in existing depth estimation methods is the "Flying Pixels" problem, which leads to erroneous actions in robotic decision-making and ghosting in 3D reconstruction [2] - The proposed method, Pixel-Perfect Depth (PPD), aims to eliminate artifacts caused by VAE compression by performing diffusion directly in pixel space [6] Innovation and Methodology - PPD introduces a novel diffusion model that operates in pixel space, addressing challenges of maintaining global semantic consistency and local detail accuracy [6][9] - The model incorporates a Semantics-Prompted Diffusion Transformer (SP-DiT) that enhances the modeling capabilities by integrating high-level semantic features during the diffusion process [9][16] Results and Performance - PPD outperforms existing generative depth estimation models across five public benchmarks, showing significant improvements in edge point cloud evaluation and producing depth maps with minimal "Flying Pixels" [14][20] - The model demonstrates exceptional zero-shot generalization capabilities, achieving superior performance without relying on pre-trained image priors [20][22] Experimental Analysis - A comprehensive ablation study indicates that the proposed SP-DiT significantly enhances performance metrics, with an 78% improvement in the AbsRel metric on the NYUv2 dataset compared to baseline models [25][26] - The introduction of a Cascaded DiT design improves computational efficiency by reducing inference time by 30% while maintaining high accuracy [26][27] Edge Point Cloud Evaluation - The model aims to generate pixel-perfect depth maps, addressing the challenge of evaluating edge accuracy through a newly proposed Edge-Aware Point Cloud Metric [28][30] - Experimental results confirm that PPD effectively avoids the "Flying Pixels" issue, demonstrating superior performance in edge accuracy compared to existing methods [28][34] Conclusion - PPD represents a significant advancement in depth estimation, providing high-quality outputs with sharp structures and clear edges, while minimizing artifacts [34][35] - The research opens new avenues for high-fidelity depth estimation based on diffusion models, emphasizing the importance of maintaining both global semantics and local geometric consistency [35]