端到端自动驾驶技术
Search documents
11月车市基本符合预期,英伟达开源VLA模型
CAITONG SECURITIES· 2025-12-11 13:00
Group 1: Automotive Market Overview - In November, the national retail sales of passenger cars reached 2.225 million units, a year-on-year decrease of 8.1% and a month-on-month decrease of 1.1%, aligning with the initial forecast of a "low start, medium growth, and stable end" trend for the year [5][11][20] - Cumulative retail sales for the year reached 21.483 million units, reflecting a year-on-year growth of 6.1%. The growth rate fluctuated throughout the year, with a notable slowdown in the fourth quarter due to high base effects from the previous year [5][11][20] - The "old-for-new" subsidy policy has been a significant driver for growth, with over 11.2 million applications for subsidies by the end of October, although the average daily subsidy scale dropped to 30,000 units in November due to the suspension of subsidies in various regions [5][11][20] Group 2: Intelligent Driving Developments - NVIDIA has officially open-sourced its new Vision-Language-Action (VLA) model, Alpamayo-R1, marking a significant shift in autonomous driving technology from mere behavior imitation to deeper causal reasoning [6][33] - The model's dataset, approximately 100TB in size, has been uploaded to the open-source community, indicating a move towards more accessible high-end autonomous driving models [6][33] Group 3: Investment Recommendations - The report suggests focusing on companies with strong positions in automotive intelligence and leading software capabilities, including Rui Ming Technology, Dao Tong Technology, Hei Zhi Ma Intelligent, Horizon Robotics, and others [8][39]
英伟达把自动驾驶核心技术公开了,吴新宙牵头研发,VLA大模型和海量数据免费用
3 6 Ke· 2025-12-03 10:52
Core Insights - NVIDIA has officially released and open-sourced its new Vision-Language-Action (VLA) model, Alpamayo-R1, and plans to open-source some core datasets in future updates [1][2]. Group 1: Model and Dataset Release - The corresponding dataset for the Alpamayo-R1 model has been uploaded to the open-source community, totaling approximately 100TB, marking NVIDIA's first open-source VLA model [2]. - The dataset can be used for both commercial and non-commercial purposes, allowing companies with limited VLA technology experience to quickly engage in VLA development [2]. Group 2: Technological Advancements - The introduction of Alpamayo-R1 signifies a shift in autonomous driving technology from mere "behavior imitation" to a new stage of deep "causal reasoning" [4]. - Alpamayo-R1 addresses critical safety issues in long-tail scenarios, achieving a 12% improvement in planning accuracy compared to baseline models and reducing off-road accident rates by 35% [5]. Group 3: Model Architecture and Training - The model employs a modular and efficient architecture that balances "slow thinking" and "fast action," driven by NVIDIA's Cosmos-Reason visual language model for complex environmental understanding [13]. - A new training strategy incorporating reinforcement learning (RL) has significantly improved reasoning quality by 45% and reasoning-action consistency by 37% [17]. Group 4: Industry Impact - The open-sourcing of Alpamayo-R1 and its dataset may lead to a reshaping of the autonomous driving industry, lowering entry barriers for small and medium-sized companies and research institutions [19]. - This move reflects NVIDIA's "soft and hard integration" strategy, showcasing the model's performance reliant on NVIDIA's powerful GPU capabilities and the Cosmos framework [19].
清华最新SOTA!ArbiViewGen:自监督框架实现多车型任意视角可控图像生成~
自动驾驶之心· 2025-08-10 23:32
点击下方 卡片 ,关注" 自动驾驶之心 "公众号 戳我-> 领取 自动驾驶近30个 方向 学习 路线 今天自动驾驶之心为大家分享 清华 最新的工作! ArbiViewGen:自监督框架实现多车型任意视点可控图像生成,性能达SOTA! 如 果您有相关工作需要分享,请在文末联系我们! 自动驾驶课程学习与 技术交流群加入 ,也欢迎添加小助理微信AIDriver005 >>自动驾驶前沿信息获取 → 自动驾驶之心知识星球 论文作者 | Yatong Lan等 编辑 | 自动驾驶之心 写在前面 & 笔者的个人理解 任意视角图像生成 在自动驾驶领域具有重要潜力,但由于缺乏外推视角的真实数据,这阻碍了高保真生成模型的训练,因此仍然是一个具有挑战性的任务。 在本工作中,我们提出了 ArbiViewGen ,一个基于扩散的新框架,用于从任意视角点生成可控的相机图像。为了解决未见视角中缺乏真实数据的问题,我们引入 了两个关键组件: 特征感知自适应视角拼接(FAVS) 和 跨视角一致性自监督学习(CVC-SSL) 。 FAVS 采用分层匹配策略,首先使用相机姿态建立粗略几何对应关系,然后通过改进的特征匹配算法进行细粒度对齐,并通过聚 ...