NVIDIA Thor芯片

Search documents
车企、科技企业VLA研发进展
Zhong Guo Qi Che Bao Wang· 2025-08-13 01:33
Group 1: Li Auto - Li Auto's i8 features the VLA "driver model," marking a significant advancement in intelligent driving following the previous VLM introduction [1] - The VLA model includes a newly designed spatial encoder that utilizes language models and logical reasoning to provide driving decisions, predicting trajectories of other vehicles and pedestrians through a diffusion model [1] - The inference frame rate of the VLA is approximately 10 Hz, more than tripling the previous VLM's rate of 3 Hz [1] Group 2: XPeng Motors - XPeng G7 officially commenced deliveries on July 7, with a clear timeline for the Ultra version's VLA and VLM software updates [2] - The VLA software OTA update is scheduled for September 2025, with VLM software upgrades following in November 2025, and personalized recommendations by December 2025 [2] - The XPeng G7 Ultra version is equipped with three self-developed Turing AI chips, boasting a total computing power of 2250 TOPS, positioning it as a leader among mass-produced models [2] Group 3: Chery Automobile - Chery plans to introduce the VLA and world model technology into fuel vehicles by 2025 through its Falcon 900 intelligent driving system, aiming to set a new benchmark for "oil-electric intelligence" [3] - The Falcon 900 system utilizes a self-developed VLA model that integrates visual perception, language understanding, and action execution [3] - The model has been trained on 20 million kilometers of real-world data, capable of understanding over 5000 traffic scenarios, achieving a 92% accuracy rate in recognizing non-standard traffic signals in complex urban conditions, a 37% improvement over traditional systems [3] Group 4: Geely Automobile - Geely is actively developing VLA technology, integrating it with world models to create a comprehensive world model system [4] - The Qianli Haohan system features a "dual end-to-end model" design, enabling a multi-modal VLA general scene model and an end-to-end model to back each other up [4] - This system is powered by dual NVIDIA Thor chips, with a total computing power of 1400 TOPS and over 40 perception units capable of detecting objects 0.75 meters in size from 300 meters away [4] Group 5: Yuanrong Qihang - Yuanrong Qihang is also investing in the VLA model, with five models expected to feature it by the third quarter of this year [5] - The company was among the earliest to publicly announce its VLA development in June of last year [5] - The VLA model focuses on defensive driving with four core functions: spatial semantic understanding, recognition of irregular obstacles, comprehension of text-based guide signs, and voice control of the vehicle, which will be gradually released with mass production [5]