Workflow
130多天后再谈AI!李想透露实现VLA的三个阶段,回应“智驾”是否该叫停

Group 1 - The core idea presented by Li Xiang is that the true breakthrough of artificial intelligence (AI) will occur when it becomes a production tool, similar to how humans employ drivers [2][6] - Li Xiang emphasizes that the VLA (Vision-Language-Action) model developed by the company represents a significant advancement in AI, allowing for natural language communication with the driver agent and improved decision-making capabilities [2][3] - The VLA model is described as a combination of end-to-end and visual language models, enabling better handling of complex traffic scenarios compared to previous models [3][4] Group 2 - The evolution of the VLA model is outlined in three stages: starting from rule-based algorithms, progressing to end-to-end + VLM, and finally reaching the VLA stage, which aims to emulate human intelligence [4][6] - Li Xiang asserts that the VLA model is currently the most capable architecture, although its implementation poses significant challenges due to the increased complexity and hardware requirements [6][7] - The industry consensus indicates that the VLA model could serve as a critical bridge in the transition from L2 driver assistance to L4 autonomous driving, highlighting its potential impact on the future of intelligent driving [6]