Workflow
PlanAgent
icon
Search documents
快速结构化深度了解理想AI/自动驾驶/VLA手册
理想TOP2· 2025-10-10 11:19
Core Insights - The article discusses the evolution of Li Xiang's vision for Li Auto, emphasizing the transition from a traditional automotive company to an artificial intelligence (AI) company, driven by the belief in the transformative potential of AI and autonomous driving [1][2]. Motivation - Li Xiang considers founding Autohome as his biggest mistake, aiming for a venture at least ten times larger than it [1]. - The belief in the feasibility of autonomous driving and the industry's transformative phase motivated the establishment of Li Auto [1]. Timeline of Developments - In September 2022, Li Auto internally defined itself as an AI company [2]. - On January 28, 2023, Li Xiang officially announced the company's identity as an AI company [2]. - By March 2023, discussions around AI began, although initial understanding of concepts like pretraining and finetuning was limited [2]. - By December 2024, Li Xiang articulated five key judgments regarding AI's role and potential, emphasizing the importance of foundational models [2][3]. Key Judgments - Judgment 1: Li Xiang believes in OpenAI's five stages of AI, asserting that AI will democratize knowledge and capabilities [2]. - Judgment 2: The foundational model is seen as the operating system of the AI era, crucial for developing super products [2]. - Judgment 3: Current efforts are aimed at achieving Level 3 (L3) autonomous driving and securing a ticket to Level 4 (L4) [2][3]. - Judgment 4: The integration of large language models with autonomous driving will create a new entity termed VLA [3]. - Judgment 5: Li Auto aims to produce a car without a steering wheel within three years, contingent on the VLA foundational model and sufficient resources [3]. Technical Insights - The design and training of the VLA foundational model focus on 3D spatial understanding and reasoning capabilities [5][6]. - Sparse modeling techniques are employed to enhance efficiency without significantly increasing computational load [7]. - The model incorporates future frame prediction and dense depth prediction tasks to mimic human thought processes [8]. - The use of diffusion techniques allows for real-time trajectory generation and enhances the model's ability to predict complex traffic scenarios [10]. Reinforcement Learning - The company aims to surpass human driving capabilities through reinforcement learning, addressing previous limitations in model training and interaction environments [11]. Future Directions - Li Auto is actively developing various models and frameworks to enhance its autonomous driving capabilities, including the introduction of new methodologies for video generation and scene reconstruction [12][13].