具身智能(embodied AI)
Search documents
深度|AI教母李飞飞:在AI时代,信任绝不能外包给机器,它本质上属于人类,存在于个体、社区与社会层面
Z Potentials· 2025-12-12 04:15
图片来源: Masters of Scale Z Highlights 李飞飞曾是 Stanford Human-Centered AI Institute 的创始主任,如今是 World Lab 的联合创始人兼首席执行官,在空间智能领域引领创新。本次访谈是 2025 Masters of Scale Summit 的一部分,她与主持人 Reid Hof man 探讨了空间智能新阶段。 从语言边界到世界建模: AGI 与空间智能的下一篇章 Ried : 大家好,本周我们邀请到了李飞飞参与现场对谈,探讨 AI 的现状与未来走向。我们已经一起做了多次这样的特别分享,令人振奋又深感荣幸,感 谢您的到场。 显而易见,所有关注你职业轨迹的人都知道,你是这波 AI 浪潮中的 OG :从 ImageNet 等奠基性工作开始,你的贡献奠定了今日的基础 —— 在此先向你 致谢。如今你正专注于空间智能( spatial intelligence )与世界建模( world building )。我想请你分享:是什么促使你从自己深爱的工作 —— 在 Stanford CS 和 Human-Centered AI 的岗位上暂时休假, ...
理想汽车
数说新能源· 2025-11-27 02:03
Company Strategy Choices - The company will return to an entrepreneurial organizational model led by the founding team starting from Q4 2025, abandoning the professional management model attempted over the past three years. This decision is based on the rapidly changing industry technology and competitive environment, as well as the founder's extensive experience in startups [18][19]. - The product direction will focus on embodied AI robots rather than just electric vehicles or smart devices. This choice is made to avoid competition based solely on parameters like range and price, and to address user needs in high-frequency life scenarios [18][19]. Technical Route Selection - The company will build a full-stack AI system oriented towards the physical world instead of a language model route. Key breakthroughs will focus on enhancing perception capabilities with 3D Vision Transformers, which could increase effective perception range by 2-3 times [19][20]. - The model layer will aim to optimize the operating frequency of models, with a target to increase the current 10Hz frequency of a 4 billion parameter MOE model by 2-3 times, requiring customized GPU architecture and operating systems [20]. - The hardware layer will develop the Drive Biowire system to reduce the response time from 550 milliseconds to 350 milliseconds, potentially lowering accident rates by over 50% [21]. Q3 2025 Financial and Operational Data - Total revenue for Q3 was 27.4 billion RMB, a year-on-year decrease of 36.2% and a quarter-on-quarter decrease of 9.5%. Vehicle sales revenue was 25.9 billion RMB, down 37.4% year-on-year and 10.4% quarter-on-quarter [22]. - The overall gross margin was 16.3%, down 5.2 percentage points year-on-year and 3.8 percentage points quarter-on-quarter. Excluding recall costs, the gross margin was 20.4% [23]. - The net loss for the quarter was 624.4 million RMB, compared to a net profit of 2.8 billion RMB in the same quarter last year [26]. Product and Technology Progress - The I series models (I8/I6) are positioned to cover mainstream and high-end family markets, with significant order growth since September. Production capacity is expected to increase to about 20,000 units per month by early 2026 [30]. - The VLA system has been fully deployed, enhancing path selection at complex intersections, with further upgrades planned to improve safety and perception capabilities [44]. Market Strategy and Response - The company anticipates a significant drop in deliveries in Q1 2026 due to consumers rushing to take advantage of policy incentives before they expire. Long-term strategies include ensuring all models meet new energy consumption standards to qualify for subsidies [33][40]. - The company plans to operate approximately 4,800 supercharging stations by 2026, with 35% located in highway service areas, to enhance user experience and support the transition to new energy vehicles [40].
图片生成仿真!这个AI让3D资产「开箱即用」,直接赋能机器人训练
量子位· 2025-11-23 04:09
Core Insights - The article introduces PhysX-Anything, the first framework for generating 3D assets with physical properties directly from a single image, aimed at enhancing embodied AI and robotics applications [5][27][28]. Group 1: Framework Overview - PhysX-Anything allows for the generation of high-quality, sim-ready 3D assets that include explicit geometric structures, joint movements, and physical parameters, addressing the limitations of existing 3D generation methods [5][6]. - The framework employs a "coarse-to-fine" generation approach, utilizing multiple dialogue rounds to create both global physical descriptions and detailed geometric information from a single image [8][14]. Group 2: Technical Innovations - A novel 3D representation method is introduced, achieving a compression ratio of 193 times while retaining geometric structure, inspired by voxel representation [9][27]. - The framework utilizes a tree-structured, VLM-friendly format to enhance the richness of physical attributes and textual descriptions, facilitating better understanding and reasoning by the VLM [12]. Group 3: Performance Evaluation - PhysX-Anything outperforms existing methods like URDFormer and PhysXGen in both geometric and physical attribute metrics, demonstrating superior generalization capabilities [18][20]. - Human evaluations indicate that the generated structures from PhysX-Anything received the highest scores for both geometric and physical attributes, confirming its effectiveness [22]. Group 4: Practical Applications - The generated sim-ready 3D assets can be directly imported into simulators for various robotic strategy learning tasks, showcasing their practical utility in embodied intelligence applications [25][26]. - The framework is expected to drive a paradigm shift from "visual modeling" to "physical modeling" in 3D vision and robotics research [28].
苹果AI陷“信心危机”:又一位华人高管出走,转投Meta机器人团队
3 6 Ke· 2025-09-04 09:30
Core Insights - Apple's AI department is facing a significant talent exodus, with key members, including Jian Zhang, leaving for competitors like Meta and OpenAI, raising concerns about the company's AI strategy and product development [1][3][9] Group 1: Talent Exodus - Jian Zhang, head of the robotics AI research team, is among four key members who have confirmed their departure from Apple, joining Meta as the Director of Robotics Technology [3][8] - Since Meta initiated a talent acquisition strategy in Silicon Valley, Apple has lost at least 10 AI talents, including former Foundation Models team leader, Pang Ruoming, who reportedly received a compensation package exceeding $200 million from Meta [3][9] - The ongoing talent drain is occurring during a critical phase for Apple's AI initiatives, particularly the development of the "Apple Intelligence" suite and the next-generation Siri, leading to uncertainty in technology development and product rollout [3][9] Group 2: Strategic Challenges - Apple's AI department is experiencing a "crisis of confidence" due to internal discussions about relying on third-party models for the next-generation Siri instead of fully developing in-house solutions, which has frustrated engineers [9][10] - The departure of Pang Ruoming, viewed as a pivotal figure in Apple's AI strategy, has weakened team morale and internal confidence, exacerbating the ongoing talent loss [10][12] - The restructuring of Siri's "V2" architecture has been described as chaotic, delaying advanced features until at least 2026, further contributing to the team's dissatisfaction [12][14] Group 3: Cultural and Organizational Issues - Apple's closed culture and bureaucratic processes are seen as hindrances to innovation, contrasting with the open collaboration that characterizes the AI industry [14][15] - The AI and machine learning teams are perceived as "second-class citizens" within the company, facing resource allocation challenges compared to hardware and product departments, which intensifies internal tensions [14][15] - To address these issues, Apple needs to enhance its incentive mechanisms, upgrade its data center infrastructure, streamline internal processes, and consider a more open approach to research collaboration [15]