Gemini Robotics 1.5系列

Search documents
首款推理具身模型,谷歌DeepMind造!打破一机一训,零样本迁移
具身智能之心· 2025-09-28 01:05
点击下方 卡片 ,关注" 具身智能之心 "公众号 作者丨机器之心 >> 点击进入→ 具身智能之心 技术交流群 更多干货,欢迎加入国内首个具身智能全栈学习社区 : 具身智能之心知识星球 (戳我) , 这里包含所有你想要的。 全球首个具备模拟推理能力的具身模型来了! 谷歌DeepMind正式发布 新一代通用机器人基座模型 ——Gemini Robotics 1.5系列。 它不止于对语言、图像进行理解,还结合了视觉、语言与动作 (VLA) ,并通过具身推理 (Embodied Reasoning) 来实现"先思考,再 行动"。 这一系列由两大模型组成: 其中,ER代表"具身推理"。 这意味着GR-ER 1.5是全球首个具备模拟推理能力的具身模型。 Gemini Robotics 1.5 (GR 1.5) :负责动作执行的多模态大模型; Gemini Robotics-ER 1.5 (GR-ER 1.5) :强化推理能力,提供规划与理解支持。 不过, GR-ER 1.5并不执行任何实际操作 ,GR 1.5正是为执行层而生。 两者结合,能让机器人不仅完成"折纸、解袋子"这样的单一动作,还能解决"分拣深浅色衣物"甚至"根 ...
首款推理具身模型,谷歌DeepMind造!自主理解/规划/执行复杂任务,打破一机一训,还能互相0样本迁移技能
量子位· 2025-09-27 04:46
Core Viewpoint - Google DeepMind has launched the Gemini Robotics 1.5 series, marking a significant milestone in the development of general AI for real-world applications, featuring embodied reasoning capabilities that allow robots to "think before acting" [1][9]. Group 1: Model Composition - The Gemini Robotics 1.5 series consists of two main models: GR 1.5 for action execution and GR-ER 1.5 for embodied reasoning [2][8]. - GR-ER 1.5 is the world's first embodied model with simulated reasoning capabilities [3]. Group 2: Functional Capabilities - The combination of GR-ER 1.5 and GR 1.5 enables robots to perform complex multi-step tasks, such as sorting clothes by color or packing luggage based on weather conditions [5][6]. - GR 1.5 can adapt to various robot hardware, allowing a single model to operate across different platforms without the need for separate training [16][18]. Group 3: Motion Transfer Mechanism - The innovative "Motion Transfer" mechanism allows skills learned on one robot to be transferred to another, enhancing cross-platform functionality [21][48]. - This mechanism abstracts different robot actions into a unified semantic space, enabling seamless skill sharing across diverse hardware [56]. Group 4: Safety and Explainability - The GR 1.5 series enhances safety by allowing robots to self-correct during tasks and recognize potential risks, ensuring safe operation in human environments [34][36]. - The embodied reasoning model provides transparency in the robot's decision-making process, improving interpretability and trust [55][58]. Group 5: Performance Metrics - In benchmark tests, GR 1.5 outperformed previous models in various dimensions, including instruction generalization and task completion rates, achieving nearly 80% in long-sequence tasks [61][62]. - The model demonstrated unprecedented zero-shot transfer capabilities in cross-robot migration tests [63]. Group 6: Future Developments - The GR 1.5 series represents a shift from executing single commands to genuinely understanding and solving physical tasks [69]. - Currently, developers can access GR-ER 1.5 through Google AI Studio, while GR 1.5 is available to select partners [71].