CASBOT W1
Search documents
突破视觉-语言-动作模型的瓶颈:QDepth-VLA让机器人拥有更精准的3D空间感知
机器之心· 2025-11-26 07:07
为此,由中国科学院自动化研究所与灵宝 CASBOT 共同提出了 QDepth-VLA —— 一种结合量化深度预测(Quantized Depth Prediction) 的 3D 信息增强型 VLA 模型。它通过独立的 Depth Expert 模块来学习离散化的深度表示。这种设计在保持原有 语义对齐能力的同时,显著提升了机器人在复杂操作场景下的空间推理与操控精度。 视觉-语言-动作模型(VLA)在机器人操控领域展现出巨大潜力。通过赋予预训练视觉-语言模型(VLM)动作生成能力, 机器人能够理解自然语言指令并在多样化场景中展现出强大的泛化能力。然而,这类模型在应对长时序或精细操作任务时, 仍然存在性能下降的现象。 这种现象的根源在于,模型虽具备语义理解能力,却缺乏对三维空间的几何感知与推理能力,导致其难以准确捕捉如机械臂 夹爪与物体之间相对位置关系等关键三维信息。 论文标题:QDepth-VLA: Quantized Depth Prediction as Auxiliary Supervision for Vision–Language–Action Models 论文链接: https://arxiv.o ...
跳街舞、打拳击、当服务员......数百款机器人亮相WAIC“秀绝技”
Hua Er Jie Jian Wen· 2025-07-27 12:33
Core Insights - The 2025 World Artificial Intelligence Conference (WAIC) in Shanghai showcased over 150 humanoid robots, marking the largest collective display of humanoid robots in China to date, indicating a shift from mere exhibition to practical applications in various sectors [1] - The event highlighted advancements in humanoid robots, which are now capable of performing tasks such as cooking, sorting materials, and security inspections, demonstrating their potential as real-world "producers" rather than just performers [1] Group 1: Humanoid Robot Innovations - The Galbot by Galaxy General, a quadruped robot, received the "Treasure of the Museum" title for its practical applications, including precise sorting and self-correction capabilities in a simulated automotive factory [3] - Star Motion Era introduced three versatile robots: L7, capable of dancing and sorting packages; XHAND1, a dexterous robotic hand; and Q5, a humanoid service robot that can provide guidance and perform various tasks [5] - The "Jueying X30" from Cloud Deep Technology showcased its ability to perform high-risk inspections, highlighting the feasibility of quadruped robots in replacing human labor in hazardous environments [7] Group 2: Market Trends and Orders - The humanoid robot industry is expected to transition from a technology-driven phase to a commercial phase by the second half of 2025, with market sentiment shifting towards orders and deliveries [15] - Significant orders were placed during WAIC, including a 124 million yuan order from China Mobile and various contracts from automotive manufacturers for material handling and assembly tasks [15] - The industry is experiencing rapid growth, with estimates suggesting an average growth rate of 50% to 100% in the first half of the year, driven by the increasing frequency of new robot releases [15]