Workflow
视触融合
icon
Search documents
人形机器人做汉堡火了! 伯克利等全新ViTacFormer让机器人操作稳如老手
机器之心· 2025-07-10 06:07
Core Viewpoint - The article discusses the advancements in humanoid robots, particularly focusing on the ViTacFormer framework that integrates visual and tactile information for dexterous manipulation tasks, showcasing its potential to revolutionize kitchen automation and other complex tasks [1][4][24]. Group 1: Technology and Innovation - The ViTacFormer framework is designed to enhance precision, stability, and continuous control in dexterous manipulation by combining visual and tactile data with a predictive mechanism for future tactile feedback [4][11]. - The system utilizes a dual-arm robot setup equipped with advanced tactile sensors and cameras to gather real-time data during operations, allowing for a comprehensive understanding of contact dynamics [13][14]. - ViTacFormer employs a cross-modal attention mechanism and an autoregressive tactile prediction branch, enabling the model to anticipate future contact states, thus improving action generation and overall task performance [9][11][24]. Group 2: Experimental Validation - The performance of ViTacFormer was evaluated through various short-range dexterous manipulation tasks, demonstrating a significant improvement in success rates, with an average increase of over 50% compared to existing methods [22][24]. - In a long-duration task simulating the complete process of making a hamburger, ViTacFormer achieved a continuous operation time of approximately 2.5 minutes with an overall success rate exceeding 80%, highlighting its effectiveness in complex, multi-stage tasks [28].