触觉语言模型（DOVE） - filings, earnings calls, financial reports, news

触觉语言模型（DOVE）

Search documents

国内首篇仿生触觉新突破！清华丁文伯团队研发“鸽眼”传感器，让机器人感知逼近人类！

机器人圈· 2026-01-21 09:34

Core Insights - The article discusses the development of a biomimetic multimodal tactile sensor (SuperTac) that enhances robotic perception by integrating multiple sensing modalities, inspired by the sensory capabilities of pigeons [3][4]. Group 1: Research Background - Current tactile sensing technologies, including electronic skin and vision-tactile sensors, have significant limitations in resolution, signal interpretation, and multi-modal integration [4]. - Existing tactile systems struggle with weak signal interpretation capabilities, making it difficult for robots to achieve human-like tactile cognition and interaction [4]. Group 2: Research Contributions - The SuperTac system consists of three core components: the biomimetic multimodal tactile sensor, a data processing and feature extraction module, and a tactile language model (DOVE) for signal interpretation [4][5]. - The system architecture employs a layered design for end-to-end processing from physical signal acquisition to semantic understanding [4][7]. Group 3: Biological Inspiration - The design of SuperTac is inspired by the pigeon’s visual system, utilizing its diverse retinal cell types to expand the sensor's spectral perception range [6]. - The sensor mimics the pigeon’s ability to perceive magnetic fields, integrating non-imaging sensing capabilities into the tactile domain [6]. Group 4: Design and Testing - The tactile skin of SuperTac features a four-layer structure with a total thickness of only 1mm, utilizing advanced materials for effective signal distribution and transparency [9][10]. - The data processing algorithms achieve high accuracy in detecting force and position, with a mean square error of 0.056mm for position detection and 0.0004N for force detection [12]. Group 5: Semantic Understanding and Reasoning - The DOVE model integrates tactile features into a semantic space, enabling natural language descriptions and reasoning of tactile information [14]. - The model is trained using a three-phase strategy, resulting in a robust framework for multi-modal feature representation and reasoning [14]. Group 6: Future Outlook - The research opens multiple promising directions for robotic tactile perception, including sensor miniaturization and the development of low-power decoding chips [18]. - Future work will focus on enhancing the DOVE model's performance across different sensor designs and datasets, aiming to bridge the perceptual gap between robots and humans [18].