具身智能迎来数据革命!它石智航发布WIYH数据集,比特斯拉Optimus领先半年
具身智能之心·2025-10-11 10:00

Core Insights - The article highlights the launch of the world's first large-scale real-world embodied VLTA (Vision-Language-Tactile-Action) multimodal dataset, World In Your Hands (WIYH), by the company Itstone Intelligent, marking a significant advancement in the embodied intelligence industry [1][6] - The WIYH dataset aims to address the challenges of data quality and availability in training large models, which have traditionally relied on inconsistent internet data and limited simulation data [1][3] Summary by Sections Dataset Features - The WIYH dataset is characterized by four main features: 1. Realism: Data is collected from actual embodied tasks, aligning with real-world applications [3] 2. Richness: It spans multiple industries and operational skills, enhancing the model's transfer and generalization capabilities [3] 3. Comprehensiveness: It includes multimodal data covering vision, language, touch, and action, facilitating pre-training alignment [3] 4. Volume: The dataset's scale is comparable to that of large language models, ensuring the future potential of embodied intelligence [3][4] Unique Advantages - The WIYH dataset offers three unique advantages: 1. Modal Integrity: It synchronously captures visual, tactile, and action data using proprietary collection equipment, ensuring precise temporal and spatial alignment [4] 2. Data Annotation: High-precision annotations are completed using the company's cloud-based foundational model, covering various granular truth labels for comprehensive supervision signals [4] 3. Collection Environment: Data is gathered in real-life operational settings, significantly enhancing authenticity, diversity, and generalization while reducing collection costs by an order of magnitude [4] Future Implications - The establishment of the WIYH dataset signifies the creation of a human-centric embodied data paradigm, enabling the pre-training of embodied AI models for real-world applications [6] - The dataset is expected to facilitate the transition from single-task applications to models with general operational capabilities, laying a solid foundation for the integration of embodied robots into various industries [6] - The company plans to make the WIYH dataset publicly available by December 2025, inviting research institutions and partners to collaborate in building a thriving ecosystem for embodied intelligence [6]