Workflow
人形机器人也要“进校学习”?数据采集成必答题
2 1 Shi Ji Jing Ji Bao Dao·2025-07-16 13:53

Core Viewpoint - The scarcity of real-world data is a significant constraint on the development of the embodied intelligence industry, and data collection centers may provide a solution to this issue [1][4]. Group 1: Data Collection Initiatives - Dematech and Zhiyuan Robotics have established the world's first logistics training factory for humanoid robots to collect data in real logistics scenarios [1]. - The Hefei City humanoid robot data collection pre-training site was launched in June, and the Pacini humanoid super data factory began operations this year [1][3]. - The establishment of data collection centers has accelerated since the second half of last year, with companies like Zhiyuan Robotics and Pacini leading the way [3]. Group 2: Data Collection Challenges - The humanoid robots require extensive data for training, with a single scenario potentially needing millions of data points, but the industry lacks high-quality, standardized data [4]. - Two main approaches to overcome the data scarcity have emerged: generating simulation data for training and building large-scale data collection centers for high-quality real-world data [4]. - The industry is currently facing challenges such as hardware solutions not being standardized and the issue of data silos, which increases data collection costs [7][8]. Group 3: Government Involvement - Local governments are also investing in data collection centers, with initiatives like the national and local co-built humanoid robot innovation centers [5]. - Government-led data collection centers typically serve as public service platforms, with collected data being made available to local robot companies once sufficient data is accumulated [5]. Group 4: Market Dynamics - The humanoid robot industry is expected to see significant data collection activity in the next two years, particularly in industrial applications [7]. - A complete data collection solution typically includes robots, hardware, software, cloud data processing services, and model training platforms, with costs ranging from 400,000 to 500,000 yuan [5].