全国已建设高质量数据集总体量超500PB
Xin Hua She·2025-12-04 14:24

Core Insights - The total volume of high-quality datasets in China has exceeded 500PB as of the end of September, contributing to the integration of artificial intelligence across various industries [1] Group 1: Data Development - The National Data Bureau, in collaboration with multiple departments, has established policy documents aimed at promoting the construction of high-quality datasets with a focus on application scenarios [1] - A total of 140 pilot tasks have been deployed to create a favorable environment for the construction and application of high-quality datasets alongside AI [1] Group 2: Industry Impact - As of the end of September, China has established 7 data labeling bases, attracting and nurturing 362 labeling companies, with a workforce of 85,000 in the labeling sector [1] - The data labeling industry has generated a related output value of 16.3 billion [1] - The daily Token consumption in China has surpassed 40 trillion, marking an increase of approximately 400 times compared to early 2024 [1]

全国已建设高质量数据集总体量超500PB - Reportify