National Data Administration: As of the end of June 2025, China had built high-quality datasets totaling more than 400PB
Xin Hua Cai Jing·2025-08-14 07:34

Group 1
- As of the end of June 2025, China had built over 35,000 high-quality datasets totaling more than 400PB, approximately 140 times the digital resources of the National Library of China [1]
- The cumulative transaction value of high-quality datasets nationwide reached nearly 4 billion yuan, and the high-quality datasets listed by data trading institutions totaled 246PB [1]
- In Beijing, the share of high-quality datasets in total transactions rose from 10% last year to nearly 80% now [1]

Group 2
- China's rapid development in artificial intelligence is closely linked to its emphasis on data work; China was the first country to treat data as a factor of production [2]
- Most model training in China now uses Chinese data: over 60% of the data used to train domestic models is Chinese, and for some models the share reaches 80% [2]
- The capacity to develop and supply high-quality Chinese data continues to improve, driving rapid gains in the performance of AI models in China [2]