Core Viewpoint
- Data has become a core production factor driving industrial transformation, and high-quality datasets are essential for unlocking its value. Guangdong is working to build a new high ground for digital and intelligent development by hosting its first high-quality dataset innovation competition [1].

Group 1: Competition Overview
- Guangdong's first high-quality dataset innovation competition was launched on December 2, 2025, in Dongguan, with the theme "Data Gathering in the Bay Area, Intelligent Creation for the Future" [1].
- The competition uses a "challenge and leaderboard" mechanism to promote the discovery, supply, circulation, innovative application, and commercialization of high-quality datasets, with the goal of accelerating the digital transformation of the Guangdong-Hong Kong-Macao Greater Bay Area [1][2].
- It focuses on key sectors such as industrial manufacturing, healthcare, technological innovation, urban governance, and transportation, and aims to produce reusable high-quality datasets for AI model training and industry applications [2].

Group 2: Key Participants and Structure
- The first batch of high-quality dataset challenges was announced. It covers major sectors such as energy, biomedicine, finance, and education, with participation from organizations including State Grid, Guangzhou Laboratory, and Ping An Insurance [4].
- The competition is organized as "1 leaderboard mechanism + 3 competition phases + N supply-demand matchmaking events," forming a closed loop from data supply through technological research to industry upgrading [4].
- The event aims to replicate and scale up mature data application scenarios while exploring the potential of emerging fields such as the low-altitude economy and the industrial internet [4].

Group 3: Expert Insights
- Experts highlighted that data preprocessing, annotation, synthesis, and quality assessment are critical steps in building high-quality datasets that can effectively support AI model training and application [4][5].
- Organizations including Baidu and China Telecom are collaborating to establish standardized production processes and quality certification for high-quality datasets, addressing challenges in data collection and compliance [5].
- Through sustained investment and practical experience, Guangdong's high-quality dataset construction is moving from isolated breakthroughs to broad-based development, providing solid data support for innovation in the AI industry [5].
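Guangdong Officially Posts Its First Batch of High-Quality Dataset Challenges, Exploring New Paths for Converting Data into Value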
21 Shi Ji Jing Ji Bao Dao (21st Century Business Herald) · 2025-12-03 12:31