Workflow
文化大模型
icon
Search documents
全国首个主流文化语料库上线,推动数字文化产业高质量发展
Qi Lu Wan Bao Wang· 2025-08-25 08:39
Group 1 - The core viewpoint of the news is the collaboration between Shandong Digital Culture Group and People’s Daily to establish a mainstream cultural corpus, which is essential for the training and application of large AI models in the context of rapid advancements in generative AI technology [1][2] - The mainstream cultural corpus will focus on high-quality, authoritative media resources and private cultural resources accumulated over the years, addressing the common issues of insufficient sensitive area data and low-quality core data in AI models [1][2] - The project aligns with national and provincial policies aimed at enhancing the quality of cultural data and supporting the development of AI in the cultural sector, as outlined in various government documents [1] Group 2 - The first phase of the mainstream cultural corpus will concentrate on excellent cultural resources from Shandong, with an initial offering of 50,000 Q&A pairs and 20 million basic data articles, while also developing high-quality datasets related to Confucius [2] - The Shandong Cultural Data Annotation Platform, developed by the group, will provide comprehensive services for data collection, cleaning, annotation, and enhancement, supporting various data types and enabling a closed-loop process from data collection to usage [4] - The platform will be open to the public for free, encouraging cultural institutions, universities, and enterprises to create their own high-quality datasets, while a cultural data trading platform will be launched to facilitate the circulation and monetization of cultural data assets [4]