China AI Large Model Evaluation Report Released: Eight Large Models Tested on Ethical Judgment for the First Time
Bei Ke Cai Jing · 2025-07-10 10:02
Core Insights
- The report indicates a significant increase in the use of AI large models in the media industry, with 96.27% of respondents having used them, up 22.9 percentage points from last year [2][4]
- Approximately half of the respondents frequently use these models, and around 80% believe that they enhance work efficiency [7]
- However, there is a rising concern about errors and biases in AI outputs, with 96% of respondents encountering such issues at least once a week, an increase of 7 percentage points [9][11]

Group 1: Usage and Satisfaction
- The proportion of respondents using large models has risen across all age groups, with the highest increase among those aged 45 and above, which rose by 41.98 percentage points to 95.83% [6]
- The survey revealed that about half of the respondents use large models regularly, with only 7.74% using them infrequently [7]
- Satisfaction with the multi-modal capabilities of large models remains low, particularly in creating multimedia content, indicating a need for improvement [8]

Group 2: Concerns and Ethical Considerations
- The report highlights that the most significant concern among respondents is the generation of false news due to hallucination issues, with 99.37% expressing worry [11]
- There is a notable increase in concerns regarding data privacy, with approximately 95.6% of respondents worried about this issue, up 9.17 percentage points [11]
- The assessment introduced ethical judgment as a new dimension, revealing that some models exhibited inappropriate behavior during testing [3][12]

Group 3: Performance Evaluation
- The evaluation tested eight mainstream large models across five core dimensions, with Tongyi, Xunfei Xinghuo, Wenxin Yiyan, and Tencent Yuanbao scoring above 7500 points, ranking first to fourth [13][14]
- The models demonstrated significant value in information retrieval, text generation, and translation, although their long-text processing capabilities still require improvement [16]
- The report emphasizes the potential of large models in the media industry, while also acknowledging the challenges of misinformation detection and ethical safety [16]