Workflow
深度推理模型写高考英语作文谁更强?记者实测,名校英语教师点评
Bei Ke Cai Jing·2025-06-09 01:24

Group 1 - The 2025 Gaokao English exam in Beijing featured an essay prompt that tested AI language models on their ability to generate coherent and culturally relevant responses [1][2] - Six AI models were evaluated, including DeepSeek R1, ChatGPT o3, Tongyi Qianwen Qwen3, Tencent Hunyuan T1, iFlytek Xinghuo X1, and Baidu Wenxin X1, with scores provided by two English teachers based on established grading criteria [1][2] - The top-performing model was iFlytek Xinghuo X1, achieving an average score of 19.5, followed closely by DeepSeek R1 and Baidu Wenxin X1 [27][28] Group 2 - The evaluation highlighted that while all AI models addressed the essay prompt, there were significant differences in the depth of content, logical coherence, and precision of expression [27][28] - The AI-generated essays were noted for their innovative ideas and advanced vocabulary, surpassing typical student responses in terms of information integration and detail [28][29] - Recent updates to major AI models in April and May 2023 have improved their reasoning capabilities, enhancing their performance in tasks such as English writing [29]