ChatGPT - 5.2
Search documents
港科大教授实测AI眼镜“作弊”:30分钟碾压95%的学生,把传统教学评估体系整破防了
量子位· 2026-01-06 07:06
Core Viewpoint - The article discusses an experiment conducted at Hong Kong University of Science and Technology where an AI-powered glasses equipped with ChatGPT-5.2 took a final exam, achieving a score of 92.5, outperforming over 95% of human students, raising questions about the validity of traditional educational assessment methods [1][4][6]. Group 1: Experiment Overview - The AI glasses, developed by a team led by Professors Zhang Jun and Meng Zili, were designed to "cheat" in a controlled exam setting for the course "Computer Network Principles" [7]. - The AI glasses utilized a sophisticated process where questions were captured via a camera, sent to the cloud for processing, and the answers were displayed back on the glasses for the student to transcribe [12][14]. - The AI achieved full marks in multiple-choice and single-page short answer questions, and scored 45.5 out of 53 in multi-page short answer questions, demonstrating strong reasoning capabilities [14]. Group 2: Hardware and Software Selection - The project team evaluated 12 mainstream smart glasses and selected Rokid Glasses due to their superior SDK and ecosystem, which allowed for better integration with the AI model [8][10][11]. - The choice of ChatGPT-5.2 was based on its strong response speed and general knowledge capabilities, making it suitable for the exam context [11]. Group 3: Implications for Educational Assessment - The experiment highlighted the limitations of traditional educational assessments, which focus primarily on the final answer rather than the learning process [21][46]. - As AI becomes proficient in standardized testing, the relevance of current assessment methods is called into question, particularly regarding their ability to measure deeper learning and critical thinking skills [22][32][42]. - The article suggests a shift in assessment focus from merely providing answers to evaluating reasoning processes and decision-making quality, which are harder for AI to replicate [38][48].