Just In: AI Scientist Zochi Earns Its "PhD" at ACL, Beta Testing Launches Today
机器之心·2025-05-29 04:53

Core Viewpoint
- Intology's AI scientist Zochi has become the first AI system to independently pass peer review at a top-tier scientific conference, the ACL main conference, which the article presents as a significant milestone in AI research capabilities [1][3][5].

Group 1: AI Research Achievements
- Zochi's paper, "Tempest: Automatic Multi-Turn Jailbreaking of Large Language Models with Tree Search", has been accepted at ACL 2025, showcasing its ability to conduct independent scientific research [8][11].
- The acceptance rate for main conference papers at top-tier conferences like ACL is around 20%, which makes the result particularly noteworthy [3].
- The paper's multi-turn attack method reached a 100% success rate on GPT-3.5-turbo and a 97% success rate on GPT-4 [11].

Group 2: Methodology and Innovation
- The research used a tree search method to autonomously explore multiple adversarial prompt branches, integrating cross-branch learning and partial compliance tracking [9].
- Human intervention was minimal and limited mainly to formatting and creating figures; Zochi independently defined the research direction and conducted the experiments [8][9].
- Another method from the system, CS-ReFT, achieved a 93.94% success rate in model adaptation while using only 0.0098% of parameters, surpassing GPT-3.5-Turbo [21].

Group 3: Industry Impact and Criticism
- The acceptance of Zochi's work has sparked discussion in the AI academic community about the implications of AI-generated research and the integrity of the peer review process [16][17].
- Intology was criticized for its practices: other teams, such as Sakana, had informed conference organizers in advance about their AI-generated submissions, which raised concerns about Intology's transparency [16][17].
- Zochi's continuous output of high-quality research papers, with scores significantly above the average for AI-generated submissions, emphasizes its advanced capabilities in tackling complex scientific challenges [23].
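
The tree search summarized under Group 2 (several branches explored in parallel, learning shared across branches, partial progress tracked per branch) can be illustrated on a harmless toy problem. The sketch below is not the Tempest implementation and involves no language model: the numeric goal, the `progress` scoring function, and the names `Node`, `tree_search`, `beam_width`, and `branching` are all assumptions made for illustration.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Node:
    path: tuple    # moves taken so far on this branch
    score: float   # partial progress toward the goal, in [0, 1]


def progress(path, target):
    """Toy partial-progress score: 1.0 when the moves sum to the target."""
    return max(0.0, 1.0 - abs(target - sum(path)) / target)


def tree_search(target, moves, beam_width=2, branching=3, max_depth=10):
    # Shared memory across branches: how often each move improved a score.
    shared = Counter()
    frontier = [Node(path=(), score=0.0)]
    for _ in range(max_depth):
        children = []
        for node in frontier:
            # Prefer moves that have helped on any branch so far
            # (hypothetical stand-in for cross-branch learning).
            ordered = sorted(moves, key=lambda m: -shared[m])[:branching]
            for move in ordered:
                path = node.path + (move,)
                score = progress(path, target)
                if score > node.score:
                    shared[move] += 1  # record the partial gain
                children.append(Node(path, score))
        # Keep only the most promising branches for the next round.
        children.sort(key=lambda n: n.score, reverse=True)
        frontier = children[:beam_width]
        if frontier[0].score == 1.0:
            break
    return frontier[0]


best = tree_search(target=10, moves=[1, 2, 5, -3])
print(best.path, best.score)
```

In this toy setting the search keeps the two best partial branches at each depth and stops as soon as one branch reaches a score of 1.0. A real system would replace `progress` with a learned or model-based judge, which is outside the scope of this sketch.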