Core Viewpoint - The article discusses the shift in content consumption habits towards efficiency, particularly in the context of AI models summarizing information for users, indicating a leap in human capability in the AI era [1][2]. Group 1: AI Model Utilization - Andrej Karpathy has adopted a habit of using large language models (LLMs) to read and summarize information, reflecting a broader trend among users [1][2]. - Karpathy initiated a project that combines four of the latest LLMs into a council to provide diverse insights and evaluations [3][4]. Group 2: LLM Council Mechanism - The LLM council operates as a web application where user questions are distributed among multiple models, which then review and rank each other's responses before a "Chairman LLM" generates the final answer [4][11]. - The council's process includes three stages: initial responses from each model, mutual evaluation of those responses, and final output generation by the chairman model [8][9][11]. Group 3: Model Performance and Evaluation - The models exhibit a willingness to acknowledge superior responses from other models, creating an interesting evaluation dynamic [6][7]. - In evaluations, GPT 5.1 was noted for its rich insights, while Claude was consistently rated lower, although subjective preferences varied among users [7]. Group 4: Future Implications and Open Source - The LLM council's design may represent a new benchmark for model evaluation, with potential for further exploration in multi-model integration [12][13]. - Karpathy has made the project open source, inviting others to explore and innovate upon it, although he will not provide support for it [14][15].
Karpathy组建大模型「议会」,GPT-5.1、Gemini 3 Pro等化身最强智囊团
机器之心·2025-11-23 04:06