Core Viewpoint
- The article discusses the recent leak of Claude's system prompt, which has sparked discussion of a new paradigm in large language model (LLM) learning, termed "system prompt learning" [1][3][20].

Summary by Sections

Claude System Prompt Leak
- The complete Claude system prompt was leaked. It contains 16,739 words (approximately 110 KB), far more than OpenAI's o4-mini system prompt, which has only 2,218 words, about 13% of Claude's length [8][12].
- The leaked prompt details Claude's behavior, preferences, and global problem-solving strategies, providing insight into how the model interacts with users [8][12].

New Learning Paradigm
- Karpathy argued that LLM learning is missing a major paradigm and proposed an approach called "system prompt learning," which mimics how humans accumulate experience [3][13].
- Under this paradigm, LLMs gain a form of "memory": they can reflect on user queries on their own and record general problem-solving knowledge and strategies [4][20].

Mechanism of System Prompt Learning
- The paradigm emphasizes editing the prompt directly rather than relying solely on reinforcement learning, so that an LLM can adjust and refine its response strategies based on real-time feedback [15][20].
- It mimics human learning, in which people remember the strategies that solved a problem and apply them in future situations [18][19].

Community Reactions
- The leak and the proposed paradigm have prompted intense discussion in the community, with some supporting the idea of adding a memory layer to enable system prompt learning [21][24].
- Others have raised concerns about the fundamental limitations of LLMs in continuous learning and the need for more effective thinking models [24].
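
The mechanism summarized under "Mechanism of System Prompt Learning" can be illustrated with a small sketch. This is not Karpathy's or Anthropic's implementation; it only assumes that learned strategies are kept as plain-text notes appended to a base system prompt. The names `PromptMemory`, `add_strategy`, `render`, and `reflect` are hypothetical, and `reflect` is a fixed template standing in for the model's own reflection step so that the example runs without any model.

```python
from dataclasses import dataclass, field


@dataclass
class PromptMemory:
    """Hypothetical store: a base system prompt plus learned strategy notes."""

    base_prompt: str
    strategies: list[str] = field(default_factory=list)

    def add_strategy(self, note: str) -> bool:
        """Record a strategy as plain text; skip blank notes and duplicates."""
        note = note.strip()
        if not note or note in self.strategies:
            return False
        self.strategies.append(note)
        return True

    def render(self) -> str:
        """Build the system prompt that would accompany the next query."""
        if not self.strategies:
            return self.base_prompt
        notes = "\n".join(f"- {s}" for s in self.strategies)
        return f"{self.base_prompt}\n\nLearned strategies:\n{notes}"


def reflect(query: str, feedback: str) -> str:
    """Placeholder for the reflection step (an assumption, not a real API).

    A real system would ask the LLM to summarize what it learned from the
    exchange; a fixed template is used here instead.
    """
    return f"When asked about '{query}', remember: {feedback}"


# Illustrative usage: feedback on one query becomes a reusable note.
memory = PromptMemory(base_prompt="You are a helpful assistant.")
memory.add_strategy(
    reflect("counting letters", "spell the word out letter by letter first")
)
print(memory.render())
```

The point of the sketch is the design choice the summary describes: the lesson is stored as editable text in the prompt rather than in model weights, so it can be inspected, corrected, or removed directly.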
Claude's 17,000-Word System Prompt Goes Viral! Karpathy's Sharp Take: LLM Training Lacks a Key Paradigm
QbitAI (量子位) · 2025-05-13 01:03