Test-Time Compute
Search documents
2025 AI 年度复盘:读完200篇论文,看DeepMind、Meta、DeepSeek ,中美巨头都在描述哪种AGI叙事
3 6 Ke· 2026-01-12 08:44
Core Insights - The article discusses the evolution of artificial intelligence (AI) in 2025, highlighting a shift from merely increasing model parameters to enhancing model intelligence through foundational research in areas like fluid reasoning, long-term memory, spatial intelligence, and meta-learning [2][4]. Group 1: Technological Advancements - In 2025, significant technological progress was observed in fluid reasoning, long-term memory, spatial intelligence, and meta-learning, driven by the diminishing returns of scaling laws in AI models [2][3]. - The bottleneck in current AI technology lies in the need for models to not only possess knowledge but also to think and remember effectively, revealing a significant imbalance in AI capabilities [2][4]. - The introduction of Test-Time Compute revolutionized reasoning capabilities, allowing AI to engage in deeper, more thoughtful processing during inference [6][10]. Group 2: Memory and Learning Enhancements - The Titans architecture and Nested Learning emerged as breakthroughs in memory capabilities, enabling models to update their parameters in real-time during inference, thus overcoming the limitations of traditional transformer models [19][21]. - Memory can be categorized into three types: context as memory, RAG-processed context as memory, and internalized memory through parameter integration, with significant advancements in RAG and parameter adjustment methods [19][27]. - The introduction of sparse memory fine-tuning and on-policy distillation methods has mitigated the issue of catastrophic forgetting, allowing models to retain old knowledge while integrating new information [31][33]. Group 3: Spatial Intelligence and World Models - The development of spatial intelligence and world models was marked by advancements in video generation models, such as Genie 3, which demonstrated improved physical understanding and consistency in generated environments [35][36]. - The emergence of the World Labs initiative, led by Stanford professor Fei-Fei Li, focused on generating 3D environments based on multimodal inputs, showcasing a more structured approach to AI-generated content [44][46]. - The V-JEPA 2 model introduced by Meta emphasized predictive learning, allowing models to grasp physical rules through prediction rather than mere observation, enhancing their understanding of causal relationships [50][51]. Group 4: Reinforcement Learning Innovations - Reinforcement learning (RL) saw significant advancements with the rise of verifiable rewards and sparse reward metrics, leading to improved performance in areas like mathematics and coding [11][12]. - The GPRO algorithm gained popularity, simplifying the RL process by eliminating the need for a critic model, thus reducing computational costs while maintaining effectiveness [15][16]. - The exploration of RL's limitations revealed a ceiling effect, indicating that while RL can enhance existing model capabilities, further breakthroughs will require innovations in foundational models or algorithm architectures [17][18].
X @Anthropic
Anthropic· 2025-07-29 17:20
Research Findings - Anthropic Research 发现,在某些情况下,更长的推理时间会导致准确率降低 [1] - 研究表明,简单地增加测试时的计算量可能会无意中加强有问题的推理模式 [1] Implications - 行业应警惕测试时计算的逆向扩展现象,即计算资源增加反而导致性能下降 [1] - 行业需要更深入地研究和理解推理过程,以避免因盲目扩展计算资源而产生负面影响 [1]