测试时计算扩展 - filings, earnings calls, financial reports, news

测试时计算扩展

Search documents

机器之心· 2026-03-05 11:03

编辑｜Panda 「震惊！震惊！」是什么让著名计算机科学家和数学家、《计算机程序设计艺术》作者、图灵奖得主高德纳（Donald Knuth）发出了如此惊呼？图片由 AI 生成你没有猜错，正是 AI 。在他近期在斯坦福大学官网上公布的一篇论文《Claude's Cycles》中，开篇的「Shock! Shock!」非常直白地表达了他对于 AI 强大能力的震惊。论文地址： https://cs.stanford.edu/~knuth/papers/claude-cycles.pdf 紧接着他便写到：「我昨天得知，我已经研究了几周的一个开放性问题刚刚被 Claude Opus 4.6——Anthropic 公司三周前发布的混合推理模型 —— 解决了！看来我得在某个时候重新审视我对『生成式 AI』的看法了。不仅我的猜想有了一个不错的解决方案，而且这标志着自动推理和创造性问题解决领域的巨大进步，这真是一件令人高兴的事。我会在这篇短文中简要讲述这个过程。」此事引发了广泛关注，网友们纷纷点评，感叹新时代的到来。 | | | 这是 Hacker News 用户 Ian Danforth 给出的太长不读版本：高 ...

Artificial Intelligence

Artificial Intelligence

Claude Opus 4.6

为大模型思考装上“猎鹰重装引擎” ：腾讯混元 SEAT 重塑深度思考

AI科技大本营· 2025-07-15 11:30

Core Viewpoint - Tencent's Hunyuan team has introduced the SEAT adaptive parallel reasoning framework, transforming complex reasoning tasks from a "single-engine airship" into a "multi-engine rocket," enhancing the capabilities of large models in handling intricate reasoning challenges [7][44]. Group 1: SEAT Framework Overview - The SEAT framework integrates both sequential and parallel scaling paradigms, allowing for extensive exploration and deep refinement of reasoning processes [15][43]. - It employs a multi-round parallel reasoning approach, significantly enhancing the model's exploration capabilities by generating multiple independent reasoning paths simultaneously [16][20]. - The framework is designed to be plug-and-play, enabling easy integration with existing large language models without requiring additional training [29][44]. Group 2: Performance Enhancements - Initial experiments show that even with a minimal parallel setup (N=2), the SEAT framework can achieve a remarkable accuracy improvement of +14.1% for a 32B model and +24.5% for a 7B model [28]. - As the number of parallel paths increases (up to N=8), performance continues to improve, demonstrating the framework's powerful exploration capabilities [23]. Group 3: Semantic Entropy as Navigation - The SEAT framework introduces semantic entropy as a self-supervised metric to gauge the consistency of reasoning outputs, acting as a "navigation sensor" to determine when to stop computations [27][32]. - Two navigation strategies are implemented: a predefined threshold approach and an adaptive threshold-free mechanism, both aimed at optimizing the reasoning process [35][36]. Group 4: Safety Mechanisms - The SEAT framework includes a safety mechanism to prevent "semantic entropy collapse," which can lead to overconfidence and erroneous outputs in smaller models [38][40]. - By monitoring semantic entropy, the framework can issue stop commands before the model's performance deteriorates, ensuring stable reasoning outcomes [40][44].

Artificial Intelligence

Artificial Intelligence

腾讯混元SEAT框架

Grok 4 Heavy