Workflow
混合推理模式
icon
Search documents
DeepSeek、GPT-5带头转向混合推理,一个token也不能浪费
机器之心· 2025-08-30 10:06
Core Insights - The article discusses the trend of hybrid reasoning models in AI, emphasizing the need for efficiency in computational resource usage while maintaining performance [12][11]. - Companies are increasingly adopting adaptive computing strategies to balance cost and performance, with notable implementations from major AI firms [11][12]. Group 1: Industry Trends - The phenomenon of "overthinking" in AI models leads to significant computational waste, prompting the need for adaptive computing solutions [3][11]. - Major AI companies, including OpenAI and DeepSeek, are implementing models that can switch between reasoning modes to optimize token usage, achieving reductions of 25-80% in token consumption [7][10][11]. - The emergence of hybrid reasoning models is expected to become the new norm in the large model field, with a focus on balancing cost and performance [11][12]. Group 2: Company Developments - OpenAI's GPT-5 introduces a routing mechanism that allows the model to select the appropriate reasoning mode based on user queries, enhancing user experience while managing computational costs [36][41]. - DeepSeek's v3.1 model combines reasoning and non-reasoning capabilities into a single model, offering a cost-effective alternative to competitors like GPT-5 [45][46]. - Other companies, such as Anthropic, Alibaba, and Tencent, are also exploring hybrid reasoning models, each with unique implementations and user control mechanisms [18][19][34][35]. Group 3: Economic Implications - Despite decreasing token costs, subscription fees for AI models are rising due to the demand for state-of-the-art (SOTA) models, which are more expensive to operate [14][16]. - The projected increase in token consumption for advanced AI tasks could lead to significant cost implications for users, with estimates suggesting that deep research calls could rise to $72 per day per user by 2027 [15][16]. - Companies are adjusting subscription models and usage limits to manage costs, indicating a shift in the economic landscape of AI services [16][43]. Group 4: Future Directions - The future of hybrid reasoning will focus on developing models that can intelligently self-regulate their reasoning processes to minimize costs while maximizing effectiveness [57]. - Ongoing research and development in adaptive thinking models are crucial for achieving efficient AI systems that can operate at lower costs [52][57].