Core Insights - DeepSeek officially launched DeepSeek-V3.1 on August 21, featuring significant upgrades in its architecture and performance [1]. Group 1: Major Changes in DeepSeek-V3.1 - The new hybrid reasoning architecture allows the model to support both thinking and non-thinking modes simultaneously [2]. - Enhanced thinking efficiency enables DeepSeek-V3.1-Think to provide answers in a shorter time compared to DeepSeek-R1-0528 [2]. - Improved agent capabilities through post-training optimization have led to better performance in tool usage and agent tasks [2]. Group 2: API and User Experience Enhancements - The official app and web model have been upgraded to DeepSeek-V3.1, allowing users to switch freely between thinking and non-thinking modes via a "deep thinking" button [2]. - The DeepSeek API has also been upgraded, with deepseek-chat corresponding to non-thinking mode and deepseek-reasoner to thinking mode, expanding context to 128K [2]. - The API Beta interface now supports strict mode function calling to ensure outputs meet schema definitions [2]. Group 3: Performance Metrics - DeepSeek-V3.1 has shown significant improvements in multiple search evaluation metrics, outperforming R1-0528 in complex search tests requiring multi-step reasoning and expert-level multidisciplinary challenges [2]. Group 4: Technical Adjustments - DeepSeek-V3.1 utilizes UE8M0FP8Scale parameter precision and has made substantial adjustments to the tokenizer and chat template, resulting in noticeable differences from DeepSeek-V3 [3]. Group 5: Market Reaction - Following the announcement of DeepSeek-V3.1, DeepSeek concept stock Daily Interaction (300766) experienced a sharp rise in the late trading session [3].
DeepSeek,重磅发布!