Core Insights - The article highlights the official release of DeepSeek V3.1, emphasizing its enhanced capabilities, particularly in mixed reasoning models and agent performance improvements [1][5][8]. Group 1: Model Updates - DeepSeek V3.1 features a mixed reasoning architecture that supports both thinking and non-thinking modes within a single model [5][7]. - The context length has been expanded to 128K tokens, allowing for more extensive data processing [7]. - The new version shows significant improvements in agent capabilities, particularly in programming and search tasks, with notable performance increases in benchmarks [8][9]. Group 2: Efficiency Improvements - The thinking mode in V3.1 has undergone compression training, resulting in a 20%-50% reduction in output tokens while maintaining performance levels comparable to the previous version [12]. - The non-thinking mode also shows a significant decrease in output length compared to V3-0324, while preserving model performance [12]. Group 3: API and Framework Enhancements - New API features include a strict mode for function calling, ensuring outputs meet defined schema requirements [14]. - Compatibility with Anthropic API has been added, facilitating integration with other frameworks like Claude Code [14]. Group 4: Open Source and Training - The V3.1 Base model has been trained on an additional 840 billion tokens, enhancing its capabilities [15]. - Both the base model and post-training model are now open-sourced on platforms like Hugging Face and ModelScope [15]. Group 5: Pricing Adjustments - A new pricing structure will take effect on September 6, 2025, which includes the cancellation of night-time discounts [16]. - During the transition period before the new pricing takes effect, the original pricing policy will still apply [16].
DeepSeek-V3.1 发布,官方划重点:Agent、Agent、Agent!
Founder Park·2025-08-21 08:16