Core Insights - DeepSeek is expected to launch its next-generation flagship AI model, V4, in the coming weeks, focusing on strong code generation capabilities [2][6] - The V4 model is an iteration of the V3 model released in December 2024, and initial tests indicate it outperforms existing mainstream models like Anthropic, Claude, and OpenAI's GPT series in code generation [2][6] - The anticipated launch date for the V4 model is around mid-February, coinciding with the Lunar New Year, although this may be subject to change [2][6] Model Performance and Features - The V4 model has achieved a technological breakthrough in handling and parsing long code prompts, providing significant advantages for engineers working on complex software projects [4][7] - Improvements in understanding data patterns throughout the training process have been made, with no performance degradation observed [4][7] - Users can expect more logically coherent and clear outputs from the V4 model, reflecting enhanced reasoning capabilities and increased reliability in executing complex tasks [4][7] Previous Models and Market Impact - The V3.2 version released in December 2024 outperformed OpenAI's GPT-5 and Google's Gemini 3.0 Pro in certain benchmark tests, but no major model iterations have been released since, heightening anticipation for the V4 model [3][7] - DeepSeek's R1 model, an open-source reasoning model, gained significant attention for its cost-effective training relative to leading models developed in the U.S., while still delivering impressive performance [2][6] Research and Development Innovations - A new training architecture proposed in a recent research paper co-authored by DeepSeek's CEO allows for the development of larger AI models without proportionally increasing chip investments [8][9] - This series of technological advancements indicates that DeepSeek continues to make strides in innovation within the AI sector [8][9]
知情人士:DeepSeek将于2月发布其最新旗舰AI模型。