Workflow
闭源商业模型
icon
Search documents
网友热评Deepseek新版V3:编程堪比最强AI,期待更强R2!
硬AI· 2025-03-25 12:41
Core Viewpoint - DeepSeek has quietly released its new V3-0324 model, which boasts 671 billion parameters and improved coding capabilities comparable to Claude 3.7 Sonnet, marking a significant upgrade in performance without a major public announcement [3][10]. Group 1: Model Specifications - The V3-0324 model utilizes a mixture of experts (MoE) architecture with 671 billion parameters and 37 billion active parameters, addressing load balancing issues through an innovative "bias term" mechanism [10][11]. - The model's design includes a node-constrained routing mechanism to reduce cross-node communication overhead, enhancing training efficiency for large-scale distributed training [10][11]. Group 2: Programming Capabilities - V3-0324 achieved a coding score of 328.3, surpassing the standard Claude 3.7 Sonnet (322.3) and nearing the chain-of-thought version (334.8), establishing it as one of the strongest open-source models for programming tasks [13][14]. - Users reported that a simple prompt could generate an entire login page, demonstrating the model's advanced coding capabilities and aesthetic improvements over previous versions [16][19]. Group 3: Open Source License - The V3-0324 model has been updated to an MIT open-source license, which is more permissive than the initial version, allowing for easier integration with commercial and proprietary software [24]. - This change significantly lowers the barriers for developers and companies looking to implement high-performance AI models in commercial projects, accelerating the democratization of AI technology [24]. Group 4: Industry Impact - The emergence of DeepSeek V3-0324 indicates that open-source AI models are rapidly catching up to, and in some aspects surpassing, top-tier closed-source commercial models, creating unprecedented pressure on companies like OpenAI and Anthropic [27][28]. - As open-source models like DeepSeek continue to enhance their performance and relax usage conditions, the process of democratizing AI technology is accelerating, fostering a more open and innovative AI ecosystem [28][29].