开源模型崛起

Search documents
DeepSeek终于丢了开源第一王座,但继任者依然来自中国
猿大侠· 2025-07-19 03:43
Core Viewpoint - Kimi K2 has surpassed DeepSeek to become the number one open-source model globally, ranking fifth overall, closely following top proprietary models like Musk's Grok 4 [1][18]. Group 1: Rankings and Performance - Kimi K2 achieved a score of 1420, placing it fifth in the overall rankings, with only a slight gap from leading proprietary models [2][21]. - The top ten models all scored above 1400, indicating that open-source models are increasingly competitive with proprietary ones [20][22]. - Kimi K2's performance in various categories includes tying for first in multi-turn dialogue and second in programming ability, matching models like GPT 4.5 and Grok 4 [3][18]. Group 2: Community Engagement and Adoption - Kimi K2 has gained significant attention in the open-source community, with 5.6K stars on GitHub and nearly 100,000 downloads on Hugging Face [5][4]. - The CEO of AI search engine startup Perplexity has publicly endorsed Kimi K2, indicating plans for further training based on this model [5][24]. Group 3: Architectural Decisions - Kimi K2 inherits the DeepSeek V3 architecture but includes several parameter adjustments to optimize performance [8][11]. - Key structural changes in Kimi K2 include increasing the number of experts, halving the number of attention heads, retaining only the first layer as dense, and implementing flexible routing for expert combinations [12][14]. - Despite an increase in total parameters by 1.5 times, the model's efficiency in prefill and decode times has improved, suggesting a cost-effective optimization strategy [13][14]. Group 4: Industry Perspectives - The perception that open-source models are inferior is being challenged, with industry experts predicting that open-source will increasingly outperform proprietary models [18][24]. - Tim Dettmers from the Allen Institute for AI and the CEO of Perplexity have both emphasized the growing importance of open-source models in shaping AI capabilities globally [24][25].