GPT 4.5

Search documents
DeepSeek终于丢了开源第一王座,但继任者依然来自中国
猿大侠· 2025-07-19 03:43
Core Viewpoint - Kimi K2 has surpassed DeepSeek to become the number one open-source model globally, ranking fifth overall, closely following top proprietary models like Musk's Grok 4 [1][18]. Group 1: Rankings and Performance - Kimi K2 achieved a score of 1420, placing it fifth in the overall rankings, with only a slight gap from leading proprietary models [2][21]. - The top ten models all scored above 1400, indicating that open-source models are increasingly competitive with proprietary ones [20][22]. - Kimi K2's performance in various categories includes tying for first in multi-turn dialogue and second in programming ability, matching models like GPT 4.5 and Grok 4 [3][18]. Group 2: Community Engagement and Adoption - Kimi K2 has gained significant attention in the open-source community, with 5.6K stars on GitHub and nearly 100,000 downloads on Hugging Face [5][4]. - The CEO of AI search engine startup Perplexity has publicly endorsed Kimi K2, indicating plans for further training based on this model [5][24]. Group 3: Architectural Decisions - Kimi K2 inherits the DeepSeek V3 architecture but includes several parameter adjustments to optimize performance [8][11]. - Key structural changes in Kimi K2 include increasing the number of experts, halving the number of attention heads, retaining only the first layer as dense, and implementing flexible routing for expert combinations [12][14]. - Despite an increase in total parameters by 1.5 times, the model's efficiency in prefill and decode times has improved, suggesting a cost-effective optimization strategy [13][14]. Group 4: Industry Perspectives - The perception that open-source models are inferior is being challenged, with industry experts predicting that open-source will increasingly outperform proprietary models [18][24]. - Tim Dettmers from the Allen Institute for AI and the CEO of Perplexity have both emphasized the growing importance of open-source models in shaping AI capabilities globally [24][25].
梁文锋等来及时雨
是说芯语· 2025-07-19 01:26
Core Viewpoint - The article discusses the competitive landscape of AI models, particularly focusing on DeepSeek and its challenges in maintaining user engagement and market position against emerging competitors like Kimi and others in the "AI Six Dragons" group [3][4][8]. Group 1: DeepSeek's Performance and Challenges - DeepSeek experienced a significant decline in monthly active users, dropping from a peak of 169 million in January to 160 million by May, a decrease of 5.1% [3][4]. - The app's download ranking has plummeted, falling out of the top 30 in the Apple App Store, indicating a loss of user interest [4]. - The user engagement rate for DeepSeek has decreased from 7.5% at the beginning of the year to 3% by the end of May, with website traffic also down by 29% [4][5]. Group 2: Competition and Market Dynamics - Competitors like Kimi and others are rapidly releasing new models, with Kimi K2 being highlighted for its performance and open-source nature, achieving state-of-the-art results in various benchmarks [10][11]. - The pricing strategy of Kimi K2 aligns closely with DeepSeek's, offering competitive rates for API usage, which could further erode DeepSeek's market share [11]. - Other players in the market are also emphasizing cost-effectiveness and performance, challenging DeepSeek's previously established reputation for value [10][11]. Group 3: Technological and Strategic Implications - DeepSeek's R2 model has faced delays due to supply chain issues related to the NVIDIA H20 chip, which has impacted its computational capabilities [5][7]. - The lack of significant updates to DeepSeek's models has led to a perception of stagnation, with competitors rapidly advancing in both performance and features [8][10]. - The article suggests that DeepSeek needs to quickly release new models and enhance its capabilities to regain market interest and user engagement [17][19].
DeepSeek终于丢了开源第一王座,但继任者依然来自中国
量子位· 2025-07-18 08:36
Core Viewpoint - Kimi K2 has surpassed DeepSeek to become the number one open-source model globally, ranking fifth overall, closely following top proprietary models like Musk's Grok 4 [1][19]. Group 1: Ranking and Performance - Kimi K2 achieved a score of 1420, placing it fifth in the overall ranking, with only a slight gap from leading proprietary models [2][22]. - The top ten models now all have scores above 1400, indicating that open-source models are increasingly competitive with proprietary ones [20][21]. Group 2: Community Engagement and Adoption - Kimi K2 has gained significant attention in the open-source community, with 5.6K stars on GitHub and nearly 100,000 downloads on Hugging Face [5][4]. - The CEO of AI search engine startup Perplexity has publicly endorsed Kimi K2, indicating its strong internal evaluation and future plans for further training based on this model [5][27]. Group 3: Model Architecture and Development - Kimi K2 inherits the DeepSeek V3 architecture but includes several parameter adjustments to optimize performance [9][12]. - Key modifications in Kimi K2's structure include increasing the number of experts, halving the number of attention heads, retaining only the first layer as dense, and implementing flexible expert routing [13][15]. Group 4: Industry Trends and Future Outlook - The stereotype that open-source models are inferior is being challenged, with industry experts predicting that open-source will increasingly outperform proprietary models [19][24]. - Tim Dettmers from the Allen Institute for AI suggests that open-source models defeating proprietary ones will become more common, highlighting their importance in localizing AI experiences [25][27].
梁文锋等来及时雨
36氪· 2025-07-16 10:19
Core Viewpoint - The article discusses the competitive landscape of AI large models, focusing on DeepSeek's challenges and the emergence of new players like Kimi, which are rapidly gaining market attention and user engagement [3][4][10]. Group 1: DeepSeek's Performance and Challenges - DeepSeek experienced a significant decline in monthly active users, dropping from a peak of 1.69 billion in May, reflecting a 5.1% decrease [4]. - The user engagement for DeepSeek has fallen from a peak of 7.5% in January to 3% by the end of May, with a 29% decrease in website traffic [4][5]. - The company has faced delays in launching its R2 model due to unexpected export restrictions on the H20 chip, which has limited its computational resources [5][8]. Group 2: Competitive Landscape - Other AI players, referred to as the "AI Six Dragons," are set to release new foundational models, intensifying competition against DeepSeek [3][4]. - Kimi's K2 model has achieved state-of-the-art performance in various benchmarks, surpassing DeepSeek in tasks related to coding and mathematical reasoning [14]. - The pricing strategy of Kimi K2 aligns closely with DeepSeek's API pricing, making it a direct competitor in terms of cost [15]. Group 3: Market Dynamics and User Preferences - DeepSeek's reputation for cost-effectiveness is being challenged as competitors like Alibaba, ByteDance, and Baidu offer lower-priced alternatives [13]. - The lack of significant upgrades in DeepSeek's models has led to a perception shift, with users increasingly viewing it as less competitive compared to newer models [12][13]. - The context window limitation of DeepSeek's models (64K) is significantly smaller than that of competitors like Kimi K2 (128K) and MiniMax-M1 (1 million), impacting its performance [22][23]. Group 4: Future Considerations - To regain market interest, DeepSeek must expedite the release of new models and enhance its capabilities, particularly in multi-modal functionalities, which are becoming increasingly important in the AI landscape [28][30]. - The article suggests that DeepSeek's focus on open-source development should also align with commercial viability to maintain user engagement and developer activity [24][25].
Think a Recession Is Coming? This AI Stock Can Still Thrive.
The Motley Fool· 2025-05-06 09:15
One of the core assumptions that underpins the artificial intelligence (AI) boom is that each new generation of AI model will require ever-increasing computational horsepower to train and run. DeepSeek, the Chinese AI company that managed to put out an AI model that performed well using a fraction of the computational resources of top-tier AI models, raised some serious questions about the future of the AI industry. There are some other signs, as well, that more computing power may not be the answer. OpenAI ...