Ascend MindSpeed

Can Alibaba's Qwen3 Become the Next DeepSeek?
36Kr · 2025-05-07 11:38
Core Insights
- Alibaba's Tongyi Qianwen team has officially released and open-sourced its new-generation model family, Qwen3. The family comprises two mixture-of-experts (MoE) models with 30B and 235B parameters and six dense models ranging from 0.6B to 32B [1][2]

Model Features
- Qwen3 is the first hybrid reasoning model family in China, and its release has prompted intense discussion in the open-source community about whether it could become the next DeepSeek [2]
- Qwen3 uses a mixture-of-experts (MoE) architecture: the largest model has 235B total parameters but activates only 22B of them at a time, which significantly reduces the compute needed at inference [3][6]
- The pre-training data volume for Qwen3 has increased to 36T, three times that of Qwen2.5, improving its performance in reasoning, instruction following, tool invocation, and multilingual tasks [5]

Deployment and Cost Efficiency
- Qwen3 is significantly cheaper to deploy than DeepSeek: the full version requires only four H20 cards, about one-third of the memory used by models with comparable performance [6]
- The low deployment cost and quick adaptation of Qwen3 led to customer orders immediately after its release [2][6]

Strategic Positioning
- Alibaba is intensifying its AI strategy, planning to invest over 380 billion yuan in cloud and AI hardware infrastructure over the next three years, with the aim of strengthening its "twin stars" strategy of Tongyi Qianwen and Quark [7]
- Quark has emerged as the primary user-facing entry point for Qwen3, with all users able to access the latest open-source model for free, while Tongyi Qianwen has been pivotal in supporting enterprise (B-end) services [9]

Market Challenges
- Despite the technological advantages of Tongyi Qianwen, challenges remain in lowering usage barriers for small and medium enterprises and in preventing the user experience from being diluted as Quark's user base grows [10]
- Competition is fierce: other tech giants such as Tencent and ByteDance are also making significant moves in AI, which underlines the uncertainties ahead for Alibaba's AI strategy [10]
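
To put the MoE figures reported above in perspective, the following is a minimal sketch of the arithmetic behind "235B total, 22B activated". The function name `active_fraction` is hypothetical, and the idea that per-token compute scales roughly with the number of activated parameters is a simplifying assumption, not something the article states.

```python
def active_fraction(total_params_b: float, active_params_b: float) -> float:
    """Return the share of parameters activated at a time in an MoE model.

    Both arguments are parameter counts in billions.
    """
    if total_params_b <= 0 or not 0 <= active_params_b <= total_params_b:
        raise ValueError("expected 0 <= active <= total and total > 0")
    return active_params_b / total_params_b


# Figures taken from the summary above: 235B total, 22B activated.
TOTAL_B = 235.0
ACTIVE_B = 22.0

fraction = active_fraction(TOTAL_B, ACTIVE_B)

# Assumption: compute per token is roughly proportional to activated parameters.
print(f"Activated share of parameters: {fraction:.1%}")  # 9.4%
```

Note that this ratio only describes compute. All 235B parameters still have to be stored, so memory requirements depend on the total parameter count and the numeric precision used, which the article does not specify.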
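[Ascend Full Series Supports Qwen3] April 29: According to the Huawei Computing official account, Qwen3 was released and open-sourced on April 29, 2025. Ascend MindSpeed and MindIE have supported the Qwen series models in step with each release. As soon as the Qwen3 series was released and open-sourced, it worked out of the box in MindSpeed and MindIE, achieving day-0 adaptation for Qwen3.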
news flash· 2025-04-29 06:27
Core Insights
- Huawei's Ascend series fully supports the Qwen3 model, which was released and open-sourced on April 29, 2025 [1]
- Ascend MindSpeed and MindIE have consistently supported the Qwen series models, ensuring immediate compatibility with Qwen3 upon its release [1]