阿里发布并开源模型Qwen3,成本仅为DeepSeek-R1的1/3
Guan Cha Zhe Wang·2025-04-29 03:27

Core Insights - Alibaba has launched the new Qwen3 model, which is claimed to be the world's strongest open-source model, outperforming leading models like DeepSeek-R1 and OpenAI-o1 while significantly reducing costs and computational power requirements [1][3]. Model Performance - The flagship version Qwen3-235B-A22B achieved high scores in various benchmark tests, including 81.5 in the AIME25 assessment, over 70 in the LiveCodeBench evaluation, and 95.6 in the ArenaHard test, surpassing competitors [1][2]. - Qwen3's total parameter count is 235 billion, setting a new standard for open-source model intelligence, and it can be deployed with only four H20 GPUs, utilizing one-third of the memory compared to similar performance models [3]. Model Variants and Accessibility - Qwen3 includes multiple versions, such as 30B and 235B MoE models, along with several dense models ranging from 0.6B to 32B [3]. - The model supports over 119 languages and is available under a permissive Apache 2.0 license, allowing global developers and organizations to download and commercialize it for free [6]. Industry Impact - Qwen3 is expected to enhance the capabilities of intelligent agents and large model applications, lowering the barriers for utilizing agent tools [6]. - Alibaba has released over 200 models, with a global download count exceeding 300 million, establishing Qwen3 as the leading open-source model, surpassing the U.S. Llama [6].

Seek .-阿里发布并开源模型Qwen3,成本仅为DeepSeek-R1的1/3 - Reportify