Open-Source Large Models
Alibaba open-sources three mid-sized Qwen3.5 models: performance no longer depends on stacking parameters
Feng Huang Wang· 2026-02-25 07:49
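According to the official announcement, Qwen3.5-35B-A3B already outperforms the larger previous-generation models Qwen3-235B-A22B-2507 and Qwen3-VL-235B-A22B, while Qwen3.5-122B-A10B and the 27B version further narrow the gap between mid-sized models and frontier models, performing especially well in complex agent scenarios. This shows that performance now outweighs scale: it no longer depends purely on stacking parameters, but is driven by architectural optimization, better data quality, and reinforcement learning. Feng Huang Wang Technology, February 25: the Qwen team officially announced the open-source release of the latest mid-sized Qwen3.5 models: Qwen3.5-35B-A3B, Qwen3.5-122B-A10B, and Qwen3.5-27B. ...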
Qwen3.5 open-source family expands
Cai Jing Wang· 2026-02-25 07:04
Core Insights
- Following the initial open-source release of the flagship model Qwen3.5-397B-A17B, the company has open-sourced three more models: Qwen3.5-122B-A10B, Qwen3.5-35B-A3B, and Qwen3.5-27B (Dense) [1]
- The Qwen3.5-Flash API has officially launched on Alibaba Cloud [1]
Alibaba releases three new mid-sized Qwen3.5 models
Mei Ri Jing Ji Xin Wen· 2026-02-25 06:50
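Mei Jing AI news flash, February 25: Alibaba continued to open-source the Qwen3.5 model series. This release covers three new mid-sized models: Qwen3.5-35B-A3B, Qwen3.5-122B-A10B, and Qwen3.5-27B. Qwen3.5-Flash is now available on Alibaba Cloud Bailian, with input priced from as low as 0.2 yuan per million tokens. ...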
Qwen: Qwen3.5-Flash arrives, and all three mid-sized models are open-sourced
Xin Lang Cai Jing· 2026-02-25 06:44
Core Insights
- Three Qwen3.5 models have been officially open-sourced: Qwen3.5-35B-A3B, Qwen3.5-122B-A10B, and Qwen3.5-27B. They show significant performance improvements over larger previous-generation models [1][2][12]

Model Performance
- Qwen3.5-35B-A3B outperforms the larger previous-generation models Qwen3-235B-A22B-2507 and Qwen3-VL-235B-A22B, indicating that performance is now driven by architectural optimization, data quality enhancement, and reinforcement learning rather than by parameter scaling alone [1][3][13]
- Qwen3.5-122B-A10B and Qwen3.5-27B further narrow the performance gap between mid-sized models and frontier models, and they excel particularly in complex agent scenarios [1][3][13]

Architectural Innovations
- Qwen3.5 combines a hybrid attention mechanism with a highly sparse MoE architecture and is trained on a larger volume of mixed text and visual tokens, achieving stronger performance with fewer total and active parameters [3][10][15]
- The new models surpass larger models on several authoritative benchmarks, including IFBench, GPQA, HMMT 25, MMMLU, BFCL v4, and SWE-bench Verified [3][10][15]

Developer Accessibility
- Qwen3.5-27B is designed for local deployment, with enhanced agent capabilities and native multimodal abilities, and it outperforms GPT-5 mini in multiple agent evaluations [4][16]
- The Qwen3.5-Flash API service is available on Alibaba Cloud, priced at 0.2 yuan per million tokens, giving developers and enterprises high performance at low cost [5][17]

Community Support
- All three models are available on platforms such as ModelScope and Hugging Face, together with the open-sourced Qwen3.5-35B-A3B-Base model to support community research, fine-tuning, and secondary development [7][19]
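The "fewer total and active parameters" point can be made concrete with a little arithmetic. The sketch below assumes the usual Qwen naming convention, in which a name such as Qwen3.5-35B-A3B encodes 35B total parameters and 3B activated per token, and a name without an "A" part denotes a dense model. The helper `active_fraction` is hypothetical and written only for this illustration; it is not part of any Qwen tooling.

```python
import re
from typing import Optional

# Assumption: "<total>B-A<active>B" in a model name gives total and
# activated parameter counts in billions (e.g. "35B-A3B").
_MOE_PATTERN = re.compile(r"-(\d+(?:\.\d+)?)B-A(\d+(?:\.\d+)?)B")


def active_fraction(model_name: str) -> Optional[float]:
    """Return activated/total parameters, or None for names without an A-part."""
    match = _MOE_PATTERN.search(model_name)
    if match is None:
        return None  # treated as a dense model: every parameter is active
    total_b, active_b = float(match.group(1)), float(match.group(2))
    return active_b / total_b


models = [
    "Qwen3.5-35B-A3B",
    "Qwen3.5-122B-A10B",
    "Qwen3.5-27B",
    "Qwen3.5-397B-A17B",
    "Qwen3-235B-A22B-2507",
]

for name in models:
    share = active_fraction(name)
    label = "dense" if share is None else f"{share:.1%} of parameters active"
    print(f"{name}: {label}")
```

Under that naming assumption, each MoE model listed here activates roughly a tenth or less of its parameters per token, which is what "highly sparse" refers to.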
Alibaba's Qwen announces expansion of the Qwen3.5 open-source family
Di Yi Cai Jing· 2026-02-25 02:15
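According to the official social media account of Tongyi Lab, following the first open-source release of the flagship model Qwen3.5-397B-A17B, the lab has now also open-sourced Qwen3.5-122B-A10B, Qwen3.5-35B-A3B, and Qwen3.5-27B (Dense). In addition, the Qwen3.5-Flash API is now officially available on Alibaba Cloud Bailian. (Source: Di Yi Cai Jing) ...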
Small beats big at high value for money: Qwen's real Spring Festival trump card has arrived
Xin Lang Cai Jing· 2026-02-17 05:14
Core Viewpoint
- Alibaba has launched the Qwen3.5-Plus model, positioned as the strongest open-source model globally. It outperforms top closed-source models such as Gemini-3-pro and GPT-5.2, while its API price of 0.8 yuan per million tokens is only 1/18 of Gemini-3-pro's cost [2][8].

Group 1: Technical Innovations
- Qwen3.5-Plus has 397 billion parameters in total, of which only 17 billion are activated during inference, so it uses less than 5% of the compute while still drawing on its full knowledge base [5].
- A hybrid attention mechanism lets the model allocate attention resources dynamically according to the importance of the information, improving computational efficiency [4].
- The model has moved from a pure text model to a native multimodal model, with significantly stronger reasoning, programming, and evaluation results, surpassing Gemini 3 Pro and GPT-5.2 on some benchmarks [5].

Group 2: Business Logic Behind Open Source
- The efficiency gained through architectural innovation makes Qwen3.5 both powerful and cost-effective, putting advanced AI capabilities within reach of individual developers, startups, and small enterprises [7].
- Since it began open-sourcing in 2023, Alibaba has released over 400 Qwen models with more than 1 billion downloads, and Qwen has become the leading open-source model family, recognized for its developer-friendly approach [7].
- Qwen continues to evolve: it now supports 201 languages, and its vocabulary has grown from 150,000 to 250,000 entries, which can improve encoding efficiency for less common languages by up to 60% [7].

Group 3: Market Position and Growth
- According to Omdia, Alibaba Cloud's share of China's cloud market rose from 34% to 36% in Q3 2025, keeping it the market leader [9].
- AI has become a major driver of new demand for cloud infrastructure services, with Alibaba Cloud's AI-related product revenue posting triple-digit year-on-year growth for nine consecutive quarters [9].

Group 4: Agent Capabilities
- Qwen3.5 has made breakthroughs in agent applications: it can autonomously carry out tasks on phones and computers, significantly improving operational efficiency [11].
- The Qwen App has launched the world's first consumer-grade AI shopping agent, which processed 120 million orders in six days during the Spring Festival, demonstrating its commercial viability [11].
- Developers can access the new Qwen3.5-Plus model through platforms such as Hugging Face and Alibaba Cloud, and further releases in different model sizes and with different functionality are planned [11].
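The headline figures in this summary (17 billion of 397 billion parameters active, 0.8 yuan per million tokens, 1/18 of the comparator's cost) can be checked with a short calculation. This is only a sketch under simplifying assumptions: it treats the quoted price as a flat rate for all tokens, whereas real billing may distinguish input from output tokens or use tiers. The function names and the 50-million-token workload are hypothetical and chosen purely for illustration.

```python
from decimal import Decimal

# Figures quoted in the article summary.
TOTAL_PARAMS_B = Decimal(397)            # total parameters, in billions
ACTIVE_PARAMS_B = Decimal(17)            # parameters activated at inference
PRICE_PER_MILLION_YUAN = Decimal("0.8")  # quoted API price per million tokens
PRICE_RATIO = 18                         # quoted: 1/18 of Gemini-3-pro's cost


def active_share() -> Decimal:
    """Fraction of parameters that are active during inference."""
    return ACTIVE_PARAMS_B / TOTAL_PARAMS_B


def token_cost_yuan(tokens: int,
                    price_per_million: Decimal = PRICE_PER_MILLION_YUAN) -> Decimal:
    """Cost of `tokens` tokens, assuming one flat per-million-token price."""
    return Decimal(tokens) / Decimal(1_000_000) * price_per_million


def implied_comparator_price() -> Decimal:
    """Comparator price per million tokens implied by the quoted 1/18 ratio."""
    return PRICE_PER_MILLION_YUAN * PRICE_RATIO


print(f"active share: {active_share():.2%}")
print(f"cost of 50M tokens: {token_cost_yuan(50_000_000)} yuan")
print(f"implied comparator price: {implied_comparator_price()} yuan per million tokens")
```

`Decimal` is used instead of floats so that the currency arithmetic stays exact; the active share comes out a little above 4%, consistent with the "less than 5%" claim.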
Alibaba to open-source next-generation Qwen3.5 model on Lunar New Year's Eve
Di Yi Cai Jing· 2026-02-16 02:13
Core Insights
- Alibaba is set to open-source its next-generation Qwen3.5 model on Lunar New Year's Eve, marking a significant innovation in model architecture [1]

Group 1
- The new Qwen3.5 model represents a comprehensive innovation in its architecture [1]
The hard-fought battles of 2025: large models revolve around open source
36 Ke· 2025-12-25 10:29
Core Viewpoint
- By 2025, open source will dominate the landscape of large models, with a significant increase in the number of users adopting open-source models globally, marking a shift in the competitive dynamics between open-source and closed-source approaches [1][20].

Group 1: Open-Source vs Closed-Source Dynamics
- The debate between open-source and closed-source large models has been ongoing, with both sides presenting strong arguments, but the trend is shifting towards open source as more major internet companies adopt this approach [1][5].
- Closed-source models, initially seen as the only viable path because of their advantages in data security and commercial monetization, now face challenges in areas such as AI accessibility and ecosystem development [3][10].
- The emergence of open-source models has created a new competitive landscape, with companies like Meta and Alibaba leading the charge in open-source initiatives [5][10].

Group 2: Impact of DeepSeek
- The introduction of DeepSeek has significantly altered the competitive balance, demonstrating that open-source models can achieve high performance at lower cost and attracting more companies to switch to open-source strategies [7][20].
- DeepSeek's training cost was approximately $294,000, with a training duration of about 80 hours, showcasing a more efficient approach than traditional methods [7].
- Open-source models like DeepSeek and Qwen have reportedly matched or even surpassed the performance of leading international products, shifting the focus of competition from pure performance to cost, efficiency, and commercialization capabilities [8][20].

Group 3: Market Trends and User Engagement
- The AI application market is evolving rapidly, with active users on mobile and PC reaching 729 million and 200 million respectively by September 2025, indicating a shift towards more specialized and efficient applications [11][13].
- Open-source models are seen as the quickest path to market, fostering a collaborative ecosystem that enhances user engagement and accelerates innovation [13][14].
- Companies increasingly recognize the long-term commercial value of high user engagement within open-source ecosystems, leading to a race among internet giants to provide comprehensive open-source solutions [15][19].

Group 4: Commercialization of Open-Source
- Open source does not mean free; companies are exploring various monetization strategies, including enterprise versions, commercial APIs, and cloud services, to sustain their open-source initiatives [18][19].
- Alibaba has open-sourced over 300 models, which have spawned more than 170,000 derivative models, positioning it as a leader in the global open-source model landscape [16].
- Baidu is integrating its self-developed Kunlun chips with open-source models, adopting a full-stack autonomous approach to enhance its competitive edge [17].
Going all in on the consumer market: is Alibaba's Qianwen "arriving late" or "a master emerging from seclusion"?
Bei Ke Cai Jing· 2025-11-20 11:36
Core Insights
- Alibaba officially announced the "Qianwen" project on November 17, aiming to enter the consumer (to-C) AI market and disrupt the domestic market for large-model applications [1]
- The Qwen series models have gained global recognition for their capabilities, but the Qianwen app has not achieved a comparable user base in the to-C market [1][2]
- The launch of the Qianwen app is seen as a strategic move to compete directly with ChatGPT and other leading AI applications [2][4]

Market Positioning
- As of October, ChatGPT had 800 million weekly active users, while Qianwen's monthly active users were only 404,480, indicating a significant gap in user engagement [2][3]
- The top three AI chat applications in China are Doubao (272 million), DeepSeek (163 million), and Tencent Yuanbao (53.05 million), with Qianwen lagging far behind [2][3]
- Despite Qianwen's low user numbers, Alibaba's AI application Quark has held a steady third position in the AI application rankings, with 95.46 million monthly active users [8][11]

Product Development
- The Qianwen app has undergone multiple name changes, reflecting Alibaba's shifting strategy in the to-C AI space over the past two years [3][4]
- The app features a user-friendly interface with 14 functional options, including real-time recording and AI-assisted tasks, leveraging the capabilities of the Qwen model family [5][7]
- Qwen has over 300 open-source models and more than 600 million downloads, positioning it as a leader in the open-source large-model space [7]

Strategic Goals
- Alibaba aims to create a "super entrance" for AI in the consumer market, integrating everyday scenarios such as maps, food delivery, and shopping into the Qianwen app [4][13]
- The company plans to launch an international version of the Qianwen app to compete for global users, leveraging its existing model capabilities [7][14]
- The Qianwen team believes that no national-level AI application has yet emerged in China, which leaves room for growth in the market [13]

Challenges Ahead
- The competitive landscape is dominated by Doubao, which has a substantial lead in monthly active users, posing a significant challenge for Qianwen [11][12]
- Alibaba's internal data barriers across its various business lines may hinder the integration Qianwen needs to fully leverage its ecosystem [14]
- Users still perceive "task-oriented AI" as a tool, and cultivating high-frequency usage habits for Qianwen will be a challenge [14][15]
The endgame of large models is open source
36 Ke· 2025-10-13 10:06
Core Insights
- The competition among major tech companies in the AI model space is shifting towards open-source strategies, with companies like Alibaba, Tencent, and Baidu releasing their models simultaneously, indicating a consensus on the necessity of open-source approaches [1][2][10]
- Open source is no longer an optional strategy but a critical requirement for companies to gain a competitive edge in the evolving market [2][10]
- The focus is now on the breadth and depth of ecosystems rather than just the technical superiority of individual models, as companies aim to create comprehensive platforms for developers [11][16]

Group 1: Open-Source Strategy
- Major companies are increasingly adopting open-source models to leverage collective developer intelligence and enhance model capabilities [1][2][5]
- Tencent's recent releases, including the "Hunyuan Image 3.0" model, highlight its strategy of engaging external developers and accelerating progress on complex tasks like 3D modeling [2][3]
- Alibaba has released multiple models, including the flagship Qwen3-Max, and has open-sourced over 300 models with significant download numbers, aiming to become the preferred choice for developers [3][8]

Group 2: Market Dynamics
- The open-source movement is seen as a response to diverse industry needs, with companies like Baidu optimizing their models for specific applications such as OCR and education [5][10]
- The competitive landscape is evolving, with companies needing to demonstrate not just technical capabilities but also the ability to integrate their models into broader industry applications [11][14]
- The shift towards open source is expected to lower barriers for enterprises, allowing them to adopt advanced AI technologies at reduced cost [5][10]

Group 3: Ecosystem Development
- Companies are focusing on building extensive ecosystems around their open-source models, which will drive dependency on their cloud infrastructure and services [7][10]
- The competition is not just about releasing models but also about how effectively companies can convert these open-source capabilities into industry applications and developer loyalty [10][16]
- Baidu's strategy involves integrating its models with proprietary hardware, enhancing the overall ecosystem and making it more appealing to enterprise clients [13][16]