BABA(09988)
Search documents
千问3.5,除夕开源!
Shang Hai Zheng Quan Bao· 2026-02-16 11:08
Core Insights - Alibaba has launched the new generation model Qwen3.5-Plus, which performs comparably to Gemini 3 Pro, with plans to release various sizes and functionalities of the Qwen3.5 series models soon [2][6] - The Qwen3.5 model represents a significant leap from previous versions, transitioning from a pure text model to a native multimodal model, enhancing its capabilities in reasoning and knowledge acquisition [4][8] Performance Metrics - Qwen3.5 achieved a score of 87.8 in the MMLU-Pro knowledge reasoning evaluation, surpassing GPT-5.2, and scored 88.4 in the GPQA assessment, exceeding Claude 4.5 [4] - In the IFBench instruction-following evaluation, Qwen3.5 set a record with a score of 76.5, outperforming all other models [4] - The model's performance in various benchmarks, including BFCL-V4 and Browsecomp, also exceeded that of Gemini 3 Pro and GPT-5.2 [4] Technical Innovations - The Qwen3.5 model features a total of 397 billion parameters, with only 17 billion activated, achieving high efficiency while reducing deployment memory usage by 60% [6][8] - Innovations in the Transformer architecture, including self-developed gating technology and a hybrid architecture combining linear attention and sparse mixture of experts (MoE), contribute to the model's efficiency [8][10] Multimodal Capabilities - Qwen3.5 has made significant advancements in visual capabilities, excelling in various evaluations such as MathVision, RealWorldQA, and CC_OCR [6] - The model supports direct input of videos up to 2 hours long, enhancing its ability to analyze and summarize long video content [6] Market Impact - The Qwen3.5-Plus model's API pricing is significantly lower, at 0.8 yuan per million tokens, which is only 1/18 of the cost of Gemini 3 Pro [6] - Since its open-source launch, Alibaba has released over 400 Qwen models, achieving over 1 billion downloads globally, with a monthly download volume surpassing that of the next seven competitors combined [12]
最新!阿里开源新一代大模型
券商中国· 2026-02-16 11:08
Core Viewpoint - Alibaba has launched the new generation large model Qwen3.5-Plus, which features significant architectural innovations and performance improvements compared to previous versions [1]. Group 1: Model Specifications and Performance - The Qwen3.5-Plus model has a total parameter count of 397 billion, with 17 billion activated parameters, outperforming the trillion-parameter Qwen3-Max model [2]. - The deployment memory usage has been reduced by 60%, and the maximum inference throughput can be increased by up to 19 times [2]. - The API pricing for Qwen3.5-Plus is set at 0.8 yuan per million tokens, which is 1/18 of the price of Gemini 3 Pro [2]. Group 2: Features and Capabilities - Qwen3.5-Plus has been pre-trained on a mixed token of visual and text data, and it now supports 201 languages, expanding the vocabulary size from 150,000 to 250,000, which can enhance encoding efficiency for less common languages by up to 60% [2]. - The Qwen App has launched the world's first consumer-grade AI shopping agent, which completed 120 million orders in just six days during the Spring Festival [2]. Group 3: Accessibility and Future Developments - The Qwen3.5-Plus model is immediately accessible on the Qwen App and PC, with developers able to download it from the Mota community and HuggingFace, or obtain API services directly from Alibaba Cloud [3]. - Alibaba plans to continue releasing different sizes and functionalities of the Qwen3.5 series models, with a more powerful flagship model, Qwen3.5-Max, expected to be released soon [3].
最强开源大模型除夕登场!397B参数千问3.5超越Gemini 3,百万Tokens低至8毛
量子位· 2026-02-16 11:00
Core Viewpoint - Alibaba's new AI model Qwen3.5-Plus has been released, claiming the title of the strongest open-source model, outperforming many closed-source models in various benchmarks [1][3]. Performance and Features - Qwen3.5-Plus has 397 billion parameters, with only 17 billion activated during inference, yet it outperforms the trillion-parameter Qwen3-Max [4]. - The model reduces deployment memory usage by 60% and increases maximum inference throughput by up to 19 times, significantly optimizing deployment costs and efficiency [5][60]. - Qwen3.5-Plus achieves state-of-the-art performance across multiple dimensions, including reasoning and programming, with a score of 87.8 on the MMLU-Pro test, surpassing GPT-5.2 [17]. Accessibility and Pricing - The API pricing for Qwen3.5 is highly competitive, with input costs as low as 0.8 yuan per million tokens, which is 1/18 of the cost of similar models like Gemini-3-Pro [9]. - The model supports 201 languages, expanding its vocabulary from 150k to 250k, and improves encoding efficiency for less common languages by 60% [9]. Technological Innovations - Qwen3.5-Plus incorporates several key technological advancements, including a mixed attention mechanism that dynamically allocates computational resources based on the importance of information [53]. - The model employs a sparse MoE architecture, activating only 17 billion parameters during inference, which significantly reduces computational costs while retaining knowledge advantages [55]. - A native multi-token prediction mechanism allows for batch output, nearly doubling inference speed compared to traditional models [56]. Multi-Modal Capabilities - Qwen3.5-Plus is designed for native multi-modal understanding, processing text and visual data simultaneously without the need for separate alignment networks [64]. - The model can handle long video inputs of up to 2 hours, enabling precise analysis and summarization of lengthy content [26]. Market Position and Impact - Since its inception, Alibaba has open-sourced over 400 models, achieving over 1 billion downloads globally, and establishing itself as a leader in the AI model space [71][72]. - The competitive pricing and open-source nature of Qwen3.5-Plus aim to democratize access to advanced AI technologies, similar to the paths taken by Linux and Android in their respective domains [73].
阿里除夕开源“王炸”千问 3.5-Plus ,性能媲美Gemini 3 Pro、Claude 4.5 Opus,百万 Token 8毛钱
AI前线· 2026-02-16 10:45
整理|冬梅 除夕当天,阿里巴巴低调但密集地抛出了一枚重磅"技术炸弹"——全新一代大模型 Qwen 3.5-Plus 正 式开源。 GitHub : https://github.com/QwenLM/Qwen3.5 API : https://modelstudio.console.alibabacloud.com/ap-southeast-1 /?tab=doc#/doc/? type=model&url=2840914_2&modelId=group-qwen3.5-plus Hugging Face : https://huggingface.co/collections/Qwen/qwen35 ModelScope : https://modelscope.cn/collections/Qwen/Qwen35 官方给出的定位非常直接:性能对标 Gemini 3 Pro,并在多个关键基准中实现超越;而在成本侧, 千问 3.5-Plus 的 API 价格低至每百万 Token 0.8 元人民币,仅为 Gemini 3 Pro 的 1/18。 在当前大模型进入"性能趋同、成本博弈"的阶段,这一组合几乎精准击 ...
阿里发布千问3.5
财联社· 2026-02-16 10:43
阿里今天下午在chat.qwen.ai页面上线了Qwen3.5-Plus和Qwen3.5-397B-A17B两款新模型。 Qwen3.5-Plus定位为Qwen3.5系列最新大语言模型,Qwen3.5-397B-A17B定位则是Qwen3.5开源系列旗舰大语言模型。两款模型均支持文本和多模态 任务。 ...
阿里除夕发布千问3.5,性能媲美Gemini 3,价更低
Nan Fang Du Shi Bao· 2026-02-16 10:16
Core Insights - Alibaba has launched the Qwen3.5-Plus model, which is claimed to rival Gemini 3 Pro, marking it as the strongest open-source model globally [1][3] - The Qwen3.5-Plus model features a total of 397 billion parameters, with only 17 billion activated, achieving superior performance with significantly reduced memory usage and enhanced inference efficiency [1][4] - The model has transitioned from a pure text model to a native multimodal model, incorporating visual and text mixed tokens for training, which has improved its reasoning capabilities and knowledge acquisition [1][3] Performance and Efficiency - Qwen3.5-Plus has demonstrated exceptional performance in various multimodal reasoning tasks, achieving top scores in assessments such as MathVision, VQA, and video understanding [3][4] - The model's inference throughput can be increased by up to 19 times in long-context scenarios, showcasing a substantial improvement in efficiency [4] - Innovations in the underlying architecture, including a self-developed gating technology and a combination of linear attention mechanisms, have contributed to the model's efficiency and performance [3][4] Market Context - The launch of Qwen3.5-Plus coincides with a wave of new releases from domestic AI models, including ByteDance's Doubao 2.0 and MiniMax M2.5, indicating a competitive landscape in the AI model sector [5] - The advancements in Qwen3.5-Plus are expected to enhance its application in various domains, including mobile and PC environments, improving operational efficiency for users [4]
阿里正式发布新一代基模千问3.5
新华网财经· 2026-02-16 10:06
Group 1 - Alibaba has launched a new generation large model, Qwen3.5-Plus, which features an innovative underlying model architecture and has a total parameter count of 397 billion, with only 17 billion activated. Its performance surpasses the trillion-parameter Qwen3-Max model, with a 60% reduction in deployment memory usage and a maximum inference throughput improvement of up to 19 times [2] - The API pricing for Qwen3.5-Plus is set at 0.8 yuan per million tokens, making it a cost-effective option for developers [2] - The Qwen3.5-Plus model has been integrated into the Qwen app and PC platform, and developers can access it through the Mota community and HuggingFace, or directly via Alibaba Cloud's API service [2] Group 2 - Qwen3.5-Plus is positioned as the latest large language model in the Qwen3.5 series, supporting both text and multimodal tasks [5] - The Qwen3.5-397B-A17B is identified as the flagship large language model of the Qwen3.5 open-source series, also supporting text and multimodal tasks [6] - Both models were quietly launched on the chat.qwen.ai page, indicating a strategic move to enhance Alibaba's offerings in the AI space [2]
阿里发布新一代基模千问3.5
Xin Lang Cai Jing· 2026-02-16 09:53
2月16日除夕当天,阿里巴巴开源全新一代大模型千问Qwen3.5-Plus。 此次发布的Qwen3.5-Plus版本总参数为3970亿,激活仅170亿,以小胜大,性能超过万亿参数的Qwen3- Max模型,部署显存占用降低60%,推理效率大幅提升,最大推理吞吐量可提升至19倍。Qwen3.5-Plus 的API价格每百万Token低至0.8元,仅为Gemini 3 pro的1/18。 炒股就看金麒麟分析师研报,权威,专业,及时,全面,助您挖掘潜力主题机会! 据悉,千问APP、PC端已第一时间接入Qwen3.5-Plus模型。开发者可在魔搭社区和HuggingFace下载新 模型,或通过阿里云百炼直接获取API服务。 (来源:智通财经) ...
Qwen3.5-Plus登顶全球最强开源模型
Xin Lang Cai Jing· 2026-02-16 09:53
Core Viewpoint - Alibaba Cloud has launched the new generation open-source model Qwen 3.5-Plus, which is claimed to be the strongest open-source model globally, marking a significant advancement from a pure text model to a native multimodal model [1] Group 1 - Qwen 3.5 has transitioned from pre-training on pure text tokens to pre-training on a mix of visual and text tokens, enhancing its capabilities [1] - The model has significantly increased its dataset, incorporating multilingual, STEM, and reasoning data, which allows it to acquire more comprehensive world knowledge and reasoning logic [1] - Qwen 3.5 achieves top-tier performance with less than 40% of the parameter count compared to the Qwen 3-Max base model, which has over one trillion parameters, excelling in various benchmark evaluations including reasoning, programming, and agent intelligence [1]
Qwen3.5-Plus的API价格每百万Token为0.8元
Jin Rong Jie· 2026-02-16 09:48
Core Insights - Alibaba has released the Qwen3.5-Plus model on New Year's Eve, featuring significant architectural innovations [1] Group 1: Model Specifications - The new version has a total of 397 billion parameters and 17 billion activated parameters, surpassing the performance of Qwen3-Max [1] - Memory usage has decreased by 60%, while inference throughput has increased by 19 times [1] Group 2: Pricing and Accessibility - The API price for Qwen3.5-Plus is set at 0.8 yuan per million tokens [1] - The Qwen app and PC version have already integrated this new model [1]