千问3.5,除夕开源!
Shang Hai Zheng Quan Bao·2026-02-16 11:08

Core Insights - Alibaba has launched the new generation model Qwen3.5-Plus, which performs comparably to Gemini 3 Pro, with plans to release various sizes and functionalities of the Qwen3.5 series models soon [2][6] - The Qwen3.5 model represents a significant leap from previous versions, transitioning from a pure text model to a native multimodal model, enhancing its capabilities in reasoning and knowledge acquisition [4][8] Performance Metrics - Qwen3.5 achieved a score of 87.8 in the MMLU-Pro knowledge reasoning evaluation, surpassing GPT-5.2, and scored 88.4 in the GPQA assessment, exceeding Claude 4.5 [4] - In the IFBench instruction-following evaluation, Qwen3.5 set a record with a score of 76.5, outperforming all other models [4] - The model's performance in various benchmarks, including BFCL-V4 and Browsecomp, also exceeded that of Gemini 3 Pro and GPT-5.2 [4] Technical Innovations - The Qwen3.5 model features a total of 397 billion parameters, with only 17 billion activated, achieving high efficiency while reducing deployment memory usage by 60% [6][8] - Innovations in the Transformer architecture, including self-developed gating technology and a hybrid architecture combining linear attention and sparse mixture of experts (MoE), contribute to the model's efficiency [8][10] Multimodal Capabilities - Qwen3.5 has made significant advancements in visual capabilities, excelling in various evaluations such as MathVision, RealWorldQA, and CC_OCR [6] - The model supports direct input of videos up to 2 hours long, enhancing its ability to analyze and summarize long video content [6] Market Impact - The Qwen3.5-Plus model's API pricing is significantly lower, at 0.8 yuan per million tokens, which is only 1/18 of the cost of Gemini 3 Pro [6] - Since its open-source launch, Alibaba has released over 400 Qwen models, achieving over 1 billion downloads globally, with a monthly download volume surpassing that of the next seven competitors combined [12]