Workflow
大模型
icon
Search documents
正面硬刚Gemini 3 Pro,阿里开源Qwen3.5-Plus|甲子光年
Sou Hu Cai Jing· 2026-02-16 15:57
Core Insights - Alibaba has officially open-sourced its new foundational model, Qwen3.5-Plus, which boasts 397 billion parameters but only activates 17 billion for inference, challenging existing models like Google's Gemini 3 Pro and OpenAI's GPT-5.2 [2][4] - The model represents a significant shift towards a more efficient architecture, moving away from traditional dense models to a sparse mixture of experts (MoE) approach, which drastically reduces computational resource requirements [5][6] Group 1: Architectural Innovations - Qwen3.5-Plus achieves a balance of performance and efficiency by integrating linear attention mechanisms with sparse MoE architecture, allowing for a significant reduction in memory usage and increased inference speed [6][8] - Compared to its predecessor, Qwen3-Max, Qwen3.5-Plus reduces deployment memory usage by 60% and increases inference throughput by up to 19 times in long-context scenarios [6][8] - The model's ability to dynamically allocate attention resources allows it to focus on important information while reducing computational complexity, enhancing its overall efficiency [8] Group 2: Native Multimodal Capabilities - Qwen3.5-Plus features a native multimodal design that integrates visual and textual data from the pre-training phase, enabling it to perform complex tasks without the typical losses associated with separate modality processing [9][10] - This capability allows the model to execute tasks such as converting sketches into runnable code or providing code fixes based on UI screenshots, marking a significant advancement in AI's practical applications [10][11] - The model's enhanced video understanding capabilities enable it to process long videos for analysis and summarization, showcasing its potential in embodied intelligence applications [12][13] Group 3: Market Impact and Strategy - The aggressive pricing strategy of Qwen3.5-Plus, with API call costs as low as 0.8 RMB per million tokens, positions it as a disruptive force in the global AI market, significantly undercutting competitors [16][17] - Alibaba's open-source model ecosystem has grown to over 400 models, with more than 20,000 derivative models developed by the community, establishing a robust and active foundation for AI development [17] - The model's support for 201 languages and dialects, with a vocabulary expansion from 150,000 to 250,000, enhances its accessibility and efficiency for low-resource languages, further embedding it in emerging markets [17][18] Group 4: Future Implications - Qwen3.5-Plus sets a new benchmark for open-source models, demonstrating that the path to AGI does not solely rely on closed-source solutions, but can also thrive in an open ecosystem [19][20] - The model's release signifies a shift from a parameter race to a competition based on architectural efficiency, emphasizing the importance of cost-effectiveness, transparency, and collaboration in AI development [18][19] - As the model continues to evolve, it is poised to become a preferred choice for enterprise-level localized deployments, marking a significant milestone in the journey towards AGI [21][24]
对话松延动力创始人姜哲源:从亮相春晚到「要规模」
Hua Er Jie Jian Wen· 2026-02-16 13:47
Core Viewpoint - The performance of the bionic humanoid robot at the Spring Festival Gala has provided a significant opportunity for Songyan Power to enhance its brand visibility and accelerate commercialization in the humanoid robot industry [1][20]. Group 1: Company Overview - Songyan Power specializes in bionic humanoid robots and general bipedal humanoid robots, with plans for expansion by 2026 [1]. - The founder, Jiang Zheyuan, expresses cautious optimism about the company's future despite competition from established automotive companies like Tesla and Li Auto [1][8]. - The company aims to differentiate itself by focusing on unique performance areas, such as comedy skits, rather than competing in the saturated market of basic physical movement control [3]. Group 2: Market Strategy - The strategic focus for 2026 is summarized in two keywords: "penetration" and "expansion" [4][24]. - "Penetration" refers to deepening market presence in existing segments, while "expansion" targets untapped markets, particularly in K12 education and consumer-grade robots [4][24]. - The pricing strategy for the humanoid robot "Xiaobumi" is set at around 10,000 yuan, making it accessible for schools that cannot afford more expensive options [5][28]. Group 3: Industry Competition - The humanoid robot market is becoming increasingly competitive, with major automotive companies entering the space, indicating a shift in market dynamics [6][8]. - Tesla plans to repurpose its Fremont factory to produce 1 million units of the Optimus humanoid robot by the end of 2026, leveraging its existing AI technology [7][8]. - The market for humanoid robots is expected to expand beyond exhibitions and education into industrial manufacturing and home services, with UBS predicting a demand for 30,000 units in industrial applications by 2026 [9][10]. Group 4: Challenges and Data Requirements - The industry faces challenges regarding the suitability of humanoid robots for industrial tasks, as many factories are already highly automated [12][13]. - The potential for humanoid robots in household settings is significant, but requires advanced capabilities to navigate diverse and non-standard environments [15][16]. - Data collection remains a critical challenge, as obtaining sufficient and diverse data for training humanoid robots is complicated by privacy concerns [16][18].
千问3.5除夕开源!可通过千问APP免费体验
Xin Lang Cai Jing· 2026-02-16 13:00
Core Insights - Alibaba has launched its new generation large model Qwen3.5-Plus, which reportedly rivals Gemini 3 Pro in performance, featuring a total parameter count of 397 billion and an activation of only 17 billion, achieving superior performance with reduced memory usage and significantly enhanced inference efficiency [1][3] Group 1: Model Performance and Features - Qwen3.5-Plus has achieved a major leap in performance, surpassing the previous Qwen3-Max model, with a maximum inference throughput improvement of up to 19 times [1][3] - The model has transitioned from a pure text model to a native multimodal model, incorporating visual and text mixed tokens for pre-training, which enhances its reasoning logic and world knowledge [1][2] - In various authoritative evaluations, Qwen3.5 has demonstrated superior performance in multimodal reasoning, visual question answering, and video understanding, outperforming previous specialized models [2] Group 2: Technical Innovations - The performance improvements of Qwen3.5 are attributed to significant innovations in the classic Transformer architecture, including the integration of gating technology and a hybrid architecture that combines linear attention mechanisms with sparse mixture of experts (MoE) [3] - The model's training efficiency has been enhanced through advanced techniques, achieving a throughput increase of 8.6 times in common contexts and up to 19 times in ultra-long contexts [3][5] Group 3: Application and Market Impact - Qwen3.5's multimodal training has been efficiently executed on Alibaba Cloud's AI infrastructure, significantly lowering the difficulty threshold for native multimodal training [5] - The model has been integrated into the Qwen App and PC, enabling it to autonomously perform tasks on mobile and desktop platforms, thus enhancing operational efficiency [6] - The Qwen App has successfully executed 120 million orders in just six days during the Spring Festival, marking a significant milestone in real-world task execution and commercialization [6] Group 4: Future Developments - Alibaba plans to continue releasing various sizes and functionalities of the Qwen3.5 series models, with a more powerful flagship model, Qwen3.5-Max, set to be launched soon [7] - The Qwen model family has seen over 400 models open-sourced since 2023, with a global download count exceeding 1 billion, indicating strong developer interest and engagement [6][7]
30亿元砸向春晚,AI巨头在抢什么?
Mei Ri Jing Ji Xin Wen· 2026-02-16 12:07
Core Insights - The 2026 Spring Festival Gala has become a battleground for AI companies, with significant investments from major players like Alibaba, ByteDance, Tencent, and Baidu [1][3][9] - Traditional brands are being challenged as AI companies leverage the high viewership of the gala to promote their services, aiming to integrate AI into everyday life [6][8] Group 1: AI Companies' Investments - Alibaba's Qianwen invested 3 billion yuan in the Spring Festival Gala, becoming the exclusive sponsor for four major regional TV stations [3][7] - ByteDance's Volcano Engine partnered with CCTV as the exclusive AI cloud partner for the gala, focusing on program creation and live interaction [3][4] - Tencent and Baidu launched 1 billion yuan and 500 million yuan red envelope campaigns, respectively, to enhance user engagement with their AI models [1][3] Group 2: AI Technology Applications - Local TV stations are exploring AI applications, with Shandong TV using AIGC technology for virtual stage backgrounds and Beijing TV showcasing service robots in family settings [4][8] - The integration of AI technologies is expected to improve production efficiency and reduce costs for the gala, with examples including the use of XR virtual stages [8][9] Group 3: User Engagement Strategies - AI companies are using the Spring Festival as an opportunity to change user habits, with Qianwen bundling its services with various applications like food delivery and movie ticket purchases [9][10] - The effectiveness of these strategies is evident, as Qianwen reported a 500% increase in orders for movie tickets through its platform during the festival [9][10] Group 4: Challenges and Future Outlook - Despite the high investments, there are concerns about retaining users post-festival, as the initial engagement may not translate into long-term usage [9][10] - The competition among AI models is likened to a marathon, where sustained user engagement and commercial conversion will determine success [10]
除夕开源,阿里发布新一代基础模型千问3.5
Bei Jing Shang Bao· 2026-02-16 11:45
Core Insights - Alibaba has launched its new generation open-source model, Qwen3.5-Plus, which is claimed to rival Gemini 3 Pro, making it the strongest open-source model globally [1] Model Performance - The Qwen3.5-Plus version features a total of 397 billion parameters and 17 billion activated parameters, outperforming the trillion-parameter Qwen3-Max model [1] - The deployment memory usage has been reduced by 60%, and inference efficiency has significantly improved, with maximum inference throughput potentially increasing by up to 19 times [1] Pricing Strategy - The API pricing for Qwen3.5-Plus is set at 0.8 yuan per million tokens, which is only 1/18th of the price of Gemini 3 Pro [1]
千问3.5,除夕开源!
Core Insights - Alibaba has launched the new generation model Qwen3.5-Plus, which performs comparably to Gemini 3 Pro, with plans to release various sizes and functionalities of the Qwen3.5 series models soon [2][6] - The Qwen3.5 model represents a significant leap from previous versions, transitioning from a pure text model to a native multimodal model, enhancing its capabilities in reasoning and knowledge acquisition [4][8] Performance Metrics - Qwen3.5 achieved a score of 87.8 in the MMLU-Pro knowledge reasoning evaluation, surpassing GPT-5.2, and scored 88.4 in the GPQA assessment, exceeding Claude 4.5 [4] - In the IFBench instruction-following evaluation, Qwen3.5 set a record with a score of 76.5, outperforming all other models [4] - The model's performance in various benchmarks, including BFCL-V4 and Browsecomp, also exceeded that of Gemini 3 Pro and GPT-5.2 [4] Technical Innovations - The Qwen3.5 model features a total of 397 billion parameters, with only 17 billion activated, achieving high efficiency while reducing deployment memory usage by 60% [6][8] - Innovations in the Transformer architecture, including self-developed gating technology and a hybrid architecture combining linear attention and sparse mixture of experts (MoE), contribute to the model's efficiency [8][10] Multimodal Capabilities - Qwen3.5 has made significant advancements in visual capabilities, excelling in various evaluations such as MathVision, RealWorldQA, and CC_OCR [6] - The model supports direct input of videos up to 2 hours long, enhancing its ability to analyze and summarize long video content [6] Market Impact - The Qwen3.5-Plus model's API pricing is significantly lower, at 0.8 yuan per million tokens, which is only 1/18 of the cost of Gemini 3 Pro [6] - Since its open-source launch, Alibaba has released over 400 Qwen models, achieving over 1 billion downloads globally, with a monthly download volume surpassing that of the next seven competitors combined [12]
最新!阿里开源新一代大模型
券商中国· 2026-02-16 11:08
Core Viewpoint - Alibaba has launched the new generation large model Qwen3.5-Plus, which features significant architectural innovations and performance improvements compared to previous versions [1]. Group 1: Model Specifications and Performance - The Qwen3.5-Plus model has a total parameter count of 397 billion, with 17 billion activated parameters, outperforming the trillion-parameter Qwen3-Max model [2]. - The deployment memory usage has been reduced by 60%, and the maximum inference throughput can be increased by up to 19 times [2]. - The API pricing for Qwen3.5-Plus is set at 0.8 yuan per million tokens, which is 1/18 of the price of Gemini 3 Pro [2]. Group 2: Features and Capabilities - Qwen3.5-Plus has been pre-trained on a mixed token of visual and text data, and it now supports 201 languages, expanding the vocabulary size from 150,000 to 250,000, which can enhance encoding efficiency for less common languages by up to 60% [2]. - The Qwen App has launched the world's first consumer-grade AI shopping agent, which completed 120 million orders in just six days during the Spring Festival [2]. Group 3: Accessibility and Future Developments - The Qwen3.5-Plus model is immediately accessible on the Qwen App and PC, with developers able to download it from the Mota community and HuggingFace, or obtain API services directly from Alibaba Cloud [3]. - Alibaba plans to continue releasing different sizes and functionalities of the Qwen3.5 series models, with a more powerful flagship model, Qwen3.5-Max, expected to be released soon [3].
最强开源大模型除夕登场!397B参数千问3.5超越Gemini 3,百万Tokens低至8毛
量子位· 2026-02-16 11:00
Core Viewpoint - Alibaba's new AI model Qwen3.5-Plus has been released, claiming the title of the strongest open-source model, outperforming many closed-source models in various benchmarks [1][3]. Performance and Features - Qwen3.5-Plus has 397 billion parameters, with only 17 billion activated during inference, yet it outperforms the trillion-parameter Qwen3-Max [4]. - The model reduces deployment memory usage by 60% and increases maximum inference throughput by up to 19 times, significantly optimizing deployment costs and efficiency [5][60]. - Qwen3.5-Plus achieves state-of-the-art performance across multiple dimensions, including reasoning and programming, with a score of 87.8 on the MMLU-Pro test, surpassing GPT-5.2 [17]. Accessibility and Pricing - The API pricing for Qwen3.5 is highly competitive, with input costs as low as 0.8 yuan per million tokens, which is 1/18 of the cost of similar models like Gemini-3-Pro [9]. - The model supports 201 languages, expanding its vocabulary from 150k to 250k, and improves encoding efficiency for less common languages by 60% [9]. Technological Innovations - Qwen3.5-Plus incorporates several key technological advancements, including a mixed attention mechanism that dynamically allocates computational resources based on the importance of information [53]. - The model employs a sparse MoE architecture, activating only 17 billion parameters during inference, which significantly reduces computational costs while retaining knowledge advantages [55]. - A native multi-token prediction mechanism allows for batch output, nearly doubling inference speed compared to traditional models [56]. Multi-Modal Capabilities - Qwen3.5-Plus is designed for native multi-modal understanding, processing text and visual data simultaneously without the need for separate alignment networks [64]. - The model can handle long video inputs of up to 2 hours, enabling precise analysis and summarization of lengthy content [26]. Market Position and Impact - Since its inception, Alibaba has open-sourced over 400 models, achieving over 1 billion downloads globally, and establishing itself as a leader in the AI model space [71][72]. - The competitive pricing and open-source nature of Qwen3.5-Plus aim to democratize access to advanced AI technologies, similar to the paths taken by Linux and Android in their respective domains [73].
当AI入驻春晚,红包、技术、场景谁能留下用户?
Mei Ri Jing Ji Xin Wen· 2026-02-16 10:55
Core Viewpoint - The 2026 Spring Festival Gala has become a battleground for AI companies, with significant investments and collaborations aimed at leveraging the high viewership and family-oriented consumption during the holiday season [5][10]. Group 1: AI Companies' Investments and Collaborations - Alibaba's Qianwen has invested 3 billion yuan in the Spring Festival Gala, becoming the exclusive sponsor for four major regional TV stations [5][9]. - ByteDance's Volcano Engine has partnered with CCTV as the exclusive AI cloud partner for the 2026 Spring Festival Gala, supporting program creation and live interaction [5][10]. - Tencent and Baidu have launched 1 billion yuan and 500 million yuan red envelope plans, respectively, to enhance access to their large models through their platforms [5]. Group 2: AI Technology Applications in Spring Festival Galas - Local TV stations are actively exploring AI technology, with Shandong TV using AIGC technology for virtual stage backgrounds and Beijing TV focusing on "human-machine symbiosis" themes [6][10]. - Bilibili has partnered with CCTV for the Spring Festival Gala, enhancing its technology investment to improve user experience during live broadcasts [6][8]. Group 3: Changing Role of the Spring Festival Gala - The Spring Festival Gala is evolving from a cultural ceremony to a platform for technology demonstration and commercial transformation, providing local TV stations with opportunities to innovate [10]. - AI technologies are being integrated into the gala to enhance production efficiency and reduce costs, with examples including the use of XR virtual stages [10]. Group 4: User Engagement and Retention Challenges - AI companies are utilizing the Spring Festival to attract new users, but there are concerns about user retention post-holiday, as many may only engage for the red envelopes [11][12]. - Qianwen aims to change user habits by bundling its services with various scenarios, such as ordering New Year's Eve dinners and movie tickets, leading to a significant increase in orders during the festival [11][12].
除夕迎「源神」?Qwen3.5以小胜大,捅破性价比天花板,大模型竞赛下半场开始了
机器之心· 2026-02-16 10:09
Core Viewpoint - The article highlights the launch of Qwen3.5-Plus, emphasizing its dual strengths of being both powerful and cost-effective, marking a significant advancement in the open-source AI model landscape [3][8]. Group 1: Model Performance - Qwen3.5-Plus has achieved top performance in various core capabilities such as multimodal understanding, complex reasoning, programming, and agent intelligence, surpassing many leading closed-source models like GPT-5.2 and Gemini-3-pro [3][8]. - The model operates with 397 billion parameters, significantly fewer than its predecessor Qwen3-Max, yet it outperforms it, demonstrating a new paradigm of efficiency in AI model design [7][16]. Group 2: Cost Efficiency - The pricing of Qwen3.5-Plus is notably low at 0.8 yuan per million tokens, making it 18 times cheaper than its competitor Gemini-3-pro, which reflects a strategic pricing model driven by technological advancements rather than cost-cutting [7][8]. - The deployment costs for Qwen3.5-Plus are reduced by 60%, and its inference throughput has increased by 19 times, showcasing its efficiency and affordability [7][17]. Group 3: Technological Innovations - Qwen3.5-Plus incorporates several architectural innovations, including a hybrid attention mechanism that optimizes resource allocation based on information weight, leading to improved precision and efficiency [18]. - The model employs a sparse MoE (Mixture of Experts) architecture, activating only 17 billion parameters during inference, which allows it to utilize less than 5% of its computational power while accessing a vast knowledge base [18]. - It features native multimodal capabilities, integrating text and visual data from the outset, which enhances its understanding and reduces information loss during processing [21][22]. Group 4: Market Impact - The introduction of Qwen3.5-Plus signifies a shift in the AI landscape, where the focus is not solely on the most powerful models but on making advanced AI capabilities accessible and usable for a broader audience [25][26]. - The model's release is expected to lower barriers for businesses looking to adopt AI technologies, potentially transforming them into foundational tools within various industries [25][26].