大模型
Search documents
30亿元砸向春晚,AI巨头在抢什么?
Mei Ri Jing Ji Xin Wen· 2026-02-16 12:07
Core Insights - The 2026 Spring Festival Gala has become a battleground for AI companies, with significant investments from major players like Alibaba, ByteDance, Tencent, and Baidu [1][3][9] - Traditional brands are being challenged as AI companies leverage the high viewership of the gala to promote their services, aiming to integrate AI into everyday life [6][8] Group 1: AI Companies' Investments - Alibaba's Qianwen invested 3 billion yuan in the Spring Festival Gala, becoming the exclusive sponsor for four major regional TV stations [3][7] - ByteDance's Volcano Engine partnered with CCTV as the exclusive AI cloud partner for the gala, focusing on program creation and live interaction [3][4] - Tencent and Baidu launched 1 billion yuan and 500 million yuan red envelope campaigns, respectively, to enhance user engagement with their AI models [1][3] Group 2: AI Technology Applications - Local TV stations are exploring AI applications, with Shandong TV using AIGC technology for virtual stage backgrounds and Beijing TV showcasing service robots in family settings [4][8] - The integration of AI technologies is expected to improve production efficiency and reduce costs for the gala, with examples including the use of XR virtual stages [8][9] Group 3: User Engagement Strategies - AI companies are using the Spring Festival as an opportunity to change user habits, with Qianwen bundling its services with various applications like food delivery and movie ticket purchases [9][10] - The effectiveness of these strategies is evident, as Qianwen reported a 500% increase in orders for movie tickets through its platform during the festival [9][10] Group 4: Challenges and Future Outlook - Despite the high investments, there are concerns about retaining users post-festival, as the initial engagement may not translate into long-term usage [9][10] - The competition among AI models is likened to a marathon, where sustained user engagement and commercial conversion will determine success [10]
除夕开源,阿里发布新一代基础模型千问3.5
Bei Jing Shang Bao· 2026-02-16 11:45
Core Insights - Alibaba has launched its new generation open-source model, Qwen3.5-Plus, which is claimed to rival Gemini 3 Pro, making it the strongest open-source model globally [1] Model Performance - The Qwen3.5-Plus version features a total of 397 billion parameters and 17 billion activated parameters, outperforming the trillion-parameter Qwen3-Max model [1] - The deployment memory usage has been reduced by 60%, and inference efficiency has significantly improved, with maximum inference throughput potentially increasing by up to 19 times [1] Pricing Strategy - The API pricing for Qwen3.5-Plus is set at 0.8 yuan per million tokens, which is only 1/18th of the price of Gemini 3 Pro [1]
千问3.5,除夕开源!
Shang Hai Zheng Quan Bao· 2026-02-16 11:08
Core Insights - Alibaba has launched the new generation model Qwen3.5-Plus, which performs comparably to Gemini 3 Pro, with plans to release various sizes and functionalities of the Qwen3.5 series models soon [2][6] - The Qwen3.5 model represents a significant leap from previous versions, transitioning from a pure text model to a native multimodal model, enhancing its capabilities in reasoning and knowledge acquisition [4][8] Performance Metrics - Qwen3.5 achieved a score of 87.8 in the MMLU-Pro knowledge reasoning evaluation, surpassing GPT-5.2, and scored 88.4 in the GPQA assessment, exceeding Claude 4.5 [4] - In the IFBench instruction-following evaluation, Qwen3.5 set a record with a score of 76.5, outperforming all other models [4] - The model's performance in various benchmarks, including BFCL-V4 and Browsecomp, also exceeded that of Gemini 3 Pro and GPT-5.2 [4] Technical Innovations - The Qwen3.5 model features a total of 397 billion parameters, with only 17 billion activated, achieving high efficiency while reducing deployment memory usage by 60% [6][8] - Innovations in the Transformer architecture, including self-developed gating technology and a hybrid architecture combining linear attention and sparse mixture of experts (MoE), contribute to the model's efficiency [8][10] Multimodal Capabilities - Qwen3.5 has made significant advancements in visual capabilities, excelling in various evaluations such as MathVision, RealWorldQA, and CC_OCR [6] - The model supports direct input of videos up to 2 hours long, enhancing its ability to analyze and summarize long video content [6] Market Impact - The Qwen3.5-Plus model's API pricing is significantly lower, at 0.8 yuan per million tokens, which is only 1/18 of the cost of Gemini 3 Pro [6] - Since its open-source launch, Alibaba has released over 400 Qwen models, achieving over 1 billion downloads globally, with a monthly download volume surpassing that of the next seven competitors combined [12]
最新!阿里开源新一代大模型
券商中国· 2026-02-16 11:08
Core Viewpoint - Alibaba has launched the new generation large model Qwen3.5-Plus, which features significant architectural innovations and performance improvements compared to previous versions [1]. Group 1: Model Specifications and Performance - The Qwen3.5-Plus model has a total parameter count of 397 billion, with 17 billion activated parameters, outperforming the trillion-parameter Qwen3-Max model [2]. - The deployment memory usage has been reduced by 60%, and the maximum inference throughput can be increased by up to 19 times [2]. - The API pricing for Qwen3.5-Plus is set at 0.8 yuan per million tokens, which is 1/18 of the price of Gemini 3 Pro [2]. Group 2: Features and Capabilities - Qwen3.5-Plus has been pre-trained on a mixed token of visual and text data, and it now supports 201 languages, expanding the vocabulary size from 150,000 to 250,000, which can enhance encoding efficiency for less common languages by up to 60% [2]. - The Qwen App has launched the world's first consumer-grade AI shopping agent, which completed 120 million orders in just six days during the Spring Festival [2]. Group 3: Accessibility and Future Developments - The Qwen3.5-Plus model is immediately accessible on the Qwen App and PC, with developers able to download it from the Mota community and HuggingFace, or obtain API services directly from Alibaba Cloud [3]. - Alibaba plans to continue releasing different sizes and functionalities of the Qwen3.5 series models, with a more powerful flagship model, Qwen3.5-Max, expected to be released soon [3].
最强开源大模型除夕登场!397B参数千问3.5超越Gemini 3,百万Tokens低至8毛
量子位· 2026-02-16 11:00
Core Viewpoint - Alibaba's new AI model Qwen3.5-Plus has been released, claiming the title of the strongest open-source model, outperforming many closed-source models in various benchmarks [1][3]. Performance and Features - Qwen3.5-Plus has 397 billion parameters, with only 17 billion activated during inference, yet it outperforms the trillion-parameter Qwen3-Max [4]. - The model reduces deployment memory usage by 60% and increases maximum inference throughput by up to 19 times, significantly optimizing deployment costs and efficiency [5][60]. - Qwen3.5-Plus achieves state-of-the-art performance across multiple dimensions, including reasoning and programming, with a score of 87.8 on the MMLU-Pro test, surpassing GPT-5.2 [17]. Accessibility and Pricing - The API pricing for Qwen3.5 is highly competitive, with input costs as low as 0.8 yuan per million tokens, which is 1/18 of the cost of similar models like Gemini-3-Pro [9]. - The model supports 201 languages, expanding its vocabulary from 150k to 250k, and improves encoding efficiency for less common languages by 60% [9]. Technological Innovations - Qwen3.5-Plus incorporates several key technological advancements, including a mixed attention mechanism that dynamically allocates computational resources based on the importance of information [53]. - The model employs a sparse MoE architecture, activating only 17 billion parameters during inference, which significantly reduces computational costs while retaining knowledge advantages [55]. - A native multi-token prediction mechanism allows for batch output, nearly doubling inference speed compared to traditional models [56]. Multi-Modal Capabilities - Qwen3.5-Plus is designed for native multi-modal understanding, processing text and visual data simultaneously without the need for separate alignment networks [64]. - The model can handle long video inputs of up to 2 hours, enabling precise analysis and summarization of lengthy content [26]. Market Position and Impact - Since its inception, Alibaba has open-sourced over 400 models, achieving over 1 billion downloads globally, and establishing itself as a leader in the AI model space [71][72]. - The competitive pricing and open-source nature of Qwen3.5-Plus aim to democratize access to advanced AI technologies, similar to the paths taken by Linux and Android in their respective domains [73].
当AI入驻春晚,红包、技术、场景谁能留下用户?
Mei Ri Jing Ji Xin Wen· 2026-02-16 10:55
Core Viewpoint - The 2026 Spring Festival Gala has become a battleground for AI companies, with significant investments and collaborations aimed at leveraging the high viewership and family-oriented consumption during the holiday season [5][10]. Group 1: AI Companies' Investments and Collaborations - Alibaba's Qianwen has invested 3 billion yuan in the Spring Festival Gala, becoming the exclusive sponsor for four major regional TV stations [5][9]. - ByteDance's Volcano Engine has partnered with CCTV as the exclusive AI cloud partner for the 2026 Spring Festival Gala, supporting program creation and live interaction [5][10]. - Tencent and Baidu have launched 1 billion yuan and 500 million yuan red envelope plans, respectively, to enhance access to their large models through their platforms [5]. Group 2: AI Technology Applications in Spring Festival Galas - Local TV stations are actively exploring AI technology, with Shandong TV using AIGC technology for virtual stage backgrounds and Beijing TV focusing on "human-machine symbiosis" themes [6][10]. - Bilibili has partnered with CCTV for the Spring Festival Gala, enhancing its technology investment to improve user experience during live broadcasts [6][8]. Group 3: Changing Role of the Spring Festival Gala - The Spring Festival Gala is evolving from a cultural ceremony to a platform for technology demonstration and commercial transformation, providing local TV stations with opportunities to innovate [10]. - AI technologies are being integrated into the gala to enhance production efficiency and reduce costs, with examples including the use of XR virtual stages [10]. Group 4: User Engagement and Retention Challenges - AI companies are utilizing the Spring Festival to attract new users, but there are concerns about user retention post-holiday, as many may only engage for the red envelopes [11][12]. - Qianwen aims to change user habits by bundling its services with various scenarios, such as ordering New Year's Eve dinners and movie tickets, leading to a significant increase in orders during the festival [11][12].
除夕迎「源神」?Qwen3.5以小胜大,捅破性价比天花板,大模型竞赛下半场开始了
机器之心· 2026-02-16 10:09
Core Viewpoint - The article highlights the launch of Qwen3.5-Plus, emphasizing its dual strengths of being both powerful and cost-effective, marking a significant advancement in the open-source AI model landscape [3][8]. Group 1: Model Performance - Qwen3.5-Plus has achieved top performance in various core capabilities such as multimodal understanding, complex reasoning, programming, and agent intelligence, surpassing many leading closed-source models like GPT-5.2 and Gemini-3-pro [3][8]. - The model operates with 397 billion parameters, significantly fewer than its predecessor Qwen3-Max, yet it outperforms it, demonstrating a new paradigm of efficiency in AI model design [7][16]. Group 2: Cost Efficiency - The pricing of Qwen3.5-Plus is notably low at 0.8 yuan per million tokens, making it 18 times cheaper than its competitor Gemini-3-pro, which reflects a strategic pricing model driven by technological advancements rather than cost-cutting [7][8]. - The deployment costs for Qwen3.5-Plus are reduced by 60%, and its inference throughput has increased by 19 times, showcasing its efficiency and affordability [7][17]. Group 3: Technological Innovations - Qwen3.5-Plus incorporates several architectural innovations, including a hybrid attention mechanism that optimizes resource allocation based on information weight, leading to improved precision and efficiency [18]. - The model employs a sparse MoE (Mixture of Experts) architecture, activating only 17 billion parameters during inference, which allows it to utilize less than 5% of its computational power while accessing a vast knowledge base [18]. - It features native multimodal capabilities, integrating text and visual data from the outset, which enhances its understanding and reduces information loss during processing [21][22]. Group 4: Market Impact - The introduction of Qwen3.5-Plus signifies a shift in the AI landscape, where the focus is not solely on the most powerful models but on making advanced AI capabilities accessible and usable for a broader audience [25][26]. - The model's release is expected to lower barriers for businesses looking to adopt AI technologies, potentially transforming them into foundational tools within various industries [25][26].
阿里正式发布新一代基模千问3.5
新华网财经· 2026-02-16 10:06
Group 1 - Alibaba has launched a new generation large model, Qwen3.5-Plus, which features an innovative underlying model architecture and has a total parameter count of 397 billion, with only 17 billion activated. Its performance surpasses the trillion-parameter Qwen3-Max model, with a 60% reduction in deployment memory usage and a maximum inference throughput improvement of up to 19 times [2] - The API pricing for Qwen3.5-Plus is set at 0.8 yuan per million tokens, making it a cost-effective option for developers [2] - The Qwen3.5-Plus model has been integrated into the Qwen app and PC platform, and developers can access it through the Mota community and HuggingFace, or directly via Alibaba Cloud's API service [2] Group 2 - Qwen3.5-Plus is positioned as the latest large language model in the Qwen3.5 series, supporting both text and multimodal tasks [5] - The Qwen3.5-397B-A17B is identified as the flagship large language model of the Qwen3.5 open-source series, also supporting text and multimodal tasks [6] - Both models were quietly launched on the chat.qwen.ai page, indicating a strategic move to enhance Alibaba's offerings in the AI space [2]
阿里发布新一代基模千问3.5
Xin Lang Cai Jing· 2026-02-16 09:53
2月16日除夕当天,阿里巴巴开源全新一代大模型千问Qwen3.5-Plus。 此次发布的Qwen3.5-Plus版本总参数为3970亿,激活仅170亿,以小胜大,性能超过万亿参数的Qwen3- Max模型,部署显存占用降低60%,推理效率大幅提升,最大推理吞吐量可提升至19倍。Qwen3.5-Plus 的API价格每百万Token低至0.8元,仅为Gemini 3 pro的1/18。 炒股就看金麒麟分析师研报,权威,专业,及时,全面,助您挖掘潜力主题机会! 据悉,千问APP、PC端已第一时间接入Qwen3.5-Plus模型。开发者可在魔搭社区和HuggingFace下载新 模型,或通过阿里云百炼直接获取API服务。 (来源:智通财经) ...
Qwen3.5-Plus登顶全球最强开源模型
Xin Lang Cai Jing· 2026-02-16 09:53
Core Viewpoint - Alibaba Cloud has launched the new generation open-source model Qwen 3.5-Plus, which is claimed to be the strongest open-source model globally, marking a significant advancement from a pure text model to a native multimodal model [1] Group 1 - Qwen 3.5 has transitioned from pre-training on pure text tokens to pre-training on a mix of visual and text tokens, enhancing its capabilities [1] - The model has significantly increased its dataset, incorporating multilingual, STEM, and reasoning data, which allows it to acquire more comprehensive world knowledge and reasoning logic [1] - Qwen 3.5 achieves top-tier performance with less than 40% of the parameter count compared to the Qwen 3-Max base model, which has over one trillion parameters, excelling in various benchmark evaluations including reasoning, programming, and agent intelligence [1]