BABA(BABA)
Search documents
中文大模型基准测评2025年年度报告:2026开年特别版:含1月底重磅模型动态评测
SuperCLUE团队· 2026-02-05 02:00
Investment Rating - The report does not explicitly provide an investment rating for the industry or companies involved. Core Insights - The report highlights significant advancements in Chinese large models and AI agents, marking a transition from "following" to "keeping pace" with global leaders in AI technology [14][24]. - The competitive landscape shows a clear distinction between domestic and international models, with domestic open-source models gaining substantial ground [23][47]. - The report emphasizes the importance of multi-modal capabilities and the emergence of AI agents in practical applications, particularly in programming and task planning [16][14]. Summary by Sections 1. Key Developments in 2025 - The report outlines three major phases of AI model evolution: the initial competition among models, the explosion of multi-modal capabilities, and the rise of AI agents [14][16]. - Notable models such as Kimi-K2.5-Thinking and Qwen3-Max-Thinking have emerged as leaders in specific tasks like code generation and mathematical reasoning [18][24]. 2. Annual Evaluation Results and Analysis - The 2025 annual evaluation ranks Claude-Opus-4.5-Reasoning as the top model globally, followed by Gemini-3-Pro-Preview and GPT-5.2(high) [23][45]. - Domestic models like Kimi-K2.5-Thinking and Qwen3-Max-Thinking are positioned fourth and sixth, indicating a strong competitive stance [23][45]. - The report notes that domestic models are rapidly closing the gap with international counterparts, particularly in code generation and reasoning tasks [24][48]. 3. SuperCLUE Model Quadrant and Capability Landscape - The report presents a model quadrant that categorizes models based on their capabilities in reasoning and application, highlighting the emergence of "technical leaders" and "practical leaders" in the domestic market [38][39]. - The capability landscape indicates that while domestic models excel in certain areas, they still face challenges in hallucination control and precise instruction adherence [42][48]. 4. Comparative Analysis of Domestic and International Models - The analysis reveals that closed-source models dominate the top rankings, with significant advantages in reasoning and instruction-following tasks [74][80]. - Domestic open-source models are noted for their rapid advancements, particularly in coding tasks, where they have begun to outperform some international models [56][84]. - The report emphasizes the structural differences between domestic and international models, with domestic models showing a strong trend towards open-source development [24][47].
“春节大战”在即 马云现身阿里千问春节项目组
Di Yi Cai Jing· 2026-02-05 01:58
2月4日晚间,阿里员工在社交媒体爆料,阿里巴巴创始人马云现身杭州阿里总部千问春节项目组。图片 显示,马云在阿里巴巴合伙人邵晓锋陪同下来到千问办公区,一旁已有马年元素的新年立牌,立牌标注 有"千问C端事业群"。 此前,千问透露,在独家冠名四家卫视马年春节晚会的同时,会将AI生视频、AI识图、AI问答等AI创 作能力带进晚会的节目和互动环节,在四台春晚播出期间,千问还将发放专属口令红包,观众可边看春 晚边在APP内抢红包。马云亲自视察的背后,除了"红包大战",这个春节的AI竞争仍在持续加码,如何 花式抢夺这个春节的流量已是AI应用们的一道难题。 伴随着春节临近,"AI大战"正在持续升温。此前,百度、豆包等均启动了春节红包计划,阿里千问也在 2月2日宣布将投入30亿启动"春节请客计划",2月3日,千问APP还宣布将独家冠名四家卫视的马年春节 晚会。马年春节正被AI应用视作大规模抢占C端入口的一个重要节点,随着马云"视察",阿里对这场战 役的重视程度显而易见。 (文章来源:第一财经) "千问C端事业群"成立于2025年12月,阿里整合了原智能信息与智能互联两大业务板块,囊括千问 APP、夸克、AI硬件、UC及书旗等核 ...
“春节大战”在即,马云现身阿里千问春节项目组
Di Yi Cai Jing· 2026-02-05 01:55
阿里正举集团之力打这场"AI时代的超级入口"之战。 阿里正在举集团之力打这场"AI时代的超级入口"之战。谈及千问推广AI购物可能面临的集团内部协同挑 战时,吴嘉在接受包括第一财经在内的媒体采访时表示,千问的红包补贴会和其他业务一起出,对于各 业务分账、在千问app下单对其他业务可能带来的影响等,"现在没算得那么清楚,第一目标是体验好, 让用户用起来。"他透露,各业务和千问会有共同的业务目标,阿里相信,未来AI会带来大量新的生活 服务,绝对不只是存量。 30亿"春节请客计划"是阿里历史上春节活动投入最大的一次,目前,阿里对这项计划的描述是"以免单 形式请全国人民在春节期间吃喝玩乐",但官方尚未透露具体的活动形式。 如何发红包也将是千问进击这个春节的挑战之一。热战正酣,2月4日,微信安全中心公众号发文称,微 信对以春节为主题集中爆发的过度营销、诱导分享等违规行为进行打击,连腾讯自家的元宝也被"封 杀",似乎也正为微信对百度、阿里等的春节红包管控埋下伏笔。 此前,千问透露,在独家冠名四家卫视马年春节晚会的同时,会将AI生视频、AI识图、AI问答等AI创 作能力带进晚会的节目和互动环节,在四台春晚播出期间,千问还将发 ...
花30亿血拼春节,阿里能否“烧”出“超级App”?
Xin Lang Cai Jing· 2026-02-05 01:54
文 | 雷达财经 彭程 编辑 | 孟帅 随着马年春晚的脚步愈发临近,AI行业掀起的"红包大战"愈演愈烈。 不过,目前,千问暂未对外披露具体的活动规则及细节。从宣传海报来看,千问此次发起的"春节请客 计划"并非单一的补贴活动,而是与阿里生态下多个业务深度绑定的场景化服务拓展。 据悉,此次"春节请客计划",千问将联动阿里旗下淘宝、闪购、飞猪、大麦、高德、盒马等众多应用共 同参与。 2月2日,阿里旗下千问高调官宣,其将豪掷30亿元启动"春节请客计划"。而千问这一金额也将一举超越 腾讯元宝、百度文心助手等同行,成为目前马年春节AI"红包大战"的"烧钱王"。 据悉,千问这场规模高达30亿级别的"春节请客计划",并非单纯的现金补贴,而是千问联动阿里旗下全 生态的战略布局,淘宝、闪购、飞猪、大麦、高德、盒马等核心业务悉数参战。 相较于同行——腾讯元宝侧重社交裂变、百度文心助手主打娱乐互动、字节豆包聚焦春晚联动,千问则 走出了一条不一样的营销之路,将生态协同作为核心抓手,打造"AI+实体"服务闭环。 有分析认为,阿里千问烧钱换流量的打法,是其在AI C端市场与字节、腾讯、百度等互联网巨头围绕用 户心智占领和流量入口地位展开的 ...
港股科网股,再度集体下跌
Di Yi Cai Jing Zi Xun· 2026-02-05 01:44
2月5日,恒生指数低开0.82%,恒生科技指数跌1.31%。 百度逆势涨超2%,董事会授权一项总金额不超过50亿美元的股票回购计划。 编辑|钉钉 | 名称 | 现价 | 涨跌 | 涨跌幅 | | --- | --- | --- | --- | | 恒生指数 | 26627.95 | -219.37 | -0.82% | | 恒生科技 | 5295.89 | -70.55 | -1.31% | | 恒生生物科技 | 14995.29 | -160.91 | -1.06% | | 恒生中国企业指数 | 8978.46 | -69.92 | -0.77% | | 恒生综合指数 | 4073.68 | -33.61 | -0.82% | 科网股延续跌势,哔哩哔哩跌逾4%,腾讯音乐、华虹半导体跌逾3%,阿里巴巴、快手、中芯国际、美 团等跌逾2%。 | 名称 | 现价 | 涨跌 | 涨跌幅 - | | --- | --- | --- | --- | | 哔哩哔哩-W | 233.600 | -10.000 | -4.11% | | 腾讯音乐-SW | 62.200 | -2.300 | -3.57% | | 金蝶国际 | ...
基础大模型新技术、新产品密集推出 中国AI产业创新步伐加快
Ke Ji Ri Bao· 2026-02-05 01:16
Core Insights - The AI industry in China is experiencing intensified competition, with major players like Baidu, Alibaba, and DeepSeek rapidly launching new technologies and products to seize innovation leadership in AI [1][4] Group 1: Baidu Developments - Baidu launched the official version of its Wenxin large model 5.0 on January 22, utilizing a native multimodal architecture that supports various forms of input and output, including text, images, audio, and video [1] - The Wenxin model 5.0 has achieved top rankings in both text and visual understanding categories in the LMArena global model competition, placing it among the international elite [1] - Baidu's AI chip brand Kunlun has evolved from specialized to general-purpose, recently initiating an independent listing process to accelerate its multi-domain layout [3] Group 2: Alibaba Innovations - Alibaba's latest Qwen3-Max-Thinking inference model features a new testing time expansion mechanism, enhancing efficiency and cost-effectiveness in reasoning calculations [2] - The company is leveraging its application ecosystem and traffic advantages to integrate the Qwen model into its platforms like Taobao and Alipay, facilitating efficient collaboration between technology and various business scenarios [2] Group 3: DeepSeek Contributions - DeepSeek has focused on open-source advantages, launching the DeepSeek-OCR-2 model, which employs an innovative DeepEncoder V2 method for dynamic image processing, enhancing its logical reasoning capabilities [2] - In response to DeepSeek's advancements, Baidu quickly released and open-sourced its Paddle OCR-VL-1.5 model, introducing a novel "irregular frame positioning" technology for accurate recognition of distorted documents [2] Group 4: Industry Trends - The AI technology landscape in China is transitioning into a new phase of large-scale implementation, with companies demonstrating clear paths of innovation capability upgrades, collectively advancing the industry from "catching up" to "leading" [4]
科技股遭抛售,AMD大跌17%
Mei Ri Jing Ji Xin Wen· 2026-02-05 01:04
美股三大指数周三收盘涨跌不一,道指涨0.53%,纳指跌1.51%,标普500指数跌0.51%,热门科技股普 遍下跌,AMD跌超17%,英伟达、特斯拉、博通、Meta跌超3%。半导体设备与材料、存储概念股、加 密矿企跌幅居前,闪迪跌近16%,美光科技跌超9%。减肥药概念股、住宅地产涨幅居前,礼来涨超 10%,安进涨超8%。纳斯达克中国金龙指数收跌1.95%,热门中概股普跌,哔哩哔哩跌超6%,百度跌 超4%,阿里巴巴。蔚来、小鹏汽车跌超2%,理想汽车、霸王茶姬涨超1%。 0:00 ...
心智观察所:春节里一场不得不打、且必须此刻打响的战役
Guan Cha Zhe Wang· 2026-02-05 00:55
Core Insights - The article discusses the escalating competition among major Chinese tech companies in the AI space, particularly during the 2026 Lunar New Year, with significant financial investments aimed at capturing user engagement and establishing a dominant position in the AI ecosystem [1][2][7]. Group 1: Competitive Landscape - Major players like Alibaba, Tencent, Baidu, and ByteDance are investing heavily in AI initiatives, with Alibaba committing 30 billion yuan, Tencent 10 billion yuan, and Baidu 5 billion yuan for marketing during the Spring Festival, totaling over 45 billion yuan [1][5]. - The competition is not just about financial incentives but about establishing a "super entry point" for how users interact with machines and the digital world [1][2]. Group 2: Strategic Timing and User Engagement - The timing of this competition is critical, as the user base for generative AI in China surpassed 500 million by mid-2025, indicating a shift from niche interest to mainstream usage [5][6]. - The Spring Festival is seen as an optimal time for companies to introduce AI to a broad audience, leveraging the high engagement and willingness to try new technologies during this period [5][6]. Group 3: Differentiated Strategies - Alibaba's approach focuses on integrating its AI assistant "Qianwen" into its extensive commercial ecosystem, aiming to transition from mere conversation to transactional capabilities [8]. - Tencent aims to replicate its past success with WeChat by leveraging social networks to distribute its AI features, emphasizing user experience while evolving its ecosystem [9]. - ByteDance is taking a more aggressive stance by becoming the exclusive AI cloud partner for the CCTV Spring Festival Gala, positioning its AI assistant "Doubao" as a central figure in national interactions [10]. - Baidu's strategy involves embedding its AI features into its existing app, facilitating a seamless transition for users from traditional search to AI-driven interactions [11]. Group 4: Long-term Challenges - The competition is expected to extend beyond initial user acquisition to long-term user retention, with companies needing to ensure that AI becomes an integral part of daily life rather than just a seasonal novelty [12][13]. - The ultimate success will depend on the ability to create sustainable business models that convert user engagement into long-term value, as the industry currently relies heavily on substantial investments [12][13].
春节里一场不得不打、且必须此刻打响的战役
Guan Cha Zhe Wang· 2026-02-05 00:27
【文/观察者网专栏作者 心智观察所】 除夕夜的倒计时还没开始,互联网世界的硝烟已经弥漫至每个家庭的客厅屏幕。 2026年农历新年前夕,一场代价高昂的战役悄然升级。人们熟悉的"红包大战"仍在继续,但舞台中央的 主角,已悄然从移动支付应用,换成了野心勃勃的AI助手。 阿里巴巴旗下的"千问"掷出三十亿启动"春节请客计划",腾讯"元宝"豪撒十亿现金红包,百度"文心"亦 分五亿入局。而字节跳动,则以一种更隐晦却更深入的方式参战——成为央视春晚的独家AI云合作伙 伴。粗略计算,几家巨头为这个春节预备的营销弹药,已经超过四十五亿元。 这场看似熟悉的"撒钱"竞赛,内里却进行着一场彻底的基因置换。巨头们争夺的,早已不是用户钱包里 那个小小的支付通道,甚至不再是某个应用的下载数字。它们真正押注的,是定义下一个时代人们如何 与机器对话、如何与世界交互的"超级入口"权。 中国互联网产业的核心叙事,由此正式从移动互联网的流量内卷,转向AI原生生态的卡位与构建。 临界点:一场不得不打、且必须此刻打响的战役 为什么是2026年?为什么是这个春节?一切偶然的背后,是技术曲线、市场水温与竞争态势交汇成的必 然。 首要的原因是,用户心智的堰塞湖 ...
人类史上最富!马斯克身家超8000亿美元
Zhong Guo Ji Jin Bao· 2026-02-05 00:19
Market Overview - The three major U.S. stock indices closed mixed, with the Dow Jones up by 0.53%, while the S&P 500 and Nasdaq fell by 0.51% and 1.51% respectively [2] - Major technology stocks mostly declined, with the U.S. Tech Giants Index down by 1.32% [4] Notable Stock Movements - Amgen saw a significant increase of over 8%, while Nike rose by more than 5%, leading the Dow [2] - Tesla dropped by 3.78%, Nvidia by 3.41%, and Facebook by 3.28%, leading the declines among major tech stocks [4][5] Hedge Fund Activity - Hedge funds are significantly shorting software stocks, with short sellers profiting $24 billion this year, while the sector's overall market value has shrunk by $1 trillion [9] - The iShares Expanded Tech Software ETF (IGV) has fallen by over 21% year-to-date, with a 30% decline since its peak in September of the previous year [9] Chinese Stocks Performance - Chinese stocks generally fell, with the Nasdaq Golden Dragon China Index down by 1.95% and the Chinese Tech Leaders Index down by 2.80% [7] - Notable declines included NetEase down by 5.41%, Baidu down by 4.77%, and Tencent down by 4.14% [7][8] Elon Musk's Wealth - Elon Musk has become the first person in history to surpass a net worth of $850 billion (approximately 5.92 trillion RMB), primarily due to the acquisition of AI startup xAI by SpaceX, which is valued at $1.25 trillion [6] Gold and Silver Market - Gold futures experienced volatility, initially rising by 3.63% before settling at $4986.40 per ounce, a 1.04% increase [11] - Silver futures surged by 10.46% to $92.02 per ounce before closing at $87.77 per ounce, reflecting a 5.36% increase [12]