Qwen Series Large Models
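Alibaba's Qianwen App Hit by a Traffic Surge at Launch; Official Response: "Running fine, come and ask"
Jin Shi Shu Ju · 2025-11-17 06:08
The Qianwen app is built on the Qwen family of large models. Since going fully open source in 2023, Qwen has outperformed international open-source models such as Llama and DeepSeek, and its global downloads have passed 600 million. Airbnb CEO Brian Chesky has said that his company relies heavily on Qwen, which he considers faster and more efficient than OpenAI's models. NVIDIA CEO Jensen Huang has also noted that Qwen holds a significant share of the global open-source model market and is still expanding. Earlier this year Alibaba announced a 380 billion yuan investment in AI infrastructure, and it plans to commit even more. At the Apsara Conference (云栖大会) on September 24, Alibaba released its flagship Tongyi model Qwen3-Max and its next-generation foundation model architecture Qwen3-Next; the Qwen3-Max-Instruct preview ranked third on the LMArena text leaderboard, ahead of GPT-5-Chat. Alibaba also announced a partnership with NVIDIA on Physical AI to provide enterprise users with full-pipeline platform services. On November 17, Alibaba announced that its personal AI assistant, the Qianwen app, had entered public beta and is free for users. The app is built on Qwen3, billed as the world's best-performing open-source model, and is positioned as a personal AI assistant that can both "chat" and "get things done." Alibaba plans to connect everyday scenarios such as maps, food delivery, ticket booking, office work, study, shopping, and health to Qianwen ...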
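Surpassing GPT-4o! Chinese Team's New Framework Lifts Qwen's Cross-Domain Reasoning by 10% and Sets New Marks on 12 Benchmarks
QbitAI (量子位) · 2025-06-04 00:17
Contributed by the General-Reasoner team. A new reinforcement learning method gives Qwen a large performance boost and overtakes GPT-4o. A team of Chinese researchers from the University of Waterloo in Canada, TikTok Singapore, and M-A-P has proposed a new training framework: General-Reasoner. It raises the cross-domain reasoning accuracy of Qwen series models by nearly 10% and even surpasses GPT-4o on several benchmarks. The figure above shows that General-Reasoner significantly improves the base model's reasoning ability across multiple cross-domain evaluations. Reinforcement learning (RL) is currently regarded as a key means of improving model reasoning. Zero-RL methods, which train the base model directly, have already shown strong results on structured tasks such as mathematics and programming. The problem is that these methods tend to be confined to domains with abundant data and clearly structured answers; in broader domains such as physics, finance, or the humanities and social sciences, models struggle to generalize. Here is how the research team tackles these reasoning challenges. Key innovations over existing methods: current Zero-RL frameworks such as SimpleRL usually focus on single-domain data and rely on simple rule-based answer verification, which has the following shortcomings. Narrow data: mostly math competition or coding tasks, so generalization is limited. Rigid verification: only clearly structured answers are recognized, and diverse answers cannot be handled flexibly ...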
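To make the "rigid verification" point concrete, here is a minimal sketch of the kind of rule-based answer check that Zero-RL pipelines are described as using. It is not taken from SimpleRL or General-Reasoner: the function names, the \boxed{...} answer convention, and the normalization rule are all assumptions chosen for illustration.

```python
import re
from typing import Optional


def extract_boxed(response: str) -> Optional[str]:
    """Return the content of the last \\boxed{...} in a response, or None.

    Assumption: the model is prompted to put its final answer in \\boxed{}.
    """
    matches = re.findall(r"\\boxed\{([^{}]*)\}", response)
    return matches[-1] if matches else None


def normalize(text: str) -> str:
    """Hypothetical normalization: lowercase and drop all whitespace."""
    return "".join(text.lower().split())


def rule_based_reward(response: str, reference: str) -> float:
    """Reward 1.0 only when the extracted answer exactly matches the reference."""
    answer = extract_boxed(response)
    if answer is None:
        return 0.0  # no structured answer found, so nothing can be verified
    return 1.0 if normalize(answer) == normalize(reference) else 0.0


if __name__ == "__main__":
    reference = "0.5"
    print(rule_based_reward(r"So the result is \boxed{0.5}", reference))  # exact match
    print(rule_based_reward(r"So the result is \boxed{1/2}", reference))  # equivalent, different form
    print(rule_based_reward("The answer is one half.", reference))        # free-form answer
```

Only the first response is rewarded. The other two are correct but expressed differently, which is the limitation the snippet attributes to rule-based verification outside math and code.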
TMT Industry Monthly Report: Alibaba Expands AI Investment; VLA Model May Reshape the Competitive Landscape in Intelligent Driving
HONGTA SECURITIES· 2025-03-06 12:12
Investment Rating
- The investment rating for the communication industry is "Outperform the Market" [1].
Core Insights
- The report highlights significant investments in AI infrastructure by leading companies, with Alibaba announcing a plan to invest 380 billion yuan (approximately 54.5 billion USD) over the next three years, which surpasses its total investment in the past decade [20][24].
- The AI computing power demand is rapidly increasing, with the domestic AI computing scale expected to reach 725.3 EFLOPS in 2024, a year-on-year growth of 74.1%, and projected to reach 2781.9 EFLOPS by 2028 [21][24].
- The report discusses the emergence of the Vision-Language-Action (VLA) model in the autonomous driving sector, which integrates visual input, language reasoning, and action output into a single framework, enhancing the performance of intelligent driving systems [26][30].
Summary by Sections
1. Market Review
- From February 5 to February 28, 2025, the CSI 300 index rose by 1.91%, with the communication industry also increasing by 1.91%, while the computer industry surged by 16.31% [6][13].
- The communication sector experienced significant volatility, benefiting from operators' increased investment in computing power, leading to strong stock performance for companies like China Unicom and China Telecom [6][13].
2. Communication Industry
- Major companies are expanding their AI investments, with Tencent, Baidu, and Alibaba expected to increase their capital expenditures by 19.1% in 2025, reaching 15.42 billion USD [20][24].
- The report notes that the construction of intelligent computing centers is set to accelerate, with over 458 projects announced in the public bidding market for 2024 [24][25].
3. Computer Industry
- The VLA model represents a new direction in autonomous driving technology, improving the ability to process complex traffic scenarios and enhancing decision-making capabilities [26][30].
- The global autonomous driving market is projected to grow from 207.4 billion USD in 2024 to 273.8 billion USD in 2025, with the Chinese market expected to reach 399.3 billion yuan in 2024 [31][32].
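The growth figures quoted in this summary can be cross-checked with simple arithmetic. The sketch below uses only numbers stated above; the helper name cagr and the variable names are illustrative assumptions, and the derived values (implied 2023 base, compound annual growth rate) are back-of-the-envelope calculations, not figures from the report.

```python
def cagr(start: float, end: float, years: int) -> float:
    """Compound annual growth rate between two values `years` apart."""
    return (end / start) ** (1 / years) - 1


# Domestic AI computing scale in EFLOPS, as quoted in the summary.
compute_2024 = 725.3
compute_2028 = 2781.9
yoy_growth_2024 = 0.741  # 74.1% year-on-year growth reported for 2024

# The 2023 base implied by the 2024 level and its year-on-year growth.
implied_2023 = compute_2024 / (1 + yoy_growth_2024)

# Average annual growth needed to get from the 2024 level to the 2028 projection.
compute_cagr = cagr(compute_2024, compute_2028, 2028 - 2024)

# Global autonomous driving market, billion USD, 2024 to 2025 (one year).
market_growth = cagr(207.4, 273.8, 1)

if __name__ == "__main__":
    print(f"Implied 2023 computing scale: {implied_2023:.1f} EFLOPS")
    print(f"Implied 2024-2028 compute CAGR: {compute_cagr:.1%}")
    print(f"Implied 2024-2025 market growth: {market_growth:.1%}")
```

Under these inputs, the 2028 projection implies average growth of roughly 40% per year, noticeably slower than the 74.1% reported for 2024, while the autonomous driving market figures imply about 32% growth in a single year.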