Qwen 3

Canalys: Mainland China's spending on cloud infrastructure services reached US$11.6 billion in Q1, up 16% year on year
智通财经网· 2025-07-11 02:24
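According to the Zhitong Finance app, the latest Canalys data show that mainland China's spending on cloud infrastructure services reached US$11.6 billion in the first quarter of 2025, up 16% year on year. AI-related demand has become the main driver of enterprise migration to the cloud. To capture this growth opportunity, cloud vendors are stepping up investment in AI infrastructure and model development. To bridge the gap between the capabilities of large AI models and real business needs, vendors are pursuing several strategies, including open-sourcing AI models, expanding partner ecosystems, and launching AI agent development platforms. In its latest Enterprise AI Contracts Database, Omdia records a number of new partnerships between mainland Chinese enterprises and cloud vendors, in which enterprises accelerate AI adoption by deploying pre-trained, ready-to-use models and related services. In the first quarter of 2025, Alibaba Cloud held a 33% share of the mainland China cloud services market, Huawei Cloud 18%, and Tencent Cloud 10%. The mainland China cloud infrastructure services market continued to accelerate in the first quarter of 2025. As enterprises speed up AI deployment, the potential of the cloud services market is being steadily unlocked. The enormous compute demand of large AI models is significantly increasing enterprises' reliance on cloud-based GPU resources. Strong customer demand for AI is reshaping how cloud computing is used. Rachel Brindley, Senior Director at Canalys, said: "AI is driving enterprises to accelerate cloud adoption across the board. On the one hand, organizations that previously relied on on-premises data centers are moving ...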
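Q1 2025: China's cloud infrastructure market growth accelerates; Alibaba Cloud holds firm at No. 1 while Huawei Cloud and Tencent Cloud step up their AI push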
Canalys· 2025-07-11 01:52
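Canalys (now part of Omdia) defines cloud infrastructure services as the sum of the following services hosted by third-party providers and delivered to users over the internet: bare metal as a service (BMaaS), infrastructure as a service (IaaS), platform as a service (PaaS), container as a service (CaaS), and serverless services. Canalys (now part of Omdia) is a leading global independent technology market analyst firm with a channel-centric focus. We are committed to guiding clients on the future of the technology industry and helping them build innovative business models. For 25 years, we have provided technology vendors worldwide with market analysis and customized solutions that combine a global view with a local perspective. Our analysts are experts in their fields who combine market knowledge with client requirements to create tailored research products. Our research covers emerging, enterprise, mobile, and smart technologies. A deep understanding of the channel is the cornerstone of our work. Through professional reports, data, and forecasts, we support our clients' strategic decisions. Our forums and the Candefero online community also give channel partners a valuable platform for interaction and feedback. We rely on high-accuracy, high-quality data, innovative use of technology, and excellent customer service to earn our clients' trust and recognition. According to the latest data from Canalys (now part of Omdia), 2025 ...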
Global Media Focus | US media: China's AI "overtakes on the bend" as the US lead is "in jeopardy"
Sou Hu Cai Jing· 2025-07-03 10:09
A recent report in The Wall Street Journal argues that Chinese AI companies are eroding US dominance in global AI and challenging America's lead.

| Rank | Model (origin) | Score |
| --- | --- | --- |
| #1 | Google Gemini 2.5 Pro (U.S.) | 1,477 |
| #2 | OpenAI ChatGPT 4o (U.S.) | 1,428 |
| Tied #3 | DeepSeek R1-0528 (China) | 1,424 |
| Tied #3 | xAI Grok 3 Preview (U.S.) | 1,422 |
| Tied #9 | Alibaba Qwen 3 (China) | 1,388 |
| Tied #11 | Tencent Hunyuan (China) | 1,376 |
| Tied #11 | MiniMax M1 (China) | 1,373 |
| Tied #13 | Anthropic Opus 4 (U.S.) | 1,373 |
| Tied #13 | Mistral Medium 3 (Europe ... | |
Xiaohongshu open-sources a 142-billion-parameter large model, with some performance on par with Alibaba's Qwen3 models
Tai Mei Ti A P P· 2025-06-10 01:07
Core Insights
- Xiaohongshu has recently open-sourced its first self-developed large model, dots.llm1, through platforms like GitHub and Hugging Face [2][9]
- The model has been trained using 11.2 trillion high-quality tokens, significantly outperforming the open-source TxT360 data [5]
- Xiaohongshu's valuation has surged from $20 billion to $26 billion as of March 2023, surpassing the market values of companies like Bilibili and Zhihu [9]
Model Performance
- dots.llm1 is a mixture-of-experts (MoE) model with 142 billion parameters, activating only 14 billion during inference to reduce costs while maintaining performance [3][5]
- In various benchmarks, dots.llm1 shows competitive performance against Alibaba's Qwen models, particularly excelling in Chinese language tasks [7][8]
- The model achieved a score of 92.6 on CLUEWSC and 92.2 on C-Eval, indicating industry-leading performance in Chinese semantic understanding [7]
Training Efficiency
- The hi lab team has implemented advanced training techniques, achieving a 14% improvement in forward computation and a 6.68% improvement in backward computation compared to NVIDIA's Transformer Engine [5]
- Future plans include integrating more efficient architectural designs and exploring sparse MoE layers to enhance computational efficiency [10]
Strategic Direction
- Xiaohongshu is shifting focus from being merely a content community and live e-commerce platform to actively developing AI technologies, particularly large language models [9][10]
- The company aims to deepen its understanding of optimal training data and explore methods to achieve human-like learning efficiency [11]
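To illustrate why an MoE model can activate only a fraction of its parameters per token, here is a minimal sketch of generic top-k expert routing. It is not dots.llm1's actual architecture: the four toy experts, the router logits, and the function names `top_k_route` and `moe_forward` are all hypothetical, and only the 142 billion / 14 billion figures come from the summary above.

```python
import math

def top_k_route(router_logits, k):
    """Pick the k highest-scoring experts and softmax-normalise their weights."""
    top = sorted(range(len(router_logits)),
                 key=lambda i: router_logits[i], reverse=True)[:k]
    peak = max(router_logits[i] for i in top)  # subtract the max for numerical stability
    exps = {i: math.exp(router_logits[i] - peak) for i in top}
    total = sum(exps.values())
    return {i: e / total for i, e in exps.items()}

def moe_forward(x, experts, router_logits, k):
    """Run only the selected experts and mix their outputs by router weight."""
    weights = top_k_route(router_logits, k)
    return sum(w * experts[i](x) for i, w in weights.items())

# Hypothetical toy setup: four "experts" that just scale their input.
experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0)]
router_logits = [0.1, 2.0, -1.0, 2.0]  # assumed router scores for one token

weights = top_k_route(router_logits, k=2)
y = moe_forward(10.0, experts, router_logits, k=2)

# Figures from the summary: 14 billion of 142 billion parameters are active.
active_fraction = 14 / 142
```

With these toy values only experts 1 and 3 run, so half of the expert computation is skipped. By the same logic, activating 14 billion of 142 billion parameters means roughly 10% of the weights are used per token.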
A new breakthrough in reinforcement learning for large models: the SPO paradigm boosts LLM reasoning!
机器之心· 2025-06-08 08:21
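Reinforcement learning (RL) currently shows great potential for improving the reasoning ability of large language models (LLMs). Models such as DeepSeek R1, Kimi K1.5, and Qwen 3 have demonstrated the effectiveness of RL in strengthening LLMs' complex reasoning. Effective RL, however, must address a fundamental challenge, the credit assignment problem: in the LLM setting, how to attribute the final evaluation of an entire sequence (the LLM's response) to the specific decisions (tokens) within it. The difficulty is that the reward signal is very sparse: clear success-or-failure feedback is available only at the end of the sequence. Current main approaches: in RL, credit assignment is usually handled through advantage estimation. Current RL methods for LLMs fall into two broad classes, which differ in the granularity of advantage estimation. Coarse-grained trajectory-level methods, such as the GRPO used by DeepSeek R1, compute a single advantage value for the whole sequence based only on the final reward. This is efficient, but the feedback signal is too coarse: the LLM cannot be rewarded for the correct parts of a wrong answer, nor penalized for the redundant parts of a correct answer. Paper title: Segment ...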
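The trajectory-level idea can be sketched in a few lines. This is a simplified illustration of GRPO-style group-normalised advantages, not the paper's SPO method and not any particular library's implementation. The function names, the example rewards, and the use of the population standard deviation are assumptions.

```python
from statistics import mean, pstdev

def trajectory_level_advantages(rewards, eps=1e-8):
    """One advantage per sampled response, normalised within the group.

    Assumption: population standard deviation; implementations differ.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]

def broadcast_to_tokens(advantage, num_tokens):
    """Every token in a response receives the same sequence-level advantage."""
    return [advantage] * num_tokens

# Hypothetical group of four responses to one prompt: 1.0 = correct, 0.0 = wrong.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = trajectory_level_advantages(rewards)

# A five-token response inherits a single value for all of its tokens.
per_token = broadcast_to_tokens(advantages[0], num_tokens=5)
```

Because `per_token` holds one repeated value, a correct step inside a wrong answer gets the same negative signal as the mistaken steps. This is the coarseness the article describes.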
Ele.me's new industry battle: evolving into an AI company
雪豹财经社· 2025-05-31 01:00
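Transforming the food-delivery industry with AI. By Hanxing. More than a hundred days have passed since the food-delivery wars reignited. From the standoff between Meituan and JD.com to Ele.me's advance, which turned the contest into a three-way fight, twists have abounded, and the latest entrant has accelerated the fastest. Ele.me, teaming up with Taobao Shangou (淘宝闪购), arrived late but ran faster. In less than a month it reached a daily order volume of more than 40 million, one of the most eye-catching results of the delivery war so far. Behind the quietly reshaped delivery market, an unexpected key player is starting to draw attention: AI. This innovation lever, which can greatly improve the industry's efficiency now and in the future, is becoming the decisive factor in the fight. The "AI content" of food delivery rises sharply. For delivery riders, summer brings the hardest few months of the year: they must rush deliveries in sustained heat and often face device problems caused by the weather. Ele.me rider Huang Xiaoqin found that in hot weather the smartphone tends to lag, because the battery heats up after long exposure to the sun during deliveries. Summer also brings frequent rain, which wets the phone screen, making it hard to operate and interfering with accepting orders, confirming deliveries, and so on. To help riders with these pain points, Ele.me launched an AI assistant, "Xiao E" (小饿), the first rider-side agent in China built on large-model technology. Riders originally had to operate the phone interface themselves, ...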
3 Signs That Alibaba's Turnaround Effort Is Bearing Fruit
The Motley Fool· 2025-05-24 13:15
Core Insights
- Alibaba is undergoing a transformation to regain its market position and enhance shareholder value, with significant leadership changes and a focus on core businesses [1][2][4]
E-commerce Business
- Alibaba's e-commerce segment is showing signs of recovery, with a reported 12% growth in customer management revenue for the quarter ending March 31, up from 9% in the previous quarter and 4% in the fiscal year ending March 31, 2024 [6]
- The international e-commerce business has also seen a 22% growth, indicating diversification and potential for future expansion across various regions and platforms [7]
Cloud Computing Business
- Alibaba Cloud faced challenges in fiscal 2024 with only 3% revenue growth, but has recently rebounded with an 18% increase in revenue to 30 billion yuan, driven by public cloud growth and AI-related revenue [8][9]
- AI-related revenue has experienced triple-digit growth for seven consecutive quarters, reflecting a strong adoption of cloud computing and AI solutions across multiple industries [10]
Shareholder Returns
- In the latest fiscal year, Alibaba repurchased $11.9 billion of its stock and distributed $4.6 billion in dividends, totaling $16.5 billion returned to shareholders [13]
- These actions are aimed at rebuilding investor trust and attracting long-term investment, particularly from Western markets, while signaling the company's strong financial health [14]
Future Outlook
- Alibaba's recent performance indicates that its turnaround efforts are gaining traction, positioning the company favorably for sustained growth in the upcoming quarters [15]
Weekly review of industry trends and hot topics: US tech stocks stage a strong comeback; watch the HarmonyOS PC-20250520
Changjiang Securities· 2025-05-20 15:39
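Securities research report | research.95579.com | Investment strategy | Special report. US tech stocks stage a strong comeback; watch the HarmonyOS PC: weekly review of industry trends and hot topics. Key points: From May 8 to 15, helped by US-China trade talks and a temporary easing of tariffs, the three major US stock indexes rebounded to varying degrees, with the tech sector rising notably and volatility returning to relatively low levels. Industry trends: 1) on May 12, Alibaba released its open-source Qwen 3 model, which can switch freely between two thinking modes; 2) Tesla released a video of its humanoid robot dancing and said it had optimized the robot's "sim-to-real" training code, with training completed through reinforcement learning. Recent hot topics include the US-China talks, Trump's visit to the Middle East, and revisions to the asset restructuring rules. Going forward, watch the upcoming HarmonyOS PC and HarmonyOS operating system, Xiaomi's AI glasses, the "Xuanjie" (玄戒) SoC, the 15S Pro, and more. Analyst and contact: Dai Qing, SAC: S0490524010002, SFC: BTR264 ...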
Alibaba shares drop 4% in premarket trading after big profit miss
CNBC· 2025-05-15 09:51
Core Insights
- Alibaba's shares declined by 4% in premarket trading after missing earnings expectations for its fiscal fourth quarter, with revenue up 7% year-on-year but below analyst estimates [1][6]
Financial Performance
- Revenue for the fiscal fourth quarter was 236.5 billion Chinese yuan ($32.6 billion), slightly below the expected 237.2 billion yuan [6]
- Net income was reported at 12.4 billion yuan, significantly lower than the expected 24.7 billion yuan [6]
Market Conditions
- Investors are concerned about the impact of macroeconomic volatility on consumer sentiment in China, particularly due to the ongoing trade tensions between Washington and Beijing [2]
- Recent agreements to suspend most tariffs on goods between the U.S. and China may influence market conditions [2]
Strategic Initiatives
- Alibaba has extended its partnership with Rednote (Xiaohongshu) to enhance shopping experiences on its Tmall and Taobao platforms by embedding product links in posts [3]
- The company is focusing on advancements in artificial intelligence, launching the Qwen 3 large language model to power its AI assistant Quark [4]
Competitive Landscape
- The AI sector in China is highly competitive, with notable investments from other tech giants like Tencent, which reported a 91% year-on-year increase in capital expenditures driven by AI investments [4]
Next week's talk: as large models enter the second half of RL, why does model evaluation matter?
Founder Park· 2025-05-09 11:55
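Large models are entering the second half of RL. A while ago, the blog post "The Second Half" by OpenAI agent researcher Yao Shunyu sparked heated discussion. As the focus moves from "model algorithms" to "practical utility", redefining problems and designing evaluations for real use cases has become especially important. From benchmarks to real application results, how can existing evaluation systems effectively measure the ROI of agent products? For startups and enterprises hoping to apply AI, how can model evaluation results be used to guide product development and deployment? SuperCLUE has deep experience in model evaluation and maintains close contact with many model and agent teams in China and abroad. SuperCLUE recently launched AgentCLUE-General, an evaluation benchmark for Chinese general-purpose AI agents, and conducted an in-depth analysis of the capabilities of mainstream agent products. We have invited Zhu Lei, co-founder of SuperCLUE, to talk about the core challenges in evaluating large models and agents today. Related reading: "o3 explained: OpenAI pushes tool use; will products like Manus be replaced by models?"; "Qwen 3 released: open source is becoming the 'optimal solution' for Chinese large-model companies to break through". The second half of AI: why is evaluation of large models ...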