Workflow
通义千问(Qwen)系列
icon
Search documents
OpenAI时隔六年再开源,国内大模型竞争格局添变数
3 6 Ke· 2025-08-06 07:50
Core Insights - OpenAI has released two open-source large language models, gpt-oss-120b and gpt-oss-20b, marking its first open-source model release since GPT-2 in 2019, signaling a significant shift in the global AI landscape [1][2][4] Model Specifications - gpt-oss-120b has a total parameter count of 117 billion and can run on a single 80GB GPU, designed for production environments and high inference demands [2] - gpt-oss-20b has a total parameter count of 21 billion and can operate on a 16GB GPU, optimized for lower latency and localized use cases [2] - Both models utilize the Transformer architecture and incorporate a mixture of experts (MoE) design to enhance efficiency [2] Licensing and Usability - The models are released under a permissive Apache 2.0 license, allowing developers to use, modify, and commercialize without fees or copyleft restrictions [3] - They support configurable inference strength and provide full access to the reasoning process, facilitating debugging and enhancing output credibility [3] Market Impact - OpenAI's release is seen as a response to increasing competition in the global AI market, where many companies are rapidly developing and releasing their own models [4][5] - Prior to OpenAI's release, several Chinese companies, including Tencent and Alibaba, had already launched their own open-source models, intensifying the competitive landscape [6][7][8] Competitive Landscape - The recent surge in open-source model releases from various companies in China, such as Baidu and Tencent, has set a new benchmark in the AI open-source arena [7][10] - OpenAI's entry with gpt-oss models is expected to significantly alter the dynamics of the domestic AI model competition, providing opportunities for local companies to learn and innovate [10]
中国AI模型全面爆发,AI大模型技术体系综合开源影响力榜单重磅发布!
AI科技大本营· 2025-04-18 05:53
一提到"大模型",很多人的第一反应往往是那个既能聊天,又会写代码、画画的"模型本身"。但其 实,大模型远不止是一个"能输出结果的程序"这么简单,其背后有一整套复杂而庞大的技术体系作为 支撑:从大规模、高质量、多样化的数据,到先进的模型架构与训练策略,再到推理部署、资源调度 等支撑落地的系统能力,以及不可或缺的科学评测机制。大模型更像是一个由模型、数据、系统、评 测平台 等多要素构成的"技术共同体",而非单一模块的堆叠。 如今在闭源技术壁垒与高昂商用门槛的对比下,开源大模型正迅速崛起,成为推动 AI 技术普惠化的 重要力量。但面对层出不穷的开源 AI 模型技术,我们该如何选型?不同的模型技术体系又各有怎样 的优势与短板? 在这一背景下,为系统呈现全球大模型生态的开源发展现状,CSDN 联合多家机构于 4 月 18 日在 2025 全球机器学习技术大会(ML-Summit 2025)现场重磅发布《AI 大模型技术体系综合开源影响 力榜单》,全面评估全球范围内开源大模型技术体系的贡献与影响力,旨在为行业提供参考坐标,推 动开源创新持续前行。 注:这里大模型是指 主要包括 decoder-only 以来的模型结构,包 ...