通义千问（Qwen）系列 - filings, earnings calls, financial reports, news

通义千问（Qwen）系列

Search documents

3 6 Ke· 2025-08-06 07:50

Core Insights - OpenAI has released two open-source large language models, gpt-oss-120b and gpt-oss-20b, marking its first open-source model release since GPT-2 in 2019, signaling a significant shift in the global AI landscape [1][2][4] Model Specifications - gpt-oss-120b has a total parameter count of 117 billion and can run on a single 80GB GPU, designed for production environments and high inference demands [2] - gpt-oss-20b has a total parameter count of 21 billion and can operate on a 16GB GPU, optimized for lower latency and localized use cases [2] - Both models utilize the Transformer architecture and incorporate a mixture of experts (MoE) design to enhance efficiency [2] Licensing and Usability - The models are released under a permissive Apache 2.0 license, allowing developers to use, modify, and commercialize without fees or copyleft restrictions [3] - They support configurable inference strength and provide full access to the reasoning process, facilitating debugging and enhancing output credibility [3] Market Impact - OpenAI's release is seen as a response to increasing competition in the global AI market, where many companies are rapidly developing and releasing their own models [4][5] - Prior to OpenAI's release, several Chinese companies, including Tencent and Alibaba, had already launched their own open-source models, intensifying the competitive landscape [6][7][8] Competitive Landscape - The recent surge in open-source model releases from various companies in China, such as Baidu and Tencent, has set a new benchmark in the AI open-source arena [7][10] - OpenAI's entry with gpt-oss models is expected to significantly alter the dynamics of the domestic AI model competition, providing opportunities for local companies to learn and innovate [10]

大模型开源

Artificial Intelligence

Artificial Intelligence

中国AI模型全面爆发，AI大模型技术体系综合开源影响力榜单重磅发布！

AI科技大本营· 2025-04-18 05:53

一提到"大模型"，很多人的第一反应往往是那个既能聊天，又会写代码、画画的"模型本身"。但其实，大模型远不止是一个"能输出结果的程序"这么简单，其背后有一整套复杂而庞大的技术体系作为支撑：从大规模、高质量、多样化的数据，到先进的模型架构与训练策略，再到推理部署、资源调度等支撑落地的系统能力，以及不可或缺的科学评测机制。大模型更像是一个由模型、数据、系统、评测平台等多要素构成的"技术共同体"，而非单一模块的堆叠。如今在闭源技术壁垒与高昂商用门槛的对比下，开源大模型正迅速崛起，成为推动 AI 技术普惠化的重要力量。但面对层出不穷的开源 AI 模型技术，我们该如何选型？不同的模型技术体系又各有怎样的优势与短板？在这一背景下，为系统呈现全球大模型生态的开源发展现状，CSDN 联合多家机构于 4 月 18 日在 2025 全球机器学习技术大会（ML-Summit 2025）现场重磅发布《AI 大模型技术体系综合开源影响力榜单》，全面评估全球范围内开源大模型技术体系的贡献与影响力，旨在为行业提供参考坐标，推动开源创新持续前行。注：这里大模型是指主要包括 decoder-only 以来的模型结构，包 ...

人工智能

大模型

开源大模型

Artificial Intelligence

Artificial Intelligence

LLaMA系列模型

通义千问（Qwen）系列