开源大模型

Search documents
国产开源大模型霸榜Design Arena,前十五名全数上榜展现强劲实力
Sou Hu Cai Jing· 2025-08-25 15:25
近期,国内开源大模型领域迎来了一波显著的发展浪潮,引发了业界的广泛关注。 在最新的技术动态中,新一代大语言模型(LLM)的更新让开源大模型再次站在了聚光灯下。值得注意的是,一位名为Rohan Paul的软件工程师兼自媒体 人在探索过程中发现了一个引人注目的现象:在Design Arena这一全球知名的众包AI生成设计Benchmark平台上,排名前列的开源AI模型几乎全部来自中 国。 | ට | DeepSeek-V3.1 | 1258 134W / 96L | 58.3% I | ±6.4% | 230 | DeepSeek | 46.85 | | --- | --- | --- | --- | --- | --- | --- | --- | | 10 | 19 Qwen3 Coder 30B A3B Instruct | 1258 1897 / 3505 | 58.3% | +3.3% | 839 | Alibaba | 35.55 | | 11 | GLM 4 32B | 1237 207W / 167L | 55.3% | ±5.0% | 374 | THUDM | 3m 21s | | 12 | K ...
全球开源大模型,前十五名全是中国的
机器之心· 2025-08-25 09:10
机器之心报道 机器之心编辑部 国产开源力量的集中爆发。 都在说国内大模型正在驰骋开源领域,具体的情况如何? 近日,随着新一代大语言模型(LLM)的一波更新,开源大模型再次成为了热门讨论话题。软件工程师、自媒体 Rohan Paul 发现了一个惊人的现象: Design Arena 排行榜上排名前十几位开源 AI 模型全部来自中国。 排名第一的是 DeepSeek-R1-0528,智谱的 GLM-4.5 和阿里的 Qwen 3 Coder 480B 紧随其后。 Design Arena 是目前全球最大规模的众包 AI 生成设计 Benchmark 平台,它的核心机制是让真实的人类用户进行评测,基于 Elo Rating(类似于国际象 棋评分体系)等级分制度进行模型对战。 用户在平台上会被随机展示两段由不同模型生成的回答,然后进行投票选择「哪一个更好」。每一次投票都会影响对应模型的 Elo 分数,进而形成动态的排 行榜。Elo 核心原理是,高分选手击败低分选手,得分会很少,而低分选手爆冷战胜高分选手时,得分会很多。因此用对弈的角度来看的话,这是一个相对 公平、符合认知的评分系统。 因此,不同于 MMLU、SWE- ...
传媒行业周观察(20250818-20250822):关注中报超预期标的及港股流动性变化,看好后续游戏、AI、IP、影视行情
Huachuang Securities· 2025-08-25 06:31
证 券 研 究 报 告 传媒行业周观察(20250818-20250822) 关注中报超预期标的及港股流动性变化,看 推荐(维持) 好后续游戏、AI、IP、影视行情 【市场观点】 行业研究 传媒 2025 年 08 月 25 日 华创证券研究所 证券分析师:刘欣 邮箱:liuxin3@hcyjs.com 执业编号:S0360521010001 证券分析师:王世豪 邮箱:wangshihao@hcyjs.com 执业编号:S0360525070005 行业基本数据 | | | 占比% | | --- | --- | --- | | 股票家数(只) | 140 | 0.02 | | 总市值(亿元) | 18,810.16 | 1.63 | | 流通市值(亿元) | 17,127.16 | 1.85 | 相对指数表现 | % | 1M | 6M | 12M | | --- | --- | --- | --- | | 绝对表现 | 12.1% | 11.1% | 81.8% | | 相对表现 | 5.8% | 1.0% | 49.7% | -5% 24% 53% 82% 24/08 24/11 25/01 25/03 ...
刚刚,字节开源Seed-OSS-36B模型,512k上下文
机器之心· 2025-08-21 01:03
| 机器之心报道 | 机器之心编辑部 | | --- | --- | | 开源赛道也是热闹了起来。 | | | 就在深夜,字节跳动 Seed 团队正式发布并开源了 Seed-OSS 系列模型,包含三个版本: | | | Seed-OSS-36B-Base(含合成数据) | | | Seed-OSS-36B-Base(不含合成数据) | | | Seed-OSS-36B-Instruct(指令微调版) | | Seed-OSS 使用了 12 万亿(12T)tokens 进行训练,并在多个主流开源基准测试中取得了出色的表现。 这三个模型均以 Apache-2.0 许可证发布,允许研究人员和企业开发者自由使用、修改和再分发。 主要特性: 模型架构 Seed-OSS-36B 的架构结合了多种常见的设计选择,包括因果语言建模、分组查询注意力(Grouped Query Attention)、SwiGLU 激活函数、RMSNorm 和 RoPE 位置 编码。 每个模型包含 360 亿参数,分布在 64 层网络中,并支持 15.5 万词表。 其最具代表性的特性之一是原生长上下文能力,最大上下文长度可达 512k token ...
传媒行业周观察(20250811-20250815):看好游戏、IP、AI、影视等景气度方向
Huachuang Securities· 2025-08-18 05:47
Investment Rating - The report maintains a "Recommendation" rating for the media industry, expecting the industry index to outperform the benchmark index by over 5% in the next 3-6 months [3][44]. Core Viewpoints - The report highlights optimism in sectors such as gaming, intellectual property (IP), artificial intelligence (AI), and film, indicating a favorable market outlook [1][3]. - The media sector is currently experiencing a resurgence, with AI applications gaining traction and cultural confidence being bolstered through content output [3][6]. - The report emphasizes the potential for significant growth in the AI application industry, particularly in public cloud services and user engagement scenarios [3][6]. Market Performance Overview - The media sector index rose by 1.00% last week, underperforming the CSI 300 index, which increased by 2.37%, resulting in a relative underperformance of 1.37% [7][10]. - The media sector's total market capitalization is approximately 178.65 billion yuan, with 140 listed companies [3]. Gaming Sector Insights - The gaming market shows positive trends, with high-frequency data indicating upward movement and favorable mid-year report expectations [3][15]. - Notable games such as "Peacekeeper Elite" and "Honor of Kings" continue to dominate the iOS sales rankings, reflecting strong daily active user (DAU) engagement [15][16]. Film Market Analysis - As of August 15, 2025, the film box office has reached 33.006 billion yuan, recovering approximately 85% of the pre-pandemic levels in terms of box office revenue [20][21]. - The average ticket price is reported at 32.6 yuan, with a total of 20.879 million viewers during the week of August 11-15, 2025 [21][26]. AI Sector Developments - The report notes the ongoing advancements in AI applications, with a focus on companies like Kuaishou and Youzan, which are expected to benefit from AI integration [3][29]. - The launch of new AI technologies and products by major companies like Huawei and Apple is anticipated to further drive growth in the sector [29][30][31]. Key Company Recommendations - The report suggests focusing on companies such as Tencent, Alibaba, Kuaishou, and Meitu, which are well-positioned to leverage the current market dynamics [3][6]. - Specific stocks like Giant Network, G-bits, and Perfect World are highlighted as potential investment opportunities within the gaming sector [3][6].
全球AI大模型迭代提速!中国开源生态爆发
Wind万得· 2025-08-12 22:37
Core Viewpoint - The global AI industry is experiencing a rapid acceleration in technological iterations, with major companies like OpenAI, Google DeepMind, and Baidu releasing or updating large model products, indicating a period of intensive innovation [1] Group 1: Major Company Developments - OpenAI launched GPT-5 on August 8, featuring enhanced reasoning, multimodal capabilities, and enterprise customization, with significant improvements in programming performance and reduced hallucination rates [3] - Baidu plans to release a new AI inference model by the end of August, aimed at enhancing complex task processing capabilities [3] - Google DeepMind introduced the "Genie3" model on August 6, capable of generating dynamic 3D worlds, although it still faces limitations in practical operability and multi-agent interactions [3] - Chinese companies are making significant strides in the open-source large model sector, with Tencent announcing the open-source "Hunyuan 3D World Model 1.0" and Alibaba releasing four open-source models, with one ranking third globally on an international evaluation platform [3][4] Group 2: Open Source Landscape - As of July 31, nine out of the top ten open-source large models globally are from Chinese companies, with Zhipu GLM-4.5 ranked first, showcasing China's transition from technology catch-up to ecosystem leadership [4] - The open-source approach adopted by Chinese companies contrasts with the closed-source model favored by U.S. tech firms like OpenAI, which has shifted from open-source to closed-source operations to maintain its technological edge [6] Group 3: Industry Challenges and Opportunities - The open-source model accelerates technology dissemination but faces challenges such as "fine-tuning internal competition," where most updates focus on parameter tuning rather than foundational architecture innovation [6] - Developers encounter compatibility issues due to frequent model updates and interface changes, complicating integration efforts [6] - The "combinatorial effect" of open-source models may weaken technological barriers, preventing significant capability gaps between companies [6] Group 4: Market Dynamics and Future Outlook - Differentiated AI applications are creating incremental opportunities, with companies like Kuaishou focusing on video and image generation, Alibaba leveraging AI in e-commerce, and Tencent exploring applications in advertising and gaming [7] - As of now, the total number of registered personal users for large models exceeds 3.1 billion, with API call users surpassing 159 million [7] - The next generation of large models is expected to benefit from increased reasoning demands, driving growth in computing power requirements [7] - By 2025, the AI large model industry is anticipated to exhibit accelerated technological iterations, a rising open-source ecosystem, and diverse commercialization paths, enhancing China's global influence in the AI sector [7]
现在就等梁文锋了
投资界· 2025-08-10 07:45
Core Insights - The article discusses the recent advancements in AI technology, particularly focusing on the competitive landscape among major players like OpenAI, Google, and Anthropic, highlighting their latest model releases and innovations [5][10][11]. Group 1: OpenAI Developments - OpenAI has released its first open-weight large language models, gpt-oss-120b and gpt-oss-20b, with parameters of 117 billion and 21 billion respectively, designed for local deployment [13][19]. - The gpt-oss-120b model achieves performance close to OpenAI's o4-mini on core reasoning benchmarks and can run efficiently on a single 80 GB GPU [13][19]. - The release aims to address local deployment needs and market demands, although it includes restrictions on commercial use for entities with annual revenues exceeding $100 million or daily active users over 1 million [19][26]. Group 2: Google Innovations - Google introduced Genie 3, a groundbreaking model that allows users to generate interactive 3D virtual worlds from text prompts, achieving 720p resolution at 24 FPS [27][28]. - The model requires precise physical feedback and interaction, presenting significant technical challenges, but has the potential to revolutionize fields like robotics and gaming if successfully developed [29][30]. - Despite its impressive capabilities, Genie 3 is currently in the demonstration phase and not available for public testing, indicating it remains a future prospect [30]. Group 3: Anthropic's Strategy - Anthropic has updated its top-tier model, Claude Opus 4.1, which reportedly improves AI programming capabilities by 2%, reflecting the current upper limit of AI coding abilities [34][38]. - The model's performance metrics show it has the highest market share and reputation in AI coding, positioning Anthropic as a strong competitor against OpenAI and Google [38][39]. - The focus on enhancing programming capabilities allows Anthropic to maintain relevance in the competitive landscape of large model commercialization [38]. Group 4: Contributions from Chinese Scientists - The article highlights the significant contributions of Chinese scientists and engineers in the development of these AI models, particularly within OpenAI and Google [40][42]. - Key figures include Ren Hongyu, who worked on language model training optimization at OpenAI, and Emma Wang, who contributed to the design and optimization of Genie 3 at Google [42][46].
三位90后,估值700亿
投资界· 2025-08-10 07:45
Core Viewpoint - The article highlights the rapid rise of Mistral AI, a startup founded by three young graduates, which has achieved a remarkable valuation of approximately $10 billion within two years, showcasing the explosive growth potential in the AI sector [2][6][12]. Group 1: Company Overview - Mistral AI was founded by three 90s graduates who previously worked at top AI firms and returned to France to capitalize on the AI revolution [6][8]. - The company launched its first open-source large model, Mistral 7B, which outperformed competitors in several benchmark tests, quickly gaining attention in the developer community [6][7]. - Mistral AI aims to lead the generative AI wave through open-source initiatives, contrasting with closed models from competitors like OpenAI [6][7]. Group 2: Funding and Valuation - Mistral AI completed a record seed round of $1.13 billion shortly after its establishment, achieving a valuation of over $2.6 billion [10]. - By the end of 2023, the company raised $415 million in Series A funding, increasing its valuation to $2 billion, and later secured $640 million in Series B funding, bringing its valuation to $6 billion [11][12]. - The latest funding round discussions could potentially elevate Mistral's valuation to around $10 billion, with significant interest from major investors [12][13]. Group 3: Competitive Landscape - The AI landscape is becoming increasingly competitive, with the emergence of other open-source models like DeepSeek, which has gained significant traction [7][8]. - Mistral AI has launched several products, including a chatbot and a reasoning model, to compete directly with other players in the market [8]. - Despite initial success in France, Mistral's international performance has been mixed, indicating challenges in scaling beyond local markets [8]. Group 4: Industry Trends - The article notes a trend of young entrepreneurs in the AI sector, with many 90s graduates leading startups that are rapidly gaining valuations and market presence [14][16]. - The rise of AI is compared to the historical impact of electricity, suggesting that AI will significantly influence GDP across nations [13].
中国“霸榜”全球开源大模型:光环下的隐忧与挑战
Zheng Quan Shi Bao· 2025-08-06 18:37
Core Viewpoint - The recent surge in open-source AI models in China is reshaping the global AI landscape, with significant implications for technology influence and application acceleration, while also presenting challenges related to model iteration and compatibility costs [1][2][3]. Group 1: Open-source Model Surge - In the past two weeks, Alibaba's Tongyi Qianwen has released six open-source models, marking a resurgence in China's large model development, reminiscent of the "hundred model battle" of 2023 [1]. - The recent open-source wave has seen major Chinese companies, including Alibaba and Tencent, rapidly releasing new models, with China occupying nine out of the top ten spots in the Hugging Face open-source model ranking [2]. - The success of DeepSeek is viewed as a turning point, prompting more Chinese companies to adopt open-source strategies and focus on model optimization and iteration [2]. Group 2: Competitive Landscape - The latest rankings from Chatbot Arena show Alibaba's Tongyi Qianwen 3 surpassing several closed-source models, indicating a shift towards open-source dominance in China [4]. - The divergence in paths between open-source and closed-source models is evident, with Chinese companies embracing open-source while U.S. firms lean towards closed-source strategies [4][5]. - Open-source models are seen as a way for latecomers in the AI field to break the dominance of established players, allowing for rapid optimization and ecosystem development [5]. Group 3: Challenges and Concerns - The rapid iteration of open-source models has led to a phenomenon of "tuning internal competition" and homogenization, raising concerns about a lack of disruptive innovation [7][8]. - Developers face challenges with frequent updates and compatibility issues, leading to increased adaptation costs and potential innovation stagnation [8]. - Experts suggest the need for unified API standards and a focus on foundational research to avoid low-level repetitive construction and to foster genuine algorithmic breakthroughs [8].
狂揽70亿挑战DeepSeek,AI创企被曝新融资,被英伟达押宝,团队大牛云集
3 6 Ke· 2025-08-05 08:12
Core Insights - Reflection AI, a US-based startup, is in talks to raise over $1 billion for developing open-source large models to compete with providers like DeepSeek, Mistral, and Meta [2] - The company was founded in 2024 by former Google DeepMind scientists Ioannis Antonoglou and Misha Laskin, who have significant experience in AI development [2][5] - Reflection AI aims to create super-intelligent autonomous systems and has already launched its first programming agent, Asimov, which assists developers in coding tasks [2][11] Company Overview - Reflection AI has raised $130 million in March 2023, with a current valuation of $545 million [3] - The founding team consists of experts from Google DeepMind, OpenAI, and Anthropic, focusing on large language models and reinforcement learning [9][11] - The company emphasizes the importance of autonomous programming as a key step towards achieving superintelligence [11] Product Development - The Asimov agent can analyze enterprise data and generate relevant code, already attracting paying clients in sectors like finance and technology [11][12] - Asimov has reportedly improved developer productivity by tenfold, according to insights from Sequoia Capital [12] Market Positioning - Reflection AI is positioning itself to become a leading provider of open-source AI models in the US, responding to the growing demand for customizable and cost-effective solutions [16][18] - The company is capitalizing on the limitations of closed-source models, particularly regarding data security concerns faced by US companies [16] Industry Trends - The rise of open-source models is prompting US AI companies to accelerate their development efforts, as seen with Reflection AI's ambitions [19] - Training costs for AI models are significant, with OpenAI projecting over $7 billion in training expenses for 2023, highlighting the challenges for startups in this space [19]