Open-source large models

Chinese players rise collectively in autonomous driving and general-purpose AI
Huan Qiu Wang· 2025-09-01 09:00
Group 1
- The 2025 TIME100 AI list recognizes influential figures in AI; Peng Jun, CEO of Pony.ai, is the only representative from the autonomous driving sector [1]
- Peng Jun is recognized as a leader of the autonomous driving revolution; Pony.ai aims to deploy a fleet of 1,000 Robotaxis by 2025 and to push Level 4 autonomous driving into large-scale operation [1]
- In accepting the honor, Peng Jun said that using technology to improve human mobility remains Pony.ai's consistent mission [1]
Group 2
- Liang Wenfeng, CEO of DeepSeek, made the list for breakthroughs in open-source large models and general AI; the company's DeepSeek-V3 model has gained global recognition [2]
- Wang Xingxing, CEO of Unitree Robotics (Yushu Technology), also made the list; the company holds two-thirds of the global robot-dog market and is described as a best seller in humanoid robots [2]
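Latest Frost & Sullivan report: Chinese enterprises call large models for over 10 trillion tokens a day, with Alibaba Tongyi first by share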
Zheng Quan Shi Bao Wang· 2025-09-01 04:21
Core Insights
- A Frost & Sullivan report finds that China's enterprise-level generative AI market is growing explosively: daily large-model consumption reached 10.2 trillion tokens in the first half of 2025, a 363% increase over the second half of 2024 [1][2]
- Alibaba Tongyi leads the market with a 17.7% share, followed by ByteDance's Doubao at 14.1% and DeepSeek at 10.3%; together the three account for over 40% of the market [1]
Group 1: Market Trends
- 70% of enterprises deploy or call large models through the public cloud, and 71% plan to increase their use of generative AI services delivered via public cloud [2]
- Enterprises are shifting from seeking a single strongest model to finding the best fit for each business scenario, which points to growing demand for diverse model types and applications [2]
Group 2: Open Source Models
- Open-source models are identified as a key growth driver of the enterprise large-model market, with the report predicting that over 80% of enterprises will adopt open-source large models [2]
- The performance gap between domestic open-source models and top international closed-source models is narrowing, and models such as Qwen and DeepSeek are expected to keep gaining traction [2]
Group 3: Alibaba's Developments
- Alibaba Tongyi has recently open-sourced several foundation models, including Qwen3-Coder and Qwen-Image, prompting a surge of global interest in Chinese models [3]
- Qwen has reached a global market share of over 12.3%, surpassing OpenAI and Llama models; Qwen3-Coder's usage rose 1474% in one week, making it the second most used model for programming [3]
- Alibaba has open-sourced over 300 models, covering "full size," "full modality," and "multi-scenario" applications [3]
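The headline figures above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes that "a 363% increase" means the first-half-2025 volume is 4.63 times the second-half-2024 volume (the report summary does not spell this out), and the function names are hypothetical.

```python
# Figures quoted in the summary above (Frost & Sullivan, first half of 2025).
shares = {"Alibaba Tongyi": 17.7, "ByteDance Doubao": 14.1, "DeepSeek": 10.3}
daily_tokens_1h2025 = 10.2e12  # tokens per day
growth_pct = 363               # percent increase over 2H 2024 (assumed reading)


def combined_share(share_by_vendor):
    """Sum the percentage shares, rounded to one decimal place."""
    return round(sum(share_by_vendor.values()), 1)


def implied_baseline(current, pct_increase):
    """Value before a pct_increase percent rise that produced `current`."""
    return current / (1 + pct_increase / 100)


top3 = combined_share(shares)
baseline = implied_baseline(daily_tokens_1h2025, growth_pct)

print(f"Top-3 combined share: {top3}%")
print(f"Implied 2H 2024 daily volume: {baseline / 1e12:.1f} trillion tokens")
```

Under that reading, the three leaders hold 42.1% of the market, consistent with "over 40%", and the implied second-half-2024 baseline is roughly 2.2 trillion tokens per day.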
Chinese enterprises' daily large-model usage tops 10 trillion tokens: Alibaba Tongyi first at 17.7%, ByteDance Doubao second at 14.1%, DeepSeek third at 10.3%
Ge Long Hui· 2025-09-01 03:46
Group 1
- According to the Frost & Sullivan report, enterprise-level large-model invocation in China is growing explosively: daily invocation volume in the first half of 2025 exceeded 10 trillion tokens, up 363% from the end of 2024 [1]
- Alibaba Tongyi holds the largest market share at 17.7%, making it the large model most often chosen by Chinese enterprises [1]
- The report anticipates that as domestic models such as Qwen and DeepSeek continue to be open-sourced in 2025, the performance gap between open-source models and top international closed-source models will nearly close, and that over 80% of enterprises will adopt open-source large models, driving a new wave of growth in the enterprise market [1]
Chinese open-source large models dominate Design Arena, taking all of the top 15 spots
Sou Hu Cai Jing· 2025-08-25 15:25
Core Insights
- China's open-source large-model sector is growing rapidly and drawing wide attention from the industry [1]
- The top-ranking open-source AI models on the Design Arena platform are predominantly from China [1][2]
Group 1: Model Rankings
- Design Arena ranks models by having users vote on responses generated by different models, which keeps the rankings fair and dynamic [2]
- All of the top 15 models listed as open-source are Chinese, with DeepSeek-R1-0528 in first place, followed by Zhipu's GLM-4.5 and Alibaba's Qwen 3 Coder 480B [2][3]
- The ranking includes multiple models from DeepSeek, Qwen, and GLM; the first non-Chinese model, OpenAI's GPT OSS 120B, appears only in 16th place [2][3]
Group 2: Industry Developments
- Domestic AI companies have recently released a new generation of open-source large models, advancing AI technology [4]
- Vendors including Alibaba and Zhipu released a total of 33 large models in July, indicating a robust trend in the domestic open-source landscape [4]
- 19 leading open-source model laboratories have emerged in China, among them DeepSeek and Qwen, and their combined efforts are driving the rise of domestic open-source models [4]
Group 3: Competitive Landscape
- Closed-source models such as the GPT series have historically held the technological edge, but the rise of open-source models, particularly the Llama series, is reshaping the global AI landscape [4]
- Chinese open-source models such as Qwen and DeepSeek are now seen as competitive alternatives to top-tier closed-source models, shifting the industry's focus toward model tuning and application optimization [4]
The top 15 open-source large models worldwide are all Chinese
机器之心· 2025-08-25 09:10
Core Viewpoint
- The article highlights the emergence of China's open-source large language models (LLMs): every top-ranked open-source model on the Design Arena leaderboard is Chinese [1][3].
Group 1: Overview of Design Arena
- Design Arena is the largest crowdsourced benchmark platform for AI-generated design; it ranks models with a user evaluation system based on Elo ratings, similar to chess scoring [2].
- Users vote on which of two model-generated responses is better, producing a dynamic ranking that reflects real user experience [2].
Group 2: Rankings of Open-Source Models
- The top 15 open-source models on Design Arena are all from China, with DeepSeek-R1-0528 ranked first, followed by Zhipu's GLM 4.5 and Alibaba's Qwen 3 Coder 480B [4][5].
- Within the top 15, DeepSeek has 5 models, Alibaba has 6, and Zhipu has 3 [6][7].
Group 3: Recent Developments in Open-Source Models
- Domestic AI companies have recently been releasing new open-source LLMs at a brisk pace, with 33 models launched by firms including Alibaba and Tencent [7].
- A total of 19 leading open-source model laboratories have been identified in China, showing a diverse range of contributors to open-source AI [9].
Group 4: Impact on AI Research and Development
- The rise of open-source models such as DeepSeek and Qwen is shifting application companies' focus toward model tuning and optimization, which accelerates the deployment of AI technologies [10].
- The article suggests that the growing prominence of Chinese AI models may reshape the global AI landscape, with open source potentially becoming a standard in advanced model development [10].
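The Elo-style pairwise voting described above can be sketched as follows. This is a minimal illustration of the standard chess Elo update, not Design Arena's actual implementation: the K-factor of 32, the 400-point scale, the starting rating of 1500, and the model names are all assumptions.

```python
def expected_score(rating_a, rating_b):
    """Probability that A beats B under the standard Elo logistic curve."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a, rating_b, a_wins, k=32.0):
    """Return both ratings after one head-to-head vote (k=32 is an assumption)."""
    exp_a = expected_score(rating_a, rating_b)
    score_a = 1.0 if a_wins else 0.0
    new_a = rating_a + k * (score_a - exp_a)
    # B's change mirrors A's, so the sum of the two ratings is conserved.
    new_b = rating_b + k * ((1.0 - score_a) - (1.0 - exp_a))
    return new_a, new_b


# Hypothetical models and votes; each tuple is (model A, model B, did A win?).
ratings = {"model_x": 1500.0, "model_y": 1500.0}
votes = [
    ("model_x", "model_y", True),
    ("model_x", "model_y", True),
    ("model_x", "model_y", False),
]
for a, b, a_wins in votes:
    ratings[a], ratings[b] = update_elo(ratings[a], ratings[b], a_wins)

for name, rating in sorted(ratings.items(), key=lambda item: -item[1]):
    print(f"{name}: {rating:.1f}")
```

Because each vote moves a rating in proportion to how surprising the result was, an upset win over a higher-rated model shifts the ranking more than an expected win, which is what keeps such a leaderboard responsive to new votes.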
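Media industry weekly watch (20250818-20250822): watch for stocks beating interim-report expectations and changes in Hong Kong stock liquidity; positive on the outlook for gaming, AI, IP, and film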
Huachuang Securities· 2025-08-25 06:31
Investment Rating
- The report maintains a "Recommendation" rating for the media industry, expecting it to outperform the benchmark index by over 5% in the next 3-6 months [42].
Core Insights
- The media sector is in a positive trend, driven by the rise of AI applications and by cultural confidence stemming from content output. The report expects 2025 to bring significant breakthroughs in China's open-source large-model applications and industry growth [5][6].
- The media sector rose 5.17% last week, outperforming the CSI 300 index by 0.99 percentage points [6][19].
- Key areas of focus are gaming, IP, AI, and film, with specific recommendations including Tencent, Alibaba, Kuaishou, and Meitu [5][19].
Market Performance
- The media sector's total market capitalization is approximately 188.1 billion yuan, with a circulating market value of about 171.3 billion yuan [2].
- The sector's absolute performance is 12.1% over the past month, 11.1% over 6 months, and 81.8% over 12 months [3].
- The gaming market is dominated by Tencent's products, with "Honor of Kings" consistently ranking first [14].
Gaming Market
- The report stresses monitoring high-frequency data and the performance of key gaming titles, particularly after the release of mid-year reports [5][14].
- Notable upcoming releases include "Blood of Heroes: Return" on August 27, which is expected to contribute positively to the sector [5][16].
IP Market
- The report sees a bullish trend in the IP market, particularly with Pop Mart's upcoming release of "mini labubu," which is expected to drive sales [5][27].
- Companies such as Chuangyuan Co. and Pop Mart are highlighted for their strong IP portfolios and growth potential [5][29].
Film Market
- As of August 22, 2025, the film market's box office for the year stands at 34.13 billion yuan, about 85% of the level for the same period in 2019 [19][20].
- The average ticket price is 32.2 yuan, with a total of 8.75 billion viewers [19][20].
AI Market
- The report discusses continuing updates and innovation in AI models, with companies such as Zhongwen Online and Zhejiang Data Culture making notable progress in product development [5][27].
- The launch of DeepSeek-V3.1 is noted as a significant advance in AI capabilities [27].
Just in: ByteDance open-sources the Seed-OSS-36B model with 512k context
机器之心· 2025-08-21 01:03
Core Viewpoint
- ByteDance's Seed team has officially released and open-sourced the Seed-OSS model series in three versions: Seed-OSS-36B-Base (with synthetic data), Seed-OSS-36B-Base (without synthetic data), and Seed-OSS-36B-Instruct. The models were trained on 12 trillion tokens and perform strongly on a range of benchmarks [1][2].
Model Features
- The Seed-OSS-36B architecture combines causal language modeling, Grouped Query Attention, the SwiGLU activation function, RMSNorm, and RoPE positional encoding [4].
- Each model has 36 billion parameters across 64 layers and uses a vocabulary of 155,000 tokens [5].
- A notable feature is native long-context support, with a maximum context length of 512k tokens, which allows long documents and reasoning chains to be processed without performance loss [6][7].
Inference Budget Control
- The model introduces inference budget control, which lets developers specify how much reasoning the model should perform before it answers [10].
- This lets teams trade off performance against deployment efficiency according to task complexity [12].
- Recommended budget values are multiples of 512 tokens; a budget of 0 means the model outputs its answer directly [13][26].
Benchmark Performance
- Seed-OSS-36B-Base scored 65.1 on MMLU-Pro and 81.7 on MATH, a competitive result [15].
- Seed-OSS-36B-Instruct achieved state-of-the-art (SOTA) results in several areas, including 91.7% on AIME24 and 67.4 on LiveCodeBench v6 [17].
- In long-context tests, the model scored 94.6 on RULER (128K context length), the highest score among open-source models [18].
User Interaction and Token Management
- While reasoning, the model reports its token usage, keeping users aware of resource consumption [25].
- If no inference budget is set, the model defaults to reasoning of unlimited length, while a budget of 0 prompts direct answer output [27].
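The budget rules summarized above (multiples of 512 recommended, 0 for a direct answer, unset for unlimited reasoning) can be captured in a small helper. The sketch below is hypothetical: the function names are invented for illustration, rounding a requested value up to the next multiple of 512 is an assumed policy, and nothing here reflects the actual Seed-OSS API.

```python
from typing import Optional

BUDGET_STEP = 512  # recommended granularity, per the article


def normalize_budget(requested: Optional[int]) -> Optional[int]:
    """Map a requested reasoning budget onto the recommended values.

    None -> None (no budget set: unlimited-length reasoning)
    0    -> 0    (answer directly, no reasoning)
    n>0  -> n rounded up to a multiple of 512 (an assumed policy)
    """
    if requested is None:
        return None
    if requested < 0:
        raise ValueError("budget must be non-negative")
    if requested == 0:
        return 0
    # Ceiling division without floats, then scale back up.
    return -(-requested // BUDGET_STEP) * BUDGET_STEP


def describe_budget(budget: Optional[int]) -> str:
    """Human-readable summary of what a normalized budget implies."""
    if budget is None:
        return "unlimited reasoning"
    if budget == 0:
        return "direct answer"
    return f"reason for up to {budget} tokens"


for value in (None, 0, 300, 512, 1000):
    print(value, "->", describe_budget(normalize_budget(value)))
```

Rounding up rather than down is a design choice made here so that a caller never receives less reasoning than requested; a deployment that prioritizes cost could round down instead.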
Media industry weekly watch (20250811-20250815): positive on high-prosperity segments such as gaming, IP, AI, and film
Huachuang Securities· 2025-08-18 05:47
Investment Rating
- The report maintains a "Recommendation" rating for the media industry, expecting the industry index to outperform the benchmark index by over 5% in the next 3-6 months [3][44].
Core Viewpoints
- The report is optimistic about gaming, intellectual property (IP), artificial intelligence (AI), and film, indicating a favorable market outlook [1][3].
- The media sector is seeing a resurgence, with AI applications gaining traction and content output bolstering cultural confidence [3][6].
- The report sees potential for significant growth in AI applications, particularly in public cloud services and user engagement scenarios [3][6].
Market Performance Overview
- The media sector index rose 1.00% last week while the CSI 300 index rose 2.37%, a relative underperformance of 1.37 percentage points [7][10].
- The media sector's total market capitalization is approximately 178.65 billion yuan, across 140 listed companies [3].
Gaming Sector Insights
- The gaming market shows positive trends, with high-frequency data moving upward and favorable expectations for mid-year reports [3][15].
- Titles such as "Peacekeeper Elite" and "Honor of Kings" continue to dominate the iOS sales rankings, reflecting strong daily active user (DAU) engagement [15][16].
Film Market Analysis
- As of August 15, 2025, the film box office has reached 33.006 billion yuan, about 85% of pre-pandemic box office revenue [20][21].
- The average ticket price is 32.6 yuan, with a total of 20.879 million viewers during the week of August 11-15, 2025 [21][26].
AI Sector Developments
- The report notes ongoing advances in AI applications and highlights companies such as Kuaishou and Youzan, which are expected to benefit from AI integration [3][29].
- New AI technologies and products from major companies such as Huawei and Apple are expected to further drive growth in the sector [29][30][31].
Key Company Recommendations
- The report suggests focusing on companies such as Tencent, Alibaba, Kuaishou, and Meitu, which are well positioned to benefit from current market dynamics [3][6].
- Giant Network, G-bits, and Perfect World are highlighted as potential investment opportunities within the gaming sector [3][6].
Global AI large-model iteration speeds up! China's open-source ecosystem takes off
Wind万得· 2025-08-12 22:37
Core Viewpoint
- The pace of technological iteration in the global AI industry is accelerating, with major companies such as OpenAI, Google DeepMind, and Baidu releasing or updating large-model products in a period of intensive innovation [1]
Group 1: Major Company Developments
- OpenAI launched GPT-5 on August 8, featuring enhanced reasoning, multimodal capabilities, and enterprise customization, with significant improvements in programming performance and lower hallucination rates [3]
- Baidu plans to release a new AI inference model by the end of August, aimed at improving the handling of complex tasks [3]
- Google DeepMind introduced the "Genie 3" model on August 6; it can generate dynamic 3D worlds but is still limited in practical operability and multi-agent interaction [3]
- Chinese companies are advancing quickly in open-source large models: Tencent announced the open-source "Hunyuan 3D World Model 1.0," and Alibaba released four open-source models, one of which ranks third globally on an international evaluation platform [3][4]
Group 2: Open Source Landscape
- As of July 31, nine of the top ten open-source large models globally are from Chinese companies, with Zhipu GLM-4.5 ranked first, marking China's move from technology catch-up to ecosystem leadership [4]
- The open-source approach of Chinese companies contrasts with the closed-source model favored by U.S. tech firms such as OpenAI, which moved from open-source to closed-source operations to maintain its technological edge [6]
Group 3: Industry Challenges and Opportunities
- Open source accelerates technology dissemination but also brings "fine-tuning involution," in which most updates focus on parameter tuning rather than innovation in foundational architecture [6]
- Frequent model updates and interface changes cause compatibility problems for developers and complicate integration [6]
- The "combinatorial effect" of open-source models may weaken technological barriers, preventing significant capability gaps from opening between companies [6]
Group 4: Market Dynamics and Future Outlook
- Differentiated AI applications are creating incremental opportunities: Kuaishou focuses on video and image generation, Alibaba applies AI to e-commerce, and Tencent is exploring advertising and gaming [7]
- Registered individual users of large models now exceed 3.1 billion, and API call users exceed 159 million [7]
- The next generation of large models is expected to benefit from rising reasoning demand, which in turn drives growth in computing power requirements [7]
- In 2025, the AI large-model industry is expected to show faster technological iteration, a rising open-source ecosystem, and diverse commercialization paths, strengthening China's global influence in AI [7]
Surpassing OpenAI in medical capability: Baichuan releases open-source large model Baichuan-M2
Feng Huang Wang· 2025-08-11 07:32
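Phoenix New Media Tech, August 11: Baichuan has officially released Baichuan-M2, an open-source large model enhanced for medicine. According to the company, at a relatively small size of 32B the model not only overtakes OpenAI's latest open-source model, gpt-oss-120b, but also outperforms Qwen3-235B, DeepSeek R1, Kimi K2, and every other open-source large model currently available. In addition, to meet the need for private deployment that arises from patient-privacy concerns in medicine, Baichuan made Baichuan-M2 extremely lightweight: the quantized model is nearly lossless in accuracy and can be deployed on a single RTX4090 card, cutting cost by a factor of 57 compared with deploying DeepSeek-R1 on two H20 nodes. ...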