开源大模型
Search documents
传媒行业周观察(20250526-20250530)
Huachuang Securities· 2025-06-03 00:25
Investment Rating - The report maintains a "Recommendation" rating for the media industry, expecting the industry index to rise more than 5% over the next 3-6 months compared to the benchmark index [49]. Core Viewpoints - The report expresses a positive outlook on the IP toy sector, highlighting its long-term growth potential driven by diverse product categories. The recent success of the "Jinli Naju" limited edition merchandise from Alibaba Pictures during the Dragon Boat Festival is noted as a significant indicator of market interest [5][6]. - The media sector is currently experiencing a resurgence in AI applications, with a focus on cultural confidence stemming from popular IPs like "Nezha." The report anticipates a reshaping of the application landscape in 2023, particularly in public cloud services and B-end SaaS enterprises [5][6]. - The gaming market is highlighted as a key area of interest, with recommendations to focus on companies like Huatuo, Perfect World, and JiBit, driven by product cycles and deepening AI integration [5][6]. Summary by Sections Market Performance Review - The media sector index rose by 1.74% last week, outperforming the CSI 300 index, which fell by 1.08%, resulting in a relative outperformance of 2.82% [8]. - The total market capitalization of the media sector is approximately 1,569.05 billion yuan, with 140 listed companies [2]. Gaming Market - Tencent's games dominate the iOS sales rankings, with "Honor of Kings" and "Peacekeeper Elite" leading the charts. New releases from other companies are also noted, indicating a competitive landscape [16][17]. Film Market - As of May 30, 2025, the film market has achieved a box office of 24.545 billion yuan, recovering approximately 98% of the box office compared to the same period in 2019. The total number of viewers is around 588 million, recovering about 86% [19][22]. - The top films during the week of May 26 to May 30 include "Mission: Impossible 8" and "Lilo & Stitch," with significant box office contributions [26]. Key Company Announcements - Meituan reported a revenue of 86.6 billion yuan for Q1 2025, exceeding market expectations by 18.1%, with a net profit of 10.95 billion yuan, reflecting a year-on-year growth of 46.2% [33]. - Kuaishou's Q1 2025 revenue reached 32.608 billion yuan, showing an 8.8% year-on-year increase, with a net profit of 3.978 billion yuan [34].
“开源大模型之城”,为何是杭州?
Sou Hu Cai Jing· 2025-05-30 07:09
在软件领域,开源与闭源两种路线之争由来已久。此前大模型以闭源为主,硅谷已写好了全球AI竞赛的剧本:闭源模式,限制技术扩散;算力堆砌,抬 高追赶壁垒;垄断优势,获得高昂商业利润。 "DeepSeek、通义千问等一批大模型加速发展",写入了2025年的杭州市政府工作报告中。以低成本打破赛道壁垒、震动全球同业的DeepSeek开源大模型背 后,是创新活力的迸发。杭州是如何发展开源大模型的,"开源大模型之城"为什么是杭州? 随着DeepSeek以开源模式引发行业变革,开源迅速成为大模型主流开发模式。 4月2日,全球最大AI开源社区HuggingFace发布最新榜单,排在前三的开源大模型分别来自阿里通义千问、DeepSeek和群核科技,领先于英伟达、谷歌等 公司。 榜单发布后,杭州再次引起业界瞩目。因为杭州包揽了前三,成为全球少有的、同时拥有3个世界顶级开源模型的城市,因此被誉为"开源大模型之城"。 开源大模型对AI普及应用、构建AI产业生态至关重要。目前,北京等地都在积极打造"全球开源之都",而杭州走在了前列。 杭州"开源大模型之城"是如何炼成的? 01 深厚土壤 然而,DeepSeek反其道而行之,凭借开源和低成本 ...
早报|特朗普称哈佛大学国际生比例最高15%;泡泡玛特回应Labubu品控问题;苹果计划全面重命名操作系统;荣耀回应机器人业务
虎嗅APP· 2025-05-28 23:55
Group 1: Education and International Relations - The U.S. government is imposing restrictions on Harvard University regarding international students, suggesting a cap of 15% on foreign students, which currently stands at approximately 31% [2] - The U.S. government has also announced the cancellation of federal funding for Harvard and has suspended new student visa interviews [2] Group 2: Financial Services and Investment - Chinese Vice Premier He Lifeng met with Morgan Stanley's co-president, expressing a commitment to high-level openness and inviting more U.S. financial institutions to deepen cooperation in China's capital market [3] - The Chinese Foreign Ministry emphasized that the essence of Sino-U.S. economic relations is mutual benefit, highlighting the significant bilateral demand reflected in increased orders from U.S. buyers [4] Group 3: Consumer Goods and Quality Control - Pop Mart's Labubu plush toys have gained popularity, but there are reports of quality control issues, including defects like misalignment and paint loss, leading to customer dissatisfaction [6] - Pop Mart's customer service stated that all products undergo quality checks before shipment, but minor imperfections may occur during production [6] Group 4: Technology and Innovation - Didi Enterprise Edition has become the first travel service provider for 3M in China, offering innovative services that have led to a 39% year-on-year increase in ride orders from foreign clients [8] - DeepSeek has released an open-source version of its R1 model, which reportedly performs comparably to OpenAI's latest models [9] Group 5: Pharmaceuticals and Healthcare - Fosun Pharma has signed exclusive commercialization agreements for several biopharmaceutical products with Nine Sources Gene, covering regions including the Middle East and parts of Southeast Asia [17] - The National Healthcare Security Administration is conducting checks on retail pharmacies to address potential issues of pharmacists' credentials being misused [16]
78%主创跳槽,Llama 14名作者只剩3人,Meta最强开源模型团队大溃散引争议
3 6 Ke· 2025-05-27 12:19
AI 人才争夺战愈演愈烈,就算是顶级大厂,如果没有"护城河",也留不住人。 据外媒 Business Insider 最新消息,曾在开源大模型圈子里一度领跑的 Meta,如今正面临严重的人才流失。在 Llama 模型最初的 14 位核心作者 中,已有 11 位离职。有的自立门户,有的跳槽去了竞争对手。 这波"出走潮"也让外界再次把目光投向 Meta。毕竟他们曾豪赌元宇宙,四年"烧掉"450 亿美元,却被直指至今几乎未见显著成效。现在 AI 项目 也出问题了,不少人开始质疑:Meta 还行不行?为什么留不住顶尖 AI 人才?它的创新能力,还能支撑它在这场 AI 竞赛中跑多远? Llama 论文的 14 位作者,已有 11 人离开 Meta 回头看 2023 年那篇引发轰动的 Llama 论文,共署名 14 位研究者。短短两年,Meta 只留下了其中三位:研究科学家 Hugo Touvron、研究工程师 Xavier Martinet 和项目负责人 Faisal Azhar。 论文地址:https://arxiv.org/pdf/2302.13971 其他 11 人,大多已经离开,分散到了全球多家科技公司,有的还 ...
夸克升级“深度搜索”功能,AI应用方向催化丰富,关注影视、游戏景气度回暖
Huachuang Securities· 2025-05-12 00:15
Investment Rating - The report maintains a "Recommendation" rating for the media industry, expecting the industry index to rise more than 5% over the next 3-6 months compared to the benchmark index [49]. Core Insights - The media sector is experiencing a resurgence in the film and gaming markets, driven by advancements in AI applications and cultural confidence stemming from popular IPs like "Nezha" [3][6]. - The report highlights the importance of AI applications in reshaping the industry landscape, with a focus on public cloud value reconstruction and the return to growth for related companies [6][9]. - The gaming market shows strong performance, with Tencent and NetEase leading in iOS sales rankings, indicating a healthy competitive environment [16][19]. - The film market is recovering, with total box office revenue reaching 239 billion yuan as of May 9, 2025, which is approximately 102% of the 2019 level [20][23]. Market Performance Review - The media (Shenwan) index rose by 1.40% last week, underperforming the CSI 300 index, which increased by 2.00%, resulting in a relative underperformance of 0.61% [9][10]. - The report notes that the media sector ranks 24th among all sectors in terms of performance [9]. Industry Highlights - The report emphasizes the potential of AI applications in various sectors, including gaming and education, suggesting a focus on companies like Huya, Giant Network, and Perfect World for gaming, and New Oriental and TAL Education for education [6][30]. - The film industry is expected to benefit from a strong pipeline of upcoming releases, with several key films set to debut in mid-May [29][30]. Company Announcements - Perfect World announced a 2025 employee stock ownership plan, indicating a commitment to employee engagement and retention [36]. - Wanda Film disclosed plans for a share reduction by a major shareholder, which may impact market sentiment [38].
贸易战下的产业韧性(二):AI大模型的商业“回旋镖”,重新落到了云计算
3 6 Ke· 2025-05-11 23:28
Core Viewpoint - The domestic large model industry is attempting to break through its current challenges and reconstruct a new order, but the unstable market environment poses significant risks [1] Group 1: Open Source Trends - DeepSeek has disrupted the industry's perception of open-source models, prompting OpenAI's CEO to reconsider the validity of open-source strategies [1] - Domestic large model companies like Alibaba, Baidu, and SenseTime are accelerating their open-source initiatives [1] - Open-source is seen as a key strategy to reduce dependency on foreign software and hardware, but the commercial viability of open-source projects remains complex [2][5] Group 2: Challenges in Implementation - Developers face significant technical adaptation and maintenance costs, despite open-source models lowering the technical barrier [4] - The integration of large models into existing systems requires extensive customization, which can be resource-intensive for companies [4] - The complexity of data acquisition, cleaning, and labeling poses additional challenges for businesses, particularly small and medium-sized enterprises [4] Group 3: Investor Sentiment - Investors are cautious about the open-source model due to the unclear profitability and traditional software sales evaluation methods not being applicable [5] - The potential for significant financial loss if investments in proprietary models are undermined by open-source alternatives is a concern for investors [4][5] Group 4: Business Models - Chinese large model companies are adopting a "free-to-use plus value-added services" model to build a commercial framework around open-source models [6][8] - Companies like Baidu are leveraging their cloud services to monetize the usage of their open-source models, creating a win-win situation for developers and the company [8] - The success of open-source models may depend more on the quality of cloud services than on the models themselves, as seen in the strategies of Meta and Hugging Face [9][10] Group 5: Future Outlook - Open-source is viewed as a pathway for the Chinese large model industry to overcome technological barriers, but commercial sustainability is equally important [10] - The increasing tariff barriers from the U.S. add pressure to the large model industry, making the choice of cloud platforms more critical than the open-source models themselves [10]
9点1氪:5月10日起结婚离婚都无需出示户口本;贾跃亭主动回应还债回国时间;心相印客服辱骂顾客并送冥币
36氪· 2025-05-09 15:30
Group 1 - The revised Marriage Registration Regulations will take effect on May 10, 2025, eliminating the requirement to present a household registration book for marriage and divorce [3] - The new regulations include three main aspects: expanding marriage and family service content, implementing nationwide marriage registration, and optimizing marriage registration services [3] - The marriage registration authority is prohibited from charging fees for processing marriage and divorce registrations [3] Group 2 - Xiamen Jihong Technology Co., Ltd. has passed the listing hearing on the Hong Kong Stock Exchange, with China International Capital Corporation and China Merchants Jinling International serving as joint sponsors [2] - Panasonic Group announced plans to lay off 10,000 employees globally, with 5,000 from Japan and 5,000 from overseas, during the fiscal year 2025-2026 [6] - Ningde Times is reportedly seeking to raise at least $4 billion through a Hong Kong listing [6] Group 3 - The Italian company Moltiply has filed a lawsuit against Google's parent company Alphabet, seeking €2.97 billion ($3.34 billion) in damages for abusing its market dominance [5] - The U.S. tariff war has significantly increased costs for American companies, with one bicycle manufacturer reporting a nearly threefold increase in wheel costs due to tariffs [6] - The recent divorce of the controlling shareholder of Zhu Cheng Technology involves the transfer of approximately 3.81 million shares, valued at around 3.81 million yuan [6]
中国电子:国产开源模型千帆竞发,阿里 Qwen-3、小米 MiMo、DeepSeek Prover 集中发布
Haitong Securities International· 2025-04-30 15:15
Investment Rating - The report indicates that Alibaba's Qwen currently ranks at the top of the open-source model rankings, with expectations for continued leadership in model capability and ecosystem monetization [2]. Core Insights - The report highlights a surge in domestic open-source models, with significant releases from Alibaba, Xiaomi, and DeepSeek, showcasing advancements in large language models (LLMs) [1][8]. - Alibaba's Qwen-3 series demonstrates substantial performance improvements, achieving 10-30% accuracy gains on various benchmarks and enhancing inference speed by 20-40% [9][12]. - Xiaomi's MiMo model, with 7 billion parameters, excels in reasoning and code generation tasks, outperforming larger proprietary models through innovative training strategies [10][12]. - DeepSeek's Prover-V2-671B model shows strong performance in formal logic reasoning, indicating a strategic focus on specialized AI applications [11][12]. - The report anticipates that as more domestic models are released, the industry may face challenges related to homogenization and competition, pushing for more customized solutions in vertical industries [5]. Summary by Sections Alibaba Qwen-3 - The Qwen-3 series includes models ranging from 1.5 billion to 72 billion parameters, designed for various inference needs, with notable performance enhancements over previous generations [9]. - Deployment costs are significantly lower, requiring only 4 H20 GPUs for full-capacity operation, which is advantageous compared to similar models from OpenAI and Grok [2][12]. Xiaomi MiMo - MiMo's training involved 25 trillion tokens and innovative mechanisms to improve training efficiency, achieving a 2.29x increase in training speed and a 1.96x acceleration in verification processes [10]. DeepSeek-Prover-V2-671B - This model excels in mathematical theorem proving, particularly in formal logic, and serves as a precursor to DeepSeek's upcoming models, reflecting the company's commitment to advancing AI capabilities [11]. Industry Trends - The report suggests that the next phase for open-source models will involve customization based on user data and feedback, aiming to establish long-term barriers and user loyalty in specific industries [5].
Qwen3真香!通义App满血接入,一手实测在此
量子位· 2025-04-30 04:10
鱼羊 一水 发自 凹非寺 量子位 | 公众号 QbitAI 开源大模型新王者,正在受到空前关注。 Qwen3预告一出,直接开启不眠夜模式。 △ 来自编辑部本部 等到深夜正式上线并宣布登顶全球最强开源模型,更是瞬间引爆全网热议。 | | | Hope you enjoy our new models! | | | | | | | | --- | --- | --- | --- | --- | --- | --- | --- | --- | | 22B | Qwen3-32B Dense | OpenAl-o1 2024-12-17 | Deepseek-R1 | Grok 3 Beta BB Think | QwQ-32B | Qwen3-4B Dense | Qwen2.5-72B-Instruct | Gemma3-27BIT | | | 93.8 | 92.1 | 93.2 | | 89.5 | 76.6 | 81.2 | 86.8 | | | 81.4 | 74.3 | 79.8 | 83.9 | 79.5 | 73.8 | 18.9 | 32.6 | | | 72.9 | 79.2 | 70.0 | ...
Qwen 3 发布,开源正成为中国大模型公司破局的「最优解」
Founder Park· 2025-04-29 12:33
阿里新一代的大模型 Qwen 3 今早发布,新旗舰 Qwen3-235B-A22B 的评测成绩,和 DeepSeek R1、Grok-3、Gemini-2.5-Pro 不相上下。这一代全系列模 型都支持混合推理,对 Agent 的支持也上了新台阶。 随着 Qwen 2.5 和 3 的发布,全球的开源模型生态也呈现了一种新形态:以 DeepSeek+Qwen 的中国开源组合,取代了过去 Llama 为主,Mistral 为辅的开 源生态。Qwen 系列的衍生模型目前已经是 HuggingFace 上最受欢迎的开源模型,衍生模型的数量也超过了 Llama 系列。而 DeepSeek 对于开源模型生态 的冲击和贡献,也有目共睹。 与大模型六小龙相比,主打开源的 Qwen 和 DeepSeek 无疑在国际市场赢得了更多开发者和创业者的关注,来自开源社区的代码贡献、更多优秀微调版本 的出现,也在以另外一种方式推动模型能力的进步。 可以说, 开源,正在成为中国大模型公司进入全球市场的最佳路径。 而对阿里云来说,Qwen+阿里云的配合,「模型-云-行业应用」的打法,走出了国内 MaaS 模式的新方向,也在很大程度上降低了国 ...