DeepSeek - filings, earnings calls, financial reports, news

DeepSeek

Search documents

未知机构：2026春节期间AI行业动态汇总一国内模型与产品发布-20260224

未知机构· 2026-02-24 04:05

Summary of AI Industry Dynamics During the 2026 Spring Festival Domestic Model and Product Releases 1. **Zhiyuan AI** (February 11): Released GLM-5 with a HumanEval code pass rate of 96.2%, ranking first in global open source and fourth overall; focuses on programming and intelligent capabilities [1] 2. **ByteDance** (February 14): Launched Doubao Model 2.0 (Pro/Lite/Mini/Code), achieving top rankings in math/programming benchmarks and reducing inference costs by an order of magnitude; simultaneously launched Seedance 2.0, capable of generating movie-quality multi-shot audio and video in 60 seconds, becoming a key visual creation tool for the Spring Festival Gala [1] 3. **Alibaba** (February 16): Open-sourced Qwen3.5-Plus with a total parameter count of 397 billion and only 17 billion activated, achieving a 60% reduction in memory usage and a 19-fold increase in inference efficiency; API priced at 0.8 yuan per million tokens, which is 1/18 of Gemini 3 Pro [1] Additional Domestic Releases 4. **DeepSeek** (February 17): Upgraded context window from 128K to 1M tokens, capable of processing ultra-long texts equivalent to the "Three-Body Problem" trilogy [2] 5. **MiniMax** (February 18): Released M2.5 model with native agent design, achieving a 37% speed increase for complex tasks [2] 6. **Tencent** (February 19): AI creation reached 1 billion instances over 16 days during the Spring Festival [2] Overseas Model and Product Releases 7. **Google** (February 19): Released Gemini 3.1 Pro, significantly enhancing inference capabilities and setting a new technical benchmark [3] 8. **OpenAI** (February 18): Launched a proprietary model based on Cerebras chips, achieving a training efficiency increase of 5-10 times and a cost reduction of 70% [3] 9. **Anthropic** (February 18): Released Claude Sonnet 4.6, priced at 1/5 of flagship models, with performance approaching Opus, offering excellent cost-performance for enterprise applications [3] 10. **xAI** (February 20): Updated Grok 4.2, with simultaneous upgrades in multimodal and inference capabilities [3] Financing and Capital Dynamics 11. **Moon's Dark Side (Kimi)** (February 20): Completed a $700 million financing round led by Alibaba, Tencent, and Wuyuan, with a valuation exceeding $10 billion; cumulative financing in January and February surpassed $1.2 billion [4] 12. **Zhiyuan AI/MiniMax** (February 20): Market capitalization on the first trading day after the Hong Kong holiday exceeded 300 billion HKD each, totaling over 580 billion HKD [4] 13. **Anthropic** (February 21): Completed a $30 billion Series G round, with a valuation reaching $380 billion, led by GIC and Coatue [4] 14. **OpenAI** (February 21): Nearing completion of the first phase of over $100 billion in financing, with a valuation expected to exceed $850 billion [4] 15. **Runway** (February 21): AI video company completed a $315 million Series E round, with a valuation of $5.3 billion, focusing on world model development [4]

Artificial Intelligence

Artificial Intelligence

未知机构：2026春节期间AI行业动态汇总一国内模型与产品发布1智谱AI2-20260224

未知机构· 2026-02-24 04:00

Summary of AI Industry Dynamics During the 2026 Spring Festival Domestic Model and Product Releases 1. **Zhiyuan AI** (Feb 11): Released GLM-5 with a HumanEval code pass rate of 96.2%, ranking first in global open source and fourth overall; focuses on programming and intelligent capabilities [1] 2. **ByteDance** (Feb 14): Launched Doubao Model 2.0 (Pro/Lite/Mini/Code), achieving top rankings in math/programming benchmarks and reducing inference costs by an order of magnitude; simultaneously launched Seedance 2.0, capable of generating movie-quality multi-shot audio and video in 60 seconds, becoming a key player in Spring Festival visual creation [1] 3. **Alibaba** (Feb 16): Open-sourced Qwen3.5-Plus with a total parameter count of 397 billion and only 17 billion activated, achieving a 60% reduction in memory usage and a 19-fold increase in inference efficiency; API priced at 0.8 yuan per million tokens, which is 1/18 of Gemini 3 Pro [2] 4. **DeepSeek** (Feb 17): Upgraded context window from 128K to 1M tokens, capable of processing ultra-long texts equivalent to the "Find Books" trilogy [2] 5. **MiniMax** (Feb 18): Released M2.5 model with native agent design, achieving a 37% speed increase for complex tasks [2] 6. **Tencent** (Feb 19): During the 16 days of the Spring Festival, AI creation reached 1 billion instances [2] Overseas Model and Product Releases 7. **Google** (Feb 19): Released Gemini 3.1 Pro, significantly enhancing inference capabilities and setting a new technical benchmark [3] 8. **OpenAI** (Feb 18): Launched a proprietary model based on Cerebras chips, achieving a training efficiency increase of 5-10 times and a cost reduction of 70% [3] 9. **Anthropic** (Feb 18): Released Claude Sonnet 4.6, with costs at 1/5 of flagship models and performance approaching Opus, offering high cost-performance for enterprise-level applications [3] 10. **xAI** (Feb 20): Updated Grok 4.2, with simultaneous upgrades in multimodal and inference capabilities [3] Financing and Capital Dynamics 11. **Moon's Dark Side (Kimi)** (Feb 20): Completed a $700 million financing round led by Alibaba, Tencent, and Wuyuan, with a valuation exceeding $10 billion; cumulative financing in January and February surpassed $1.2 billion [4] 12. **Zhiyuan AI/MiniMax** (Feb 20): On the first trading day after the Hong Kong stock market holiday, both companies' market capitalizations exceeded 300 billion HKD, totaling over 580 billion HKD [4] 13. **Anthropic** (Feb 21): Completed a $30 billion Series G round, with a valuation reaching $380 billion, led by GIC and Coatue [4] 14. **OpenAI** (Feb 21): Nearing completion of the first phase of financing exceeding $100 billion, with a valuation expected to surpass $850 billion [4]

Artificial Intelligence

Artificial Intelligence

炸了！Claude深夜怒撕DeepSeek、月之暗面、MiniMax，1600万次交互引争议

Xin Lang Cai Jing· 2026-02-24 01:28

Core Viewpoint - The article discusses the controversy surrounding Anthropic's claims about other AI companies allegedly using its model for distillation, which has sparked significant debate and skepticism within the AI community [1][14]. Group 1: Controversy and Reactions - Anthropic accused DeepSeek, Moonshot AI, and MiniMax of using 24,000 accounts to interact with its model Claude for 16 million times, leading to a public outcry [1][15]. - The public reaction has been largely critical of Anthropic, with many questioning the legitimacy of its claims and expressing support for the accused companies [2][11]. - Some commentators have drawn parallels to reverse engineering, arguing that paying customers should have the right to use products as they see fit [2][16]. Group 2: Model Distillation Explanation - Model distillation is described as a common method in AI training, aimed at creating smaller, more efficient models by transferring knowledge from larger models [13][31]. - The process involves a large model acting as a teacher, simplifying its knowledge for a smaller model, which can then operate more efficiently on less powerful devices [13][31]. - This technique is essential for making AI applications more accessible and usable on everyday devices, thus lowering operational costs and improving performance [29][31]. Group 3: Industry Implications - The ongoing debate highlights the competitive nature of the AI industry, where companies invest heavily to protect their intellectual property while also facing pressure to democratize technology [14][32]. - The article suggests that defining the boundaries of data usage and balancing copyright protection with innovation will be critical challenges for the industry moving forward [14][32].

Seek .(US:SKLTY)

大模型蒸馏

开源人工智能

Artificial Intelligence

Claude

大模型蒸馏

开源人工智能

Artificial Intelligence

Claude

北京企业搬迁指南：河马企享的全程托管实践

Sou Hu Cai Jing· 2026-02-24 00:46

在北京，企业搬迁远不止是"把东西从A点搬到B点"那么简单。它是一场涉及资产安全、业务连续性、员工体验和成本控制的综合战役。对于行政负责人而言，每一次搬迁都伴随着巨大的压力：如何确保服务器、精密仪器等核心资产万无一失？如何将搬迁对日常运营的干扰降至最低？如何安抚员工因个人物品打包、工位变动而产生的焦虑情绪？面对市场上服务标准不一、价格模糊的搬家公司，选择一家真正专业、可靠的服务商成为决定成败的关键。本文将聚焦于企业级搬迁服务，通过推荐几家在北京市场表现卓越的服务商，并结合一份实用的选择指南，帮助您在这场复杂的"空间迁移"中运筹帷幄。 l 503 call . 3 a a alle and | 2 - 8 第一部分：优质企业搬家公司推荐 1. 河马企享（北京河马到家信息科技有限公司）推荐指数：★★★★★ 核心定位：专注企业级点对点搬迁的"员工无感搬家"理念倡导者与践行者。服务与经验：成立于2018年，已成功服务超过万余次搬迁项目，覆盖从10人初创团队到千人规模集团的全场景。长期服务于梅卡曼德机器人、宜信财富、百川智能、DeepSeek等科技与金融领域标杆企业，对北京各商圈、产业园区的搬迁环境（如国 ...

Exclusive: China's DeepSeek trained AI model on Nvidia's best chip despite US ban, official says

Reuters· 2026-02-24 00:10

Core Viewpoint - The Chinese AI startup DeepSeek has reportedly trained its latest AI model on Nvidia's most advanced AI chip, the Blackwell, which may violate U.S. export controls [1][2]. Group 1: Company Information - DeepSeek's new AI model is expected to be released as soon as next week [1]. - The U.S. government believes that DeepSeek may attempt to remove technical indicators that could reveal its use of American AI chips [2]. - The U.S. policy emphasizes that Blackwell chips are not being shipped to China, indicating that DeepSeek's possession of these chips could represent a violation of export controls [2][4]. Group 2: Industry Context - The situation could further complicate discussions among Washington policymakers regarding Chinese access to advanced American AI semiconductor technology [3]. - The Chinese embassy in Washington has expressed opposition to the politicization of economic and technological issues, criticizing the expansive use of export controls [2].

Nvidia(US:NVDA)

Artificial Intelligence

Artificial Intelligence

Anthropic Says Chinese Labs Used 24,000 Fake Accounts To Rip Off Claude: Here's What It Means For AMZN, PLTR - Alphabet (NASDAQ:GOOGL)

Benzinga· 2026-02-23 20:58

Anthropic accused three Chinese AI companies of running 24,000 fraudulent accounts to siphon capabilities from its Claude chatbot, in what may be the largest documented case of AI model theft to date.DeepSeek, Moonshot AI and MiniMax generated over 16 million exchanges with Claude, violating terms of service and geographic access restrictions. The labs used a technique called distillation, where a weaker model trains on the outputs of a stronger one, to extract Claude’s most advanced reasoning, coding and t ...

Artificial Intelligence

Artificial Intelligence

Claude

Geopolitical Tensions Flare as US Envoys Set Iran Nuclear Talks; Anthropic Accuses Chinese Rivals of Data Siphoning

Stock Market News· 2026-02-23 18:38

Key TakeawaysUS Envoys Steve Witkoff and Jared Kushner are scheduled to meet Iranian officials in Geneva this Thursday for a third round of high-stakes nuclear negotiations.Anthropic has alleged that Chinese AI firms DeepSeek, Moonshot AI, and MiniMax used over 24,000 fraudulent accounts to siphon data from its Claude model to accelerate their own AI development.The U.S. Embassy in Beirut has ordered the immediate departure of all non-emergency staff and their families due to a deteriorating security situat ...

Alphabet(US:GOOGL)

Quantitative Tightening

Model Distillation

Artificial Intelligence

Claude model

Quantitative Tightening

Model Distillation

Artificial Intelligence

Claude model

Citi Nears Banamex Stake Sale; DeepSeek AI Launch Pressures Nasdaq

Stock Market News· 2026-02-23 17:08

Group 1: Citigroup and Banamex Divestiture - Citigroup is nearing a deal to sell stakes in its Mexican consumer banking arm, Banamex, to Blackstone and the Co-CEOs of Televisa, following a previous $2.3 billion sale of a 25% stake to Fernando Chico Pardo in late 2025 [2][10] - This divestiture is part of CEO Jane Fraser's strategy to simplify the bank's global footprint and focus on higher-return institutional businesses, with plans for an initial public offering (IPO) for the remaining portion of Banamex in 2026 [3][10] - The Banamex divestiture remains a core strategic priority for Citigroup as it prepares for the full IPO expected later in 2026 [10] Group 2: DeepSeek V4 and Nasdaq Valuations - The anticipated release of DeepSeek V4, a new large language model from a Chinese AI firm, is expected to challenge the high-margin hardware model currently dominated by Nvidia, potentially leading to a rough period for Nasdaq tech stocks [4][10] - Analysts warn that if DeepSeek demonstrates that advanced AI can be run on significantly cheaper hardware, it could trigger a valuation correction for major tech stocks like Microsoft and Alphabet [5] Group 3: Eurozone Inflation Divergence - The European Central Bank (ECB) faces a policy dilemma as inflation trends diverge in Germany and France, complicating the maintenance of a unified interest rate policy for the Eurozone [6][7] - Germany is experiencing persistent price pressures, while France's inflation has dipped below the ECB's 2% target, suggesting a need for monetary easing to prevent economic slowdown [6][7]

塑造自己的下一个版本2026前沿科技趋势报告解读（40页附下载）

Sou Hu Cai Jing· 2026-02-23 09:39

我来为您详细解读这份腾讯研究院发布的《塑造自己的下一个版本：2026前沿科技趋势》报告。这份报告以"用户视角"出发，眺望2030年的自己，围绕五个维度展开前沿科技趋势分析。 --- 一、生命力2030：从"活得久"到"活得好" 核心观点：人类生命正经历"第三次转型" 报告开篇指出一个关键转折：过去一百年人类寿命翻了一倍，但从1900年到2000年的快速增长后，预期寿命增速已大幅放缓。2024年《自然·衰老》期刊的研究表明，通过消除早夭和中年疾病来延长寿命的"容易摘的果实"已被采摘殆尽。 - Alnylam Pharmaceuticals开发的RNA干扰技术仅需每六个月一次皮下注射，即可控制高血压 - 斯坦福大学开发的mRNA CAR-T技术在小鼠淋巴瘤模型中实现75%的长期无瘤生存新范式诞生：全球正在从追求单纯的"寿命"（Lifespan）转向追求"健康寿命"（Healthspan）——即在没有严重慢性病、残疾或认知功能衰退的情况下维持良好生活质量的年限。据世界经济论坛报告，若将人类健康寿命延长1年，产生的全球经济价值将高达38万亿美元。三大技术支柱基因疗法进入"生命代码优化"时代 - CRI ...

ICLR 2026 | 北航开源Code2Bench：双扩展动态评测，代码大模型告别躺平刷分

机器之心· 2026-02-21 04:06

在衡量大语言模型（LLM）代码生成能力的竞赛中，一个日益严峻的问题正浮出水面：当模型在 HumanEval、MBPP 等经典基准上纷纷取得近乎饱和的成绩时，我们究竟是在评估其真实的泛化推理能力，还是在检验其对训练语料库的「记忆力」？现有的代码基准正面临两大核心挑战：数据污染的风险，以及测试严谨性不足。前者使评测可能退化为「开卷考试」，后者则常常导致一种「正确的幻觉」（Illusion of Correctness）—— 模型生成的代码或许能通过少数示例，却在复杂的真实世界边缘场景中不堪一击。为了打破这种「高分幻觉」，来自北京航空航天大学的研究团队提出了一种全新的基准构建哲学 —— 双重扩展（Dual Scaling），并基于此构建了端到端的自动化框架 Code2Bench 。该研究旨在为代码大模型的评估，建立一个更动态、更严苛、也更具诊断性的新范式。目前，该论文已被 ICLR 2026 接收。论文标题：Code2Bench: Scaling Source and Rigor for Dynamic Benchmark Construction 我们需要什么样的 Benchma ...

大语言模型代码生成能力评估

双重扩展（Dual Scaling）

Artificial Intelligence

Artificial Intelligence