DeepSeek - filings, earnings calls, financial reports, news

DeepSeek

Search documents

上海楼市重磅新政，非沪籍大松绑；传飞天茅台出厂价涨130元；宝马降价27万上热搜；美国公司指控中企“偷”模型，马斯克嘲讽|| 大件事

Sou Hu Cai Jing· 2026-02-25 08:51

Group 1: Guizhou Moutai Price Adjustment - Recent rumors suggested that the factory price of Feitian Moutai increased from 1169 yuan to 1299 yuan per bottle, which attracted market attention [3] - Guizhou Moutai's representative denied the rumors, stating that any price adjustments would be disclosed as significant events [3] - The last official price adjustment occurred on November 1, 2023, with an average increase of about 20%, while the market guidance price remains at 1499 yuan per bottle [5] - Analysts believe that the price increase rumors may stem from stock market activities, as the date mentioned coincides with the first trading day of A-shares [6] - Feitian Moutai's revenue is crucial for the company's performance, accounting for 85.29% of total revenue in 2024, with a projected growth rate of 15.28% [6] Group 2: Shanghai Real Estate Policy Changes - On February 25, 2024, Shanghai's housing authorities announced new policies to optimize the real estate market, effective from February 26 [7] - The new policy reduces the social insurance or income tax payment duration required for non-local residents to purchase homes from three years to one year [11] - Non-local residents with a residence permit for over five years can now purchase one property in the city without needing to provide proof of social insurance or tax payments [11] - The maximum loan amount for first-time homebuyers using housing provident funds has been increased from 1.6 million yuan to 2.4 million yuan, with potential increases for families with multiple children [14] - In 2023, Shanghai's real estate market saw a decline in new housing starts by 31.8% and a decrease in residential sales area by 9.7% [14] Group 3: Luxury Car Price Reductions - The BMW 7 Series has seen a price reduction of approximately 270,000 yuan, trending on social media [15] - Luxury fuel vehicles are experiencing significant discounts, with brands like Mercedes-Benz and Audi offering substantial price cuts on various models [16] - The market share of fuel vehicles has decreased from 72% to 45% over the past three years, attributed to the rise of domestic electric vehicles [16] - Predictions indicate that luxury car prices may continue to drop by 10%-15% in the first half of 2026 due to competition from electric vehicles [17] Group 4: Yacht Industry Developments - Liu Qiangdong announced the establishment of a yacht brand, Sea Expandary, with a personal investment of approximately 5 billion yuan [19] - The brand aims to make yachts more accessible, targeting a price point of around 100,000 yuan for entry-level models [19] - The Chinese yacht market is rapidly growing, with new registrations accounting for 54.7% of the total number of yachts [20] Group 5: AI Industry Controversies - Anthropic accused three Chinese AI companies of conducting "distillation attacks" on its Claude model, claiming over 16 million interactions through 24,000 fake accounts [21] - The accusations have been met with skepticism, with critics questioning Anthropic's data sources and the legitimacy of its claims [22] - Elon Musk criticized Anthropic's allegations, suggesting that they themselves have engaged in data theft [23]

KWEICHOW MOUTAI(SH:600519)

Anthropic攻击中国AI模型，到底给谁做了广告？

Guan Cha Zhe Wang· 2026-02-25 06:42

Core Viewpoint - Anthropic accuses three Chinese AI companies, DeepSeek, Moonlight, and MiniMax, of conducting "industrial-grade distillation attacks" on its flagship model Claude, claiming systematic extraction of model capabilities through 16 million requests and thousands of fake accounts [1][2] Group 1: Accusations and Implications - Anthropic's accusations appear to be more than just business competition, as they seem to elevate the issue to a political level aimed at hindering the development of Chinese AI companies [1] - The term "distillation" refers to a common technique for transferring capabilities from large models to smaller ones, which is more akin to mimicking learning rather than outright copying [1] - The company emphasizes that the targeted capabilities of Claude are its most differentiated features, suggesting that competitors are resorting to unethical means due to the strength of their model [2] Group 2: Monitoring and Privacy Concerns - Anthropic claims to have the ability to trace certain accounts back to specific researchers at DeepSeek and to infer undisclosed product release plans based on request patterns, raising significant privacy concerns [2] - The statement implies that users of Claude may inadvertently disclose their questions, thought processes, and work plans to Anthropic, highlighting fears regarding data monitoring and privacy in proprietary AI models [2] Group 3: Industry Reactions and Criticism - Elon Musk criticized Anthropic, pointing out that the company itself relied on "infringement" by using vast amounts of data for training, which raises questions about the ethical standards in the industry [3] - The criticism of competitors for "stealing" capabilities while ignoring their own history of knowledge extraction from the internet reflects a double standard in the industry [3] - Anthropic's actions inadvertently serve as a strong advertisement for open-source AI, as they demonstrate the risks associated with closed-source AI systems regarding user privacy and autonomy [4] Group 4: Broader Implications and Company Stance - The narrative suggests that commercial competition is being framed as a matter of national security, indicating that only companies from specific countries are deemed qualified to develop powerful AI [5] - This approach not only risks fostering distrust in closed-source AI systems but also highlights the potential dangers of using national security as a guise for business competition [5]

人工智能

闭源AI

蒸馏技术

Artificial Intelligence

Artificial Intelligence

Claude

Anthropic一条推文，引发了全球AI圈同仇敌忾的群嘲。

数字生命卡兹克· 2026-02-25 02:38

Core Viewpoint - The article discusses the controversy surrounding three Chinese AI companies—DeepSeek, Moonshot, and MiniMax—accused of conducting "industrial-scale distillation attacks" on Anthropic's Claude model, raising significant national security concerns [3][4]. Group 1: Distillation and Its Implications - Distillation is a common AI training technique where a larger model (teacher) is used to train a smaller model (student), allowing the smaller model to learn from the larger one without needing to replicate its entire complexity [11][12]. - The article highlights that while distillation is a normal practice, the legality of how training data is obtained is crucial, especially when it involves using outputs from other companies' models [13]. - Anthropic claims that the Chinese companies created 24,000 fake accounts to extract Claude's outputs, which they argue violates their service terms [13][16]. Group 2: Legal and Ethical Concerns - Anthropic faced a lawsuit for using over 7 million copyrighted books from piracy sites like LibGen and PiLiMi to train Claude, leading to a historic $1.5 billion settlement [14][16]. - The court ruled that using legally purchased books for AI training is fair use, while using pirated books constitutes infringement, highlighting the importance of legal data acquisition methods [17]. - The article points out the double standards in the industry, where Anthropic criticizes others for data extraction while having engaged in similar practices [19]. Group 3: Broader Implications of AI and Copyright - The article raises philosophical questions about the nature of "theft" in the context of AI, suggesting that traditional notions of theft do not apply to data, as copying does not deplete the original [25][26]. - It discusses the historical context of copyright issues, noting that technological advancements have consistently sparked debates about intellectual property rights [30][31]. - The emergence of AI challenges existing frameworks of creativity and ownership, as AI can learn from and generate new content based on existing works [35][36]. Group 4: Future Considerations - The article emphasizes the need for a nuanced discussion on AI and copyright, moving beyond binary views of "theft" to consider the complexities involved [44][45]. - It warns against the monopolization of AI training capabilities by a few companies, particularly in the U.S., and the implications this has for global equity in AI development [47][48].

马年首涨：中概股破局，A股引领全球资本新节奏

Sou Hu Cai Jing· 2026-02-25 01:03

1 State 46.5 0 - 10 1- . 40, 10 0 in r t 9 14.0 8 8 e y t a 1 11 the first 12 27 h ge 197 th - " ice 0 . 版权留片 a 在黄金因美元强势而黯然跳水、A股于春节后首个交易日释放出久违的磅礴巨量之际，大洋彼岸的美股市场，第一次真切且清晰地捕捉到了来自东方的强劲"发令枪"声。截至凌晨 5 点收盘，纳斯达克中国金龙指数（HXC）一路狂飙，大涨 1.37%，收于 7581.04 点，涨幅甚至力压纳斯达克指数本身的 1.04%。分时图上，HXC 指数宛如一条优雅的平滑曲线，从盘中低点 7507.97 点稳步攀升至高点 7596.64 点，尽显强势。成交量的贡献为 1.19%，这是全球资本用实实在在的真金白银，对 A 股白天上涨的有力回应。个股表现同样可圈可点，万国数据、世纪互联涨幅超 6%，金山云涨超 4%，小鹏汽车涨超 6%，台积电涨超 4%。这些领涨的板块——科技、新能源、半导体，恰恰与 A 股白天表现最为强劲的方向不谋而合。至此，一个完整的传导链条清晰呈现：A 股白天强势大涨，中概股夜间紧密跟涨。至 ...

3 6 Ke· 2026-02-25 00:48

Core Insights - DeepSeek has maintained a rare silence during the Spring Festival, contrasting sharply with competitors like Zhiyu, Qianwen, and Jimeng, who have actively released new models [1][3] - The company has not been able to replicate the explosive success of its initial model R1, which was launched a year ago and created unprecedented attention in the AI sector [3][5] - DeepSeek's decline in public interest appears to be a deliberate choice, reflecting a complex strategy in the current AI landscape [4][12] Company Performance - The R1 model launched in February 2025 marked a significant milestone for DeepSeek, leading to a record growth in consumer AI applications [5] - Subsequent models like V3 and DeepSeekMath have not achieved the same level of impact, although user retention and daily active users remain strong, indicating no significant loss of users [6][9] - DeepSeek has continued to innovate with frequent model updates, maintaining a leading position in the competitive landscape [9][11] Industry Dynamics - The competitive environment has intensified, with new entrants and existing players accelerating their technological advancements, which has diluted DeepSeek's media presence [13][14] - DeepSeek's low-profile approach to marketing and product releases contrasts with the aggressive promotional strategies of global giants like OpenAI and Google [15][17] - The company has focused on niche applications, which may limit its appeal to a broader audience, as seen with its specialized models targeting developers rather than general consumers [17] Strategic Choices - DeepSeek's decision to remain low-key is seen as a strategic move to avoid unnecessary attention and potential backlash in a politically sensitive environment [21][22] - The company aims to maintain a collaborative image within the tech community, enhancing trust in its models among global users [21][22] - By avoiding excessive publicity, DeepSeek seeks to navigate the complexities of the AI industry while fostering a multi-model approach that aligns with market expectations [23][25]

Seek .(US:SKLTY)

DeepSeek R1

指责中国公司窃取技术的Anthropic，才是北美最大“偷子”？

3 6 Ke· 2026-02-24 23:36

Core Viewpoint - The article discusses the controversy surrounding Anthropic's accusations against Chinese AI companies for allegedly stealing its Claude model's capabilities, highlighting the irony of Anthropic's own past copyright infringement issues [1][2][10]. Group 1: Allegations and Accusations - Anthropic accused three Chinese AI companies—DeepSeek, Moonshot AI, and MiniMax—of using fraudulent methods to train their models by interacting with Claude, claiming over 24,000 fake accounts were created for approximately 16 million interactions [1]. - The accusations are seen as a strategic move in the context of the ongoing AI competition between the US and China, positioning Anthropic as a victim of intellectual property theft [1][10]. - Elon Musk criticized Anthropic's stance, questioning how these companies could steal from Anthropic when it had previously taken from human programmers [2]. Group 2: Anthropic's Past Issues - Anthropic previously faced a significant legal challenge, paying $1.5 billion to settle a lawsuit for illegally using copyrighted books to train its Claude models, which were sourced from piracy sites [7][9]. - The court ruling established that using illegally obtained data for AI training constitutes copyright infringement, which directly impacted Anthropic's operations and reputation [8][9]. - The settlement was the largest copyright settlement in AI history, covering around 500,000 works, and required Anthropic to destroy all illegally obtained training data [9]. Group 3: Ethical and Legal Implications - The article raises questions about the ethical implications of AI companies using vast amounts of data, often including copyrighted material, to train their models, highlighting a broader industry issue [15]. - The distinction between "distillation" as a method of knowledge transfer and outright theft is debated, with the former being a common practice in AI development [4][5]. - The controversy reflects a double standard in the industry, where companies like Anthropic can utilize global data for training but seek to restrict others from using their models' outputs [16].

AI蒸馏技术

AI知识产权

Artificial Intelligence

Artificial Intelligence

Claude

R1模型

How one AI company is helping businesses navigate Trump’s new tariff chaos following the Supreme Court ruling

Yahoo Finance· 2026-02-24 19:39

Group 1 - The global AI Impact Summit in New Delhi resulted in voluntary commitments to distribute AI technology benefits more equitably and secured $200 billion in new AI investment for India [2] - Anthropic accused Chinese AI companies DeepSeek, Moonshot AI, and MiniMax of conducting an industrial-scale campaign to distill its Claude models, creating 24,000 fake accounts to generate 16 million exchanges with Claude [4] - The U.S. government suspects that DeepSeek trained its upcoming V4 model using Nvidia's Blackwell AI GPUs, potentially violating U.S. export controls [4] Group 2 - The situation indicates that Chinese AI labs may be resorting to covert methods to match U.S. AI performance, suggesting that U.S. companies may maintain their edge in state-of-the-art AI technology [5] - Despite performance concerns, the adoption of Chinese AI models is increasing outside the U.S. and Europe due to their open-source nature and lower costs compared to American models [5] - Chinese efforts to develop domestic AI chips capable of competing with Nvidia's have not yet succeeded, as indicated by the reliance on Nvidia's technology [5]

Nvidia(US:NVDA)

Artificial Intelligence

Frontier AI agent platform

Blackwell AI GPUs

Claude

Artificial Intelligence

Frontier AI agent platform

Blackwell AI GPUs

Claude

DeepSeek、月之暗面、MiniMax被指大规模蒸馏Claude，MiniMax交互超1300万次

Ge Long Hui· 2026-02-24 18:00

Core Insights - Anthropic reported that three organizations systematically created over 24,000 fraudulent accounts, resulting in more than 16 million interactions with Claude, aimed at extracting model capabilities for their own training and optimization [1][7]. Group 1: Distillation Activities - The three distillation actions exhibited highly similar operational methods, utilizing fake accounts and proxy services for large-scale access to evade platform detection [7]. - Anthropic identified these actions through multiple technical evidences, including IP address associations and request metadata, achieving high-confidence attribution [7]. - The attacks primarily targeted Claude's differentiated capabilities in agentic reasoning, tool usage, and code generation [7]. Group 2: DeepSeek Investigation - In the investigation of DeepSeek, Anthropic confirmed that the scale of operations exceeded 150,000 interactions, focusing on multi-task reasoning and sensitive question rephrasing [8]. - DeepSeek's accounts displayed synchronized traffic patterns and payment methods, resembling a "load balancing" feature to enhance throughput and reduce detection risk [8]. - One identified technique involved prompting Claude to "retrace and write out its internal reasoning process," generating large-scale chain-of-thought training data [8]. Group 3: Moonshot AI and MiniMax - For Moonshot AI, Anthropic disclosed over 3.4 million interactions, concentrating on agentic reasoning, programming, and computer vision capabilities [8]. - Moonshot employed hundreds of fraudulent accounts and mixed various access paths to lower the overall recognizability of their actions [8]. - The largest distillation activity was attributed to MiniMax, with over 13 million interactions, focusing on agentic programming capabilities and tool orchestration [8]. - Anthropic was able to observe the entire process of a distillation attack from data generation to model release, as MiniMax adjusted its strategy shortly after the release of a new model [8]. Group 4: Security Measures - Anthropic stated that the findings have been used to enhance the platform's security and abuse detection mechanisms, although further details on actions taken were not disclosed [9]. - As of the report's publication, DeepSeek, Moonshot AI, and MiniMax had not responded to the situation [9].

Seek .(US:SKLTY)

工业化规模蒸馏攻击

Artificial Intelligence

Claude

工业化规模蒸馏攻击

Artificial Intelligence

Claude

中金：：人工智能十年展望）：越过“遗忘”的边界，模型记忆的三层架构与产业机遇

中金· 2026-02-24 14:20

Investment Rating - The report maintains the profit forecasts, target prices, and ratings for relevant companies unchanged [6] Core Insights - The evolution of large models is fundamentally a history of combating "forgetting." The lack of a memory retention architecture leads to costly "repeated calculations" each time historical information is processed. The current model faces physical limits of memory walls and context windows. The report suggests that the AI infrastructure battlefield will increasingly focus on "model memory" starting in 2026 [3][14] - The report introduces a three-layer memory framework: short-term, medium-term, and long-term memory, each corresponding to different software and hardware requirements. This framework aims to provide a structured analysis paradigm for investment logic in AI infrastructure [14][20] Summary by Sections Short-term Memory - Short-term memory constitutes the "current view" of large models during single inference tasks. It is characterized by high-frequency read/write and extreme sensitivity to latency. The core challenge lies in the dual occupation of memory capacity and bandwidth by KV Cache. Software optimizations include PagedAttention virtualization and cutting-edge architectures like Infini-attention to support million-token context windows. Key hardware elements include HBM and on-chip SRAM [4][30][50] Medium-term Memory - Medium-term memory ensures situational continuity across sessions and is foundational for agents. The need for cross-session windows indicates a shift from stateless short-term intelligence to a complex system capable of "storage-retrieval-update-forget" dynamic management. Software advancements like GraphRAG and MemoryOS facilitate this transition, while hardware requirements include large-capacity DRAM and enterprise-grade SSDs to address high-concurrency random read/write bottlenecks [4][56] Long-term Memory - Long-term memory supports the transition from pre-training to "continuous evolution." The need for real-time updates blurs the lines between model training and inference. Long-term memory aims to break the limitations of pre-training cut-off times, allowing for continuous knowledge accumulation through implicit parameters, explicit semantics, and parameterized lookup tables. This new paradigm will drive demand for various databases and compute-storage hardware [5][21] Hardware and Software Requirements - The report outlines the hardware and software requirements for each memory layer, emphasizing the need for high-bandwidth memory (HBM), large-capacity DRAM, and enterprise SSDs. It also highlights the importance of software solutions like KV Cache management and advanced attention mechanisms to optimize memory usage and enhance performance [16][50][64]

人工智能

模型记忆

大语言模型

Artificial Intelligence

Artificial Intelligence

Google Titans

Google MIRAS

Anthropic这波操作，把当婊子和立牌坊玩到了极致

Sou Hu Cai Jing· 2026-02-24 12:16

Core Viewpoint - Anthropic's accusations against Chinese AI companies for distilling Claude's capabilities reflect a deeper issue of competition and insecurity within the AI industry, suggesting that when American AI companies face challenges, they resort to political maneuvering instead of fair competition [1][3][5] Group 1: Business and Commercial Logic - Distillation, the process of using another company's API to train one's own model, is a common practice in the AI industry, and many companies engage in it without issue [1] - Anthropic, while accusing Chinese companies of misconduct, itself relies on vast amounts of publicly available data for training its models, raising questions about its own practices regarding data acquisition [1][2] - The competitive landscape has shifted, with Chinese AI companies offering effective open-source models and cheaper API prices, leading to a decline in Anthropic's market position [3] Group 2: Political and Ethical Implications - Anthropic's claims about Chinese AI models being used for military and intelligence purposes appear to be a strategic move to undermine competitors rather than a genuine legal concern [4][5] - The accusations serve as a means for Anthropic to leverage political influence against Chinese AI companies, indicating a growing sense of threat from these competitors [5]

AI模型蒸馏

Artificial Intelligence

Claude

AI模型蒸馏

Artificial Intelligence

Claude

Previous Next