DeepSeek - filings, earnings calls, financial reports, news

DeepSeek

Sou Hu Cai Jing· 2025-08-06 08:43

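港股研究社 reports, citing Bloomberg, that BrainCo (强脑科技) is in talks to raise funds at a valuation of more than US$1.3 billion and may later pursue an initial public offering (IPO) in Hong Kong or mainland China. According to a person familiar with the matter, the company, founded in 2015 by Harvard alumnus Han Bicheng (韩璧丞), is negotiating about US$100 million in pre-IPO financing. The same person said the startup has begun preparing listing documents, although the listing venue and other details have not been decided. The financing and IPO discussions remain subject to change, and specifics may shift with market conditions. BrainCo representatives have not yet responded to requests for comment. The company competes with Neuralink, Elon Musk's brain-computer interface venture. BrainCo and DeepSeek are among the companies known collectively as Hangzhou's "Six Little Dragons"; BrainCo focuses on developing bionic limbs and technology that lets the human brain control computers. ...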

新财富· 2025-08-06 08:03

Core Viewpoint - The article discusses the recent decline in the usage and market share of DeepSeek, questioning the validity of the reported statistics and emphasizing the importance of considering third-party API usage in evaluating its performance [2][4][10].

Summary by Sections

Market Share and Usage Statistics
- Reports from Semianalysis indicate that DeepSeek's market share has dropped below 5%, with a significant decline since January [4][10].
- The statistics cited by Semianalysis focus primarily on official API usage, potentially overlooking significant third-party integrations and deployments [10][12].

Third-Party API Usage
- DeepSeek's third-party API calls have reportedly increased nearly 20-fold since the release of V3 and R1, indicating sustained interest from developers [11][12].
- The article argues that the decline in official API usage does not reflect overall demand for DeepSeek, because many applications integrate it without being captured in the official statistics [10][12].

Comparative Performance
- Data from OpenRouter shows DeepSeek V3 with token consumption of 378 billion, ranking it third behind Claude Sonnet 4 and ahead of Google's Gemini [17][22].
- Despite the decline in market share, DeepSeek retains over 50% of domestic B-end (enterprise) demand, indicating a strong market position [33].

User Preference and Community Engagement
- A survey by Artificial Analysis found that 53% of respondents still prefer DeepSeek, placing it fourth among AI product providers [39].
- DeepSeek-R1 continues to lead in popularity on platforms such as Hugging Face, indicating strong community support despite market fluctuations [44].

Industry Context and Future Outlook
- Given how quickly AI technology evolves, a decline in DeepSeek's market share may reflect the dynamic nature of the industry rather than a loss of relevance [49].
- The article highlights the importance of DeepSeek's open-source contributions in promoting AI equity, contrasting it with other companies that are moving away from open-source models [49][50].

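Did a minor Claude upgrade just beat OpenAI's "open-source masterpiece" nine years in the making? It stalls on high-intensity reasoning, hallucinates up to 50% of the time, and gets out-written by Kimi 2?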

AI前线· 2025-08-06 04:25

Core Viewpoint - OpenAI has released its first open-source language model series, gpt-oss, which includes gpt-oss-120b and gpt-oss-20b, both of which are fully customizable and support structured output [2][3].

Model Specifications
- gpt-oss-120b requires 80GB of memory to run, while gpt-oss-20b needs only 16GB [2].
- The models use a mixture-of-experts (MoE) architecture, activating 5.1 billion parameters per token for gpt-oss-120b and 3.6 billion for gpt-oss-20b, out of total parameter counts of 117 billion and 21 billion respectively [9].
- Both models support a context length of up to 128k and are designed for efficient deployment on consumer-grade hardware [10].

Training and Performance
- Training for the gpt-oss models combines reinforcement learning with techniques from OpenAI's advanced internal models, focusing on reasoning capability and efficiency [8].
- The gpt-oss models have shown strong performance on reasoning tasks, with gpt-oss-120b performing comparably to OpenAI's proprietary models on core reasoning benchmarks [10].

Comparison with Competitors
- Claude Opus 4.1 scored 74.5% on the SWE-bench Verified programming evaluation, outperforming previous Claude versions [5].
- Independent benchmark tests indicate that gpt-oss-120b scores below DeepSeek R1 and Qwen3 235B on intelligence, although its smaller parameter count gives it an efficiency advantage [13].

User Feedback and Limitations
- Users have reported mixed experiences with the gpt-oss models, noting that gpt-oss-120b is particularly unstable on coding tasks while gpt-oss-20b performs better [6][17].
- The models hallucinate frequently: gpt-oss-120b and gpt-oss-20b generate hallucinations at rates of 49% and 53% respectively, significantly higher than OpenAI's previous models [16].

Open Source and Accessibility
- The gpt-oss models are released under the permissive Apache 2.0 license, making them accessible for a range of applications, including agent workflows and tool use [11][10].
- The models are available for free download on Hugging Face, promoting wider adoption and experimentation within the developer community [2][3].
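
To put the mixture-of-experts figures quoted above in proportion, the short sketch below computes what share of each model's parameters is active per token. It uses only the parameter counts reported in the summary; the dictionary layout and the `active_fraction` helper are illustrative names introduced here, not part of any OpenAI tooling.

```python
# Parameter counts as quoted in the summary, in billions.
# The structure and names below are illustrative assumptions.
MODELS = {
    "gpt-oss-120b": {"total_b": 117.0, "active_b": 5.1},
    "gpt-oss-20b": {"total_b": 21.0, "active_b": 3.6},
}


def active_fraction(total_b: float, active_b: float) -> float:
    """Return the share of parameters activated per token in an MoE model."""
    if total_b <= 0:
        raise ValueError("total_b must be positive")
    return active_b / total_b


for name, spec in MODELS.items():
    share = active_fraction(spec["total_b"], spec["active_b"])
    print(f"{name}: {share:.1%} of parameters active per token")
```

On these figures, roughly 4.4% of gpt-oss-120b's parameters and 17.1% of gpt-oss-20b's are used for any given token, which is consistent with the efficiency advantage the summary attributes to the architecture.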