DeepSeek
Search documents
美媒:中国AI在全球愈发受欢迎,全球多地机构正转向中国模型
Huan Qiu Wang· 2025-07-03 08:27
Core Insights - Chinese AI models are gaining popularity globally, challenging the dominance of the US in the AI industry [1] - Users from various sectors, including multinational banks and public universities, are turning to Chinese companies like DeepSeek as alternatives to US products like ChatGPT [1] Group 1: Adoption and Usage - HSBC and Standard Chartered have begun internal testing of DeepSeek's models [3] - Saudi Aramco has deployed DeepSeek in its main data center [3] - Major US cloud service providers, including Amazon AWS, Microsoft, and Google, are offering DeepSeek services to their clients [3] Group 2: Competitive Advantage - Chinese AI companies can provide comparable performance at significantly lower prices, attracting customers quickly [3] - DeepSeek's overall quality is reported to be similar to competitors, but its price is only 1/17 of that of alternatives, making it particularly appealing in countries with limited funding and computing resources [3][4] Group 3: Global Reach and Customization - The platform Latenode, based in Cyprus, helps global enterprises customize AI tools, with up to 20% of users opting for DeepSeek's models [3] - Abeja, a Tokyo-based AI startup, chose Chinese-developed AI products over those from Google and Meta for a project with Japan's Ministry of Economy, Trade and Industry [4] - The University of Johannesburg in South Africa selected DeepSeek for a pilot project due to its open-source nature and offline capabilities, which enhance data security [4]
deepseek崩了
Xin Lang Cai Jing· 2025-07-03 07:47
Core Insights - DeepSeek experienced a dramatic 24-hour period, achieving a peak of 30 million daily active users but subsequently suffering multiple service outages, highlighting the AI industry's focus on performance over stability [1][3] - The incident revealed systemic risks within DeepSeek's infrastructure, prompting a reevaluation of AI companies' approaches to stability as they transition from speed competition to endurance challenges [1][3] Group 1: Service Outage Details - The outage was not an isolated incident but a systemic risk, starting with a complete API service interruption that saw a 100% failure rate for developer calls [3] - The first failure occurred at 10:55 AM, with partial recovery by 11:32 AM, but a more severe crash followed, leading to a total service outage until 4:43 PM [3] - The economic impact was significant, with one enterprise reporting a 500% increase in customer complaints and direct losses exceeding 2 million yuan [3] Group 2: Technical Challenges - DeepSeek attributed the outages to sudden traffic spikes, system upgrades, and infrastructure fluctuations, but three structural issues were identified [5] - The first issue was a failure in traffic prediction, as user growth surged from zero to 30 million in just seven days, overwhelming server resources [5] - The second issue was the vulnerability of GPU clusters, which faced severe delays and data loss during peak traffic, leading to system protection mechanisms being triggered [5] - The third issue stemmed from the open-source model, which increased third-party deployments by 300%, further straining server capacity [6] Group 3: Recommendations for Stability - The incident underscored the need for a comprehensive stability assurance system in the AI industry, encompassing both technical and commercial aspects [7] - Upgrading technical architecture is essential, with examples like GMI Cloud's high-bandwidth GPU interconnects and Meta's software optimizations to improve task scheduling efficiency [7][8] - Innovations in business models, such as DeepSeek's private deployment option for enterprise clients, can alleviate pressure on public services [8] - The establishment of industry standards for AI service stability is crucial, with proposed requirements for top service providers to achieve 99.9% availability [8]
DeepSeek加入AI抢人大战,数月来首次在领英上发布招聘信息,剑指海外顶尖人才
Hua Er Jie Jian Wen· 2025-07-03 07:22
全球AI人才竞争白热化,继OpenAI和Meta竞相吸引顶尖AI人才之后,DeepSeek正在LinkedIn上发布招聘信息,可能寻求从海外吸引人才。 周三,这家总部位于杭州的公司在过去一周内在微软旗下的这一求职和社交网络平台领英上发布了10个职位,这是该公司数月来首次在该平台发 布招聘信息。 这些职位包括三个专注于通用人工智能(AGI)的岗位,工作地点位于北京和杭州。所有职位描述均以中文发布。 | 全球的职位 10 条结果 | | 订阅职位 | DeepSeek Al | | --- | --- | --- | --- | | | 前端开发工程师 | 4 | 前端开发工程师 | | | DeepSeek Al | × | 中国 浙江省 杭州 · 1 天前 · 10 位申请者 | | | 中国 浙江省 杭州 (现场办公) | | | | | 已查看 抢先申请 同 快速申请 | | 由招聘者推广·尚无可用回复洞察 | | | 全栈工程师 | × | 现场办公 录品 ● 0 / 3 项技能匹配 | | | DeepSeek Al | | | | | 中国 浙江省 杭州 (现场办公) | | 聞 快速申请 收藏 | ...
华人科学家撑起AI世界?这组数据太震撼了
2 1 Shi Ji Jing Ji Bao Dao· 2025-07-03 06:43
Group 1 - The rise of Chinese AI talent is becoming an undeniable phenomenon in the global tech landscape, with a significant increase in the proportion of top AI talent from Chinese universities [2][5] - In 2019, 27% of top AI talent in the US came from Chinese universities, which increased to 38% by 2022, surpassing the 37% from US universities [2] - Huang Renxun, CEO of Nvidia, noted that 50% of global AI researchers are from China, indicating a major contribution to AI research [2][5] Group 2 - The increase in Chinese AI talent is attributed to educational reforms, corporate innovation, and policy support, transitioning from "follower" to "leader" in AI [2][5] - The number of AI-related programs in Chinese universities has grown significantly, with over 600 institutions offering such programs by 2024 [2] - Companies like DeepSeek demonstrate that local teams in China are capable of independent R&D in top-tier AI technologies [3] Group 3 - The World Intellectual Property Organization reported that since 2017, patent applications in generative AI have increased by over 800%, with China leading in the number of patents [3] - In 2024, 61.5% of newly published generative AI patents originated from China, highlighting its dominance in this field [3] - Beijing is a key hub for AI talent, with a significant number of returnees working in AI and related fields [4] Group 4 - Despite the rapid growth of AI talent in China, challenges remain in foundational technologies, particularly in high-end chips [4] - The overall number of engineers in China is large, but there is a shortage of those with master's degrees, with a projected gap of 4 million high-skilled AI talents by 2030 [5] - Balancing the cultivation of exceptional engineers, breakthroughs in basic research, and global collaboration will be crucial for China's AI talent to take a central role on the world stage [5]
X @Bloomberg
Bloomberg· 2025-07-03 05:12
DeepSeek is ramping up its recruitment on LinkedIn, suggesting the Chinese AI startup may be looking to lure talent from outside its homeland https://t.co/yXyDCJqao2 ...
DeepSeek-R2!?神秘模型惊现竞技场,真实身份引网友猜测
量子位· 2025-07-03 04:26
Core Viewpoint - The article discusses the emergence of a mysterious model named "steve" from DeepSeek, sparking speculation about its identity and performance in comparison to other models like R2 and V4 [1][5][19]. Group 1: Model Identity and Speculation - Users are speculating about the identity of "steve," with suggestions ranging from it being R2, V4, or an upgraded version of an older model [3][19]. - "steve" has been confirmed to be associated with DeepSeek, although further details about its identity remain undisclosed [8][19]. - The model's presence is not visible on the public page, but traces of it can be found in the front-end code [5][6]. Group 2: Performance Comparison - Initial tests show that "steve" has passed certain intelligence tests, but it has also failed some questions [11]. - Comparisons between "steve" and V3 indicate that "steve" produced approximately 300 lines of game code, while V3 generated around 800 lines [13]. - Overall, "steve's" performance is perceived as underwhelming compared to V3 and R1, leading to doubts about it being R2 [22][19]. Group 3: Development and Release Timeline - The anticipated release of R2 has been delayed again, attributed to dissatisfaction from CEO Liang Wenfeng regarding its performance [25]. - The slow progress of R2's development may be linked to a shortage of NVIDIA H20 chips [26]. - Speculation about R2's capabilities includes parameters such as 1.2 trillion parameters and 5.2 petabytes of training data, although these claims remain unverified [32].
插件式AI应用异军突起,手机厂商原生智能助手陷增长瓶颈
Bei Jing Ri Bao Ke Hu Duan· 2025-07-03 00:52
Core Insights - The AI application market in China is undergoing significant changes, with "In-App AI" applications experiencing explosive user growth, reshaping the mobile internet ecosystem [1] Group 1: AI Application Types - The current AI applications in China are categorized into four main types: In-App AI applications, native AI apps from smartphone manufacturers, native apps from internet and AI tech companies, and PC web-based AI applications [2] Group 2: In-App AI Applications - In-App AI applications have seen a surge in active users, reaching 580 million in May, marking a year-on-year growth rate of 106.0%, making it the leading category among AI applications [5] - These applications are deeply integrated into major platforms like WeChat, Douyin, and Baidu, leveraging their vast user bases, with an average of 34.7 uses per user per month [5] - Major players like Douyin and Tencent have rapidly captured market share, with monthly active users of 200 million and 160 million, respectively [5] Group 3: Native AI Apps from Smartphone Manufacturers - Native AI applications from smartphone manufacturers are facing growth challenges, with 500 million monthly active users in May, reflecting a modest year-on-year growth of 9.5% [5] - Applications like Huawei's Xiaoyi and OPPO's Xiaobu Assistant, which previously thrived due to pre-installation advantages, are now experiencing user diversion due to competition from internet-native apps [5] - These applications are struggling with a lack of differentiated features and high levels of homogeneity, resulting in reduced user growth and engagement, with an average of 17.7 uses per user per month [5] Group 4: Internet and AI Tech Companies' Native Apps - Native apps from internet and AI tech companies show a stark polarization, with a total of 27 million monthly active users in May, driven by top applications like DeepSeek and Doubao, which have 16.8 million and 13 million users, respectively [6] - However, 83.8% of smaller AI applications have fewer than 1 million monthly active users, with many experiencing a decline in user numbers [6] - The PC web-based AI application sector also reflects this polarization, with a total of 19 million monthly active users [6] Group 5: Competitive Landscape - The AI application competition has entered an ecosystem-building phase, with major players optimizing the placement and form of AI plugins to attract more users [9] - Companies are leveraging their strengths to bind AI plugins, creating new competitive advantages, such as Tencent's focus on AI search and social interaction plugins, and Douyin's emphasis on AI search and image/video processing plugins [9] - This shift is not only squeezing the market share of native AI apps from smartphone manufacturers but is also changing traditional user behaviors in searching for news and processing images [9]
OpenAI 研究员 Noam Brown:Mid-training 是新的 pre-training
海外独角兽· 2025-07-02 11:03
Core Insights - The article discusses the emergence of reasoning capabilities in AI models, highlighting a shift from mere pattern matching to complex cognitive reasoning, which is essential for scientific discovery and decision-making [4][5]. Group 1: Reasoning as an Emergent Capability - Reasoning is an emergent ability that models can only benefit from once pre-training reaches a certain level [5][11]. - The analogy of "fast thinking and slow thinking" is used to explain the relationship between non-reasoning and reasoning models, where the former corresponds to intuitive responses and the latter to deliberate reasoning [8][11]. - The performance of models in multi-modal tasks depends on their ability to integrate complex information and logical reasoning [12][13]. Group 2: Need for a Universal Reasoning Paradigm - Achieving superintelligence requires a universal reasoning paradigm, as merely scaling pre-training is insufficient [20][21]. - OpenAI's leadership recognized the need for a shift towards reasoning paradigms and reinforcement learning, leading to significant resource allocation in these areas [21][24]. Group 3: Efficient Data Utilization through Reinforcement Learning - Reinforcement learning can enhance the efficiency of data usage, which is crucial as data becomes scarcer than computational power [25]. - Current machine learning models require significantly more samples than humans to learn new concepts, highlighting the need for improved sample efficiency [25][26]. Group 4: Non-Consensus Views on Reasoning Ability - Reasoning is not limited to tasks with clear reward functions; it can also excel in subjective fields where results are harder to quantify [33]. - The alignment of AI with user preferences is critical, and reasoning capabilities can help achieve this alignment while mitigating ethical risks [34][35]. Group 5: Bottlenecks in Test-Time Compute Development - Test-time compute faces cost limitations similar to those encountered during pre-training scaling, where increased model size leads to exponentially rising costs [36]. - The absolute time constraints on model responses hinder the speed of experimental iterations, impacting research efficiency [37][38]. Group 6: Mid-Training as a New Pre-Training Phase - Mid-training is introduced as a phase that adds new capabilities to models before the completion of pre-training, enhancing their generalization and practicality [40][41]. - OpenAI has adopted mid-training strategies in its model training processes to improve alignment and safety [41][42]. Group 7: Insights from The Bitter Lesson for Multi-Agent Systems - The concept of multi-agent systems may lead to the emergence of an "AI civilization" through long-term collaboration and competition among AI agents [44]. - Noam's team is exploring a principled research path that contrasts with traditional heuristic-based approaches in multi-agent research [45][46].
“美国自毁长城,中企凭高性价比一路高歌猛进”
Guan Cha Zhe Wang· 2025-07-02 08:45
Core Viewpoint - Chinese AI companies are challenging the dominance of American AI models globally, with increasing adoption of Chinese AI models like DeepSeek across various international sectors, despite U.S. restrictions [1][5]. Group 1: Market Dynamics - International users, including major banks like HSBC and Standard Chartered, are testing Chinese AI models, indicating a shift in preference from American products to Chinese alternatives [1]. - The download figures for OpenAI's ChatGPT stand at 910 million, while DeepSeek has reached 125 million, showcasing a significant gap but also a growing presence of Chinese models [2]. - Chinese AI models are closing the performance gap with American counterparts, with DeepSeek scoring 1,424, closely following Google's Gemini and OpenAI's ChatGPT [3]. Group 2: Strategic Approaches - Chinese AI companies are focusing on practical applications and open-source models, which have spurred global interest and usage, contrasting with the closed-source, high-cost models of American firms [4]. - The open-source strategy has led to significant adoption of Chinese models, with reports indicating that DeepSeek is chosen by one in five users on a global AI platform due to its cost-effectiveness [4]. Group 3: Geopolitical Implications - U.S. restrictions on Chinese AI development have backfired, leading to losses for Western chip manufacturers and failing to halt the progress of Chinese AI [5]. - The increasing adoption of Chinese AI models globally poses a risk to the market share and revenue of American companies like Google and Meta [5]. - The divide between U.S. and Chinese AI systems may hinder global cooperation on AI safety, potentially weakening the ability to address future AI risks [6].
中国AI,正在全球突围
Hua Er Jie Jian Wen· 2025-07-02 08:14
Core Insights - Chinese AI is challenging the dominance of American AI in the global market by leveraging cost advantages and application-oriented strategies [1][2][4] - A significant portion of users, up to one-fifth globally, are opting for Chinese AI models like DeepSeek over American counterparts [1][2] - The performance of Chinese AI models is comparable to that of American models, but at significantly lower prices, making them particularly attractive in regions with limited resources [2][3] Cost Advantage - Chinese AI companies are successfully penetrating the global market by adopting a low-cost strategy, even in the face of American barriers [2] - DeepSeek is reported to be 17 times cheaper than its American counterparts, which is especially appealing in countries like Chile and Brazil [2] - Alibaba has seen over 100,000 derivative models created based on its open-source AI model Qwen, indicating strong developer engagement [2] Application Focus and Open Source - The Chinese AI industry prioritizes practical applications of AI, which may facilitate quicker user acquisition compared to American companies focused on major breakthroughs [3] - Leading Chinese AI firms, including Tencent and Baidu, benefit from open-sourcing their AI models, allowing users to customize them to meet specific needs [3] - The open-source nature of models like DeepSeek has made them attractive for institutions like South Africa's University of the Witwatersrand, which values data security and offline capabilities [3] Industry Implications - The weakening dominance of American AI companies diminishes their power in setting global technology standards [4] - The outcome of the AI competition will ultimately depend on which technology gains broader adoption worldwide, as highlighted by Microsoft's president [4]