Workflow
DeepSeek
icon
Search documents
为什么大厂必须抢郭达雅?
36氪· 2026-03-23 13:42
以下文章来源于字母AI ,作者苗正 字母AI . 聚焦前沿科技,抢先看到未来。 DeepSeek正在经历一场严峻的考验。 文 | 苗正 编辑 | 王靖 来源| 字母AI(ID:faceaibang) 封面来源 | Unsplash 有这样一则消息在AI圈悄然流传:DeepSeek研究员郭达雅已经离职。 大家第一时间的反应普遍是"谁?谁是郭达雅?" 郭达雅不一样,他是代码智能和数学方向的,他刚好 可以补强 字节在Vibe Coding以及AGI这两大板块。 如果是去百度,那也说得通。文心快码在3月份刚刚完成了4.0版本迭代,推出了多agent协同全链路开发的功能。 但是你知道文心快码3.0是什么时候发布的吗?是2024年11月。两个大版本中间相隔了一年多,这在以周为单位的AI圈是不太常见的。 这不难理解,因为郭达雅的知名度远不如创始人梁文锋以及"天才AI少女"罗福莉。 但是在学术研究以及对DeepSeek大模型的贡献上,郭达雅要比后两者高许多。 截止至发稿,郭达雅发表的论文已经被引用超过37000次,远远超过了同龄的研究者。 郭达雅的h指数为37,i-10指数为46,说明他不仅学术产出非常稳定,而且他还发表了多 ...
未知机构:多家AI模型厂商已上调其API定价-20260323
未知机构· 2026-03-23 02:15
Summary of Conference Call Records Industry Overview - Multiple AI model vendors have raised their API pricing, reflecting high and rising costs of computing, memory, and electricity, alongside rapidly growing inference demand driven by agents like OpenClaw [1][2] - In the U.S., API pricing remains approximately six times higher than in China, indicating a tight supply of computing resources and previously unsustainable low pricing levels in China [1][2] Key Points and Arguments - The increase in API pricing is driven by expensive and tight supply of computing and memory resources, with many U.S. and Chinese AI vendors adjusting their model API pricing due to soaring costs [1][2] - The average API price in the U.S. has been raised by 17% to 67% by companies like Anthropic, Google, and OpenAI, while memory prices have surged by 3 to 5 times, and next-generation AI servers and GPUs are becoming more costly and power-hungry [2] - Despite the growth in inference demand, the rapid increase in API pricing may help control this demand, as most AI vendors face pressure to raise their API prices [2] Company-Specific Insights - In China, independent AI model vendors may face greater margin pressure, with five AI vendors raising their model API pricing and two lowering it, including Grok and Alibaba [3] - MiniMax plans to reduce the price of its M2.7 model by 50% by October 2025, making it the second cheapest AI model after DeepSeek [3] - Alibaba Cloud has increased its pricing for third-party computing/storage by 5% to 34% while reducing its model API pricing by 42%, likely to enhance competitiveness but indicating potential margin pressure for independent AI vendors renting computing/storage from Alibaba Cloud [3] Investment Risks and Opportunities - The value of AI is primarily flowing to upstream hardware manufacturers, presenting investment return risks [4] - AI model vendors must invest heavily in computing to enhance model performance and support growing inference demand, suggesting that current investment opportunities are mainly concentrated in upstream hardware suppliers such as CPU/GPU, memory, optical communication, and data centers [4] - The potential for investment returns remains a significant risk in the global AI development landscape [4]
大厂抢郭达雅进行时!DeepSeek核心成员还是个“综艺巨佬”
量子位· 2026-03-22 06:28
Core Viewpoint - The article discusses the departure of Guo Dayan, a key engineer at DeepSeek, who has significantly contributed to various models including V2, V3, and R1, raising concerns about the potential impact on DeepSeek's future developments [1][6][7]. Group 1: Guo Dayan's Background and Achievements - Guo Dayan is recognized as a technical prodigy with a remarkable academic and competitive history, often referred to as the "Lei Jun of Sun Yat-sen University" [2][42]. - He completed his doctoral thesis requirements just three days after starting his postdoctoral studies, showcasing exceptional research efficiency [3][35]. - Guo has won multiple championships in competitions such as the Tencent Advertising Algorithm Competition and the ATEC Technology Elite Competition, earning substantial monetary rewards [4][44][46]. Group 2: Contributions to DeepSeek - Guo Dayan joined DeepSeek after completing his PhD in 2023, focusing on code intelligence and large language model inference [8][10]. - He was a core contributor to several models, including DeepSeek-Coder, DeepSeek-Math, and DeepSeek-Prover, which have shown significant advancements in mathematical reasoning and formal proof generation [13][18][21]. - The training cost for the DeepSeek-R1 model was approximately $294,000, indicating a relatively low investment for the capabilities achieved [25]. Group 3: Future Implications - Guo's departure raises questions about the continuity of DeepSeek's innovative projects, particularly the development of the upcoming DeepSeek-V4 model [6][10]. - His contributions have been pivotal in demonstrating that large models can achieve reasoning capabilities without relying on human annotations, which could influence future AI model development strategies [24].
35岁魔咒失效,中年人逆袭掌权AI革命?
创业邦· 2026-03-21 01:11
以下文章来源于秦朔朋友圈 ,作者阳淼 秦朔朋友圈 . 秦朔朋友圈是由中国著名媒体人、财经观察家秦朔牵头创立的一个新媒体与专业服务品牌,包括微信公众号、微博、视频节目、音频节目等。内容聚焦于经 济、金融和商业领域,关注重点为全球和中国财经商业热点、企业家精神、创新与发明创造、商业文明探索等。 来源丨秦朔朋友圈(ID: qspyq2015 ) 作者丨 阳淼 最近有一个很有趣的洞察, AI 创业大潮中的 " 老头乐现象 " : 在这一轮 AI 革命中弄潮的,很多都是四五十岁的中年人,比如 OpenAI 的 Altman , 41 岁, Anthropic 的 Amodei , 42 岁 ; DeepMind 的 Hassabis , 48 岁。最近大红大紫的 OpenClaw 的开发者 Steinberger , 38 岁,都已经退休过一回了。 这个现象放在中国也有类似情况,智谱 AI 的张鹏 44 岁,D eepSeek的 梁文锋 41 岁,阶跃星辰的姜大昕 40 岁, MiniMax 闫俊杰也 37 岁了! 当然要说这群人是"老头乐",那也有点伤人 。 不过说是中年革命,应该不过分。 这跟 30 年 前 的互联 ...
从阿里云涨价看算力通胀演绎的节奏和阶段
2026-03-20 02:27
从 2026 年 1 月至今,算力通胀的传导路径和市场演变节奏是怎样的? 2026 年以来,算力通胀的传导链条呈现出从上游向下游逐步外溢的趋势。1 月 中旬起,市场需求侧已观察到 Token 消耗的快速增长,预示了全年算力通胀的 趋势。具体来看,通胀首先体现在 GPU 和存储环节,1 月份甚至 CPU 价格也 出现过小幅上涨。随后,通胀传导至云服务领域。1 月下旬,亚马逊云科技率 先提价,1 月 25 日谷歌云也宣布上调海外 CDN 价格,引发了市场对国内云厂 商涨价的预期。 进入 2 月,国内市场跟进趋势明显。2 月 5 日,网宿科技正式 公布 CDN 涨价;2 月 11 日,优刻得也宣布涨价。然而,当时市场主流观点认 为,在阿里巴巴和字节跳动两大巨头未明确表态前,中小云厂商的涨价行为更 多是试探性的,整个行业处于观望状态。尽管如此,当时产业内已形成共识, 即存储产品的价格上涨是确定性趋势,同时 GPU 服务器的价格也随着各批次到 货成本动态调整。 近期,随着阿里云和百度云正式宣布涨价,加之腾讯云针对 特定模型以及智谱 AI 的 Token 价格连续两轮上调,标志着算力通胀已明确传 导至国内主流云服务商和模 ...
U.S. tech execs smuggled Nvidia chips to China, prosecutors say
CNBC· 2026-03-19 22:22
Core Viewpoint - The U.S. Attorney's Office has charged individuals associated with a U.S. server manufacturer for illegally diverting billions of dollars in AI servers to China, highlighting concerns over unauthorized access to high-powered chips by Chinese companies [1]. Group 1: Legal Actions and Allegations - The U.S. government has filed an indictment against Yih-Shyan "Wally" Liaw, Ruei-Tsan "Steven" Chang, and Ting-Wei "Willy" Sun for violating the Export Control Reform Act [2]. - The indictment states that products containing Nvidia chips are subject to strict U.S. export controls, which prohibit their sale to China without a license, aimed at protecting U.S. national security [3]. Group 2: Industry Context and Responses - Nvidia's graphics processing units are in high demand globally for training generative AI models, indicating the competitive landscape between U.S. and Chinese companies [2]. - U.S. President Trump previously sought to prevent China from obtaining processors, but later indicated that Nvidia could ship H200 GPUs to China under specific conditions to maintain national security [3]. - Nvidia had received licenses to export the H20 chip to China last summer, with an agreement to provide the U.S. with 15% of its sales in China [4].
35岁魔咒失效,中年人逆袭掌权AI革命?
虎嗅APP· 2026-03-19 00:21
Core Insights - The article discusses the phenomenon of middle-aged entrepreneurs leading the current AI revolution, contrasting it with the younger leaders of the internet revolution [2][3] - It emphasizes that the AI revolution favors individuals with accumulated experience, emotional intelligence, and a sense of responsibility, which are often found in middle-aged professionals [3] Funding and Investment Landscape - AI entrepreneurship requires significant capital investment, likened to heavy industry, whereas internet startups were more akin to light industry with lower entry costs [5][6] - Training advanced AI models demands substantial resources, with costs reaching millions of dollars, making it challenging for younger entrepreneurs without access to large funding pools [6][7] - The shift in venture capital strategies has moved from broad investment in young entrepreneurs to a focus on experienced middle-aged leaders who can provide certainty and stability [14][16] Technical and Engineering Expertise - AI projects necessitate deep engineering knowledge and experience, which often excludes younger individuals who may lack the requisite background [8][9] - The complexity of AI model training requires extensive time and effort for system adjustments, contrasting sharply with the rapid iteration seen in internet startups [9] Organizational and Networking Advantages - Middle-aged entrepreneurs possess superior organizational skills and networks, which are crucial for managing the multifaceted demands of AI projects [10] - Established connections and industry knowledge enable these leaders to attract talent and resources that younger entrepreneurs may struggle to secure [10] Shifts in Capital and Regulatory Environment - The capital landscape has evolved to prioritize experienced entrepreneurs, with a focus on those who can navigate regulatory challenges and ethical considerations in AI development [13][18] - Regulatory scrutiny has increased, necessitating a deeper understanding of compliance and ethical implications, which middle-aged leaders are better equipped to handle [19][20] Opportunities for Younger Entrepreneurs - Despite the dominance of middle-aged leaders, there remains space for young entrepreneurs to innovate and contribute significantly to the AI landscape [22][24] - Young professionals often excel in technical execution and can drive rapid product development, complementing the strategic oversight of their older counterparts [24] Strategic Directions for Middle-aged Entrepreneurs - Middle-aged leaders are encouraged to define industry problems accurately, leverage their accumulated knowledge to create competitive advantages, and manage AI-human collaboration effectively [28][31] - Establishing ethical frameworks and regulatory compliance will be essential for long-term success in the AI sector, where trust is a critical asset [33]
DeepSeek又出手了?一个神秘的AI模型引起全球开发者热议
凤凰网财经· 2026-03-18 13:21
Core Viewpoint - The article discusses the emergence of a new AI model named "Hunter Alpha," which has sparked speculation about its connection to the upcoming DeepSeek V4 model due to its impressive performance metrics and anonymous release [3][4][6]. Group 1: Performance Metrics - Hunter Alpha boasts a parameter scale of 1 trillion, placing it among the leading models in the industry [4]. - The model claims to have a context window of up to 1 million tokens, significantly surpassing most commercial models, allowing it to handle longer texts and more complex tasks [4]. - As of the latest statistics, Hunter Alpha has processed over 160 billion tokens, indicating rapid adoption among developers [5]. Group 2: Connection to DeepSeek - The model's self-identification as a "Chinese AI model trained primarily in Chinese" and its knowledge cutoff date of May 2025 align with the specifications of DeepSeek's existing models [6]. - Some developers suggest that the reasoning style of Hunter Alpha may reveal its "heritage," with its scale and memory capacity matching expectations for DeepSeek V4 [7]. - Despite the similarities, some analysts remain cautious about definitively linking Hunter Alpha to DeepSeek V4, noting differences in token behavior and architectural patterns [9][10]. Group 3: Industry Practices - The anonymous release of AI models for real feedback has become a standard practice in the industry, with platforms like OpenRouter facilitating testing across multiple AI systems [8]. - Notifications on Hunter Alpha's profile indicate that all prompts and completions are recorded for model improvement, a common practice in the field [9].
Nvidia will resume H200 AI chip sales in China, Jensen Huang says
Yahoo Finance· 2026-03-18 12:39
Nvidia has received purchase orders from Chinese customers for its H200 processors and is restarting production, CEO Jensen Huang said this week — marking the first concrete movement toward resuming chip sales to China after months of regulatory maneuvering in both the U.S. and China. "We have received purchase orders, and we're in the process of restarting our manufacturing," Huang told reporters at the company's GTC conference in San Jose, according to CNBC. "Our supply chain is getting fired up." Huang ...
Wall Street Breakfast Podcast: The AI No One Claims
Seeking Alpha· 2026-03-18 10:55
Group 1: AI Developments - An AI model named Hunter Alpha has emerged on the OpenRouter platform, speculated to be linked to DeepSeek's next-generation system [4][5][6] - Hunter Alpha is described as a 1-trillion-parameter model, indicating a significant scale in its training data and processing capabilities [6] Group 2: Lululemon Athletica (LULU) - Lululemon reported better-than-expected fourth-quarter results, surpassing both top- and bottom-line estimates, but its stock fell 2% in premarket trading due to disappointing guidance [7][8] - The company anticipates a net revenue increase of 1% to 3% for the first quarter, projecting revenue between $2.4 billion and $2.43 billion, which is below market expectations [8] - For 2026, Lululemon expects sales of $11.35 billion to $11.5 billion, also falling short of the $11.52 billion estimate, with anticipated earnings between $12.10 and $12.30, below the $12.54 estimate [9] Group 3: Amazon and USPS - Amazon plans to significantly reduce the number of packages sent through the U.S. Postal Service, aiming to cut shipments by at least two-thirds by September [10] - USPS is facing financial challenges, with the Postmaster General indicating that the service may run out of funds within a year, suggesting potential delivery cuts or price increases as solutions [11]