DeepSeek
Search documents
欧盟拟推「高风险供应商」禁令,华为回应;DeepSeek新模型「MODEL1」曝光;某汽车品牌LOGO撞脸小米?网友:百分百在蹭小米丨雷峰早报
雷峰网· 2026-01-22 00:31
Key Points - The European Union plans to implement a ban on "high-risk suppliers," which Huawei has criticized as violating fair principles based on nationality [4][5] - DeepSeek has unveiled a new model called "Model 1," which is expected to be more efficient and suitable for edge devices [10][11] - Yu Minhong has launched a "retirement club" targeting individuals aged 50 to 75, offering low-cost experience classes [7][8] - iQIYI's CFO Wang Jun has resigned, with Senior Vice President Zeng Ying taking over as acting CFO [16][17] - Tesla's CEO Elon Musk has restarted the Dojo3 chip project, aiming for advancements in space AI [37][38] - Nvidia's CEO Jensen Huang expressed regret over selling Nvidia stock to buy a car for his parents, calling it the most expensive car in the world [39] - Vivo has maintained a leading position in the Indian smartphone market with a 23% share, significantly ahead of competitors [40][41]
What Bubble? Nvidia CEO Says AI Needs Trillions More in Investments
Yahoo Finance· 2026-01-21 22:57
Core Insights - The AI industry requires "trillions of dollars" in investment for infrastructure development to avoid failure, according to Nvidia's CEO Jensen Huang [1] - Huang describes AI as a "five-layer cake," emphasizing that each layer, from energy to applications, necessitates significant investment, with current commitments at around $1.5 trillion for 2025 alone [2] - Nvidia's market capitalization is now comparable to the total value of all mined silver, highlighting the financial impact of the AI boom [3] Investment and Market Dynamics - Huang's statements come amid market volatility, particularly after a Chinese startup's chatbot caused a 17% drop in Nvidia shares [4] - Despite substantial investments in generative AI, a study from MIT indicates that 95% of organizations are seeing no return on their investments, raising concerns about potential waste [5] - The financing structure within the AI sector has been criticized for creating a closed loop, where Nvidia's investment in OpenAI leads to increased demand for its chips [6] Competitive Landscape - Companies are taking measures to mitigate Nvidia's market dominance, with OpenAI signing a $10 billion deal with Cerebras for faster AI chip technology and partnerships with AMD and Broadcom [7] - Google is promoting its custom Tensor Processing Units (TPUs) as alternatives, with Anthropic agreeing to utilize up to one million TPUs, while Meta is also exploring Google's silicon for its data centers [8]
DeepSeek新模型曝光?“MODEL1”现身开源社区
Shang Hai Zheng Quan Bao· 2026-01-21 21:31
Core Insights - DeepSeek has updated its FlashMLA code on GitHub, revealing the previously undisclosed "MODEL1" identifier, which may indicate a new model distinct from the existing "V32" [3][4] - The company plans to launch an "open source week" in February 2025, gradually releasing five codebases, with Flash MLA being the first project [4] - Flash MLA optimizes memory access and computation processes on Hopper GPUs, significantly enhancing the efficiency of variable-length sequence processing, particularly for large language model inference tasks [4] Company Developments - DeepSeek's upcoming AI model, DeepSeek V4, is expected to be released around the Lunar New Year in February 2025, although the timeline may vary [4] - The V4 model is an iteration of the V3 model released in December 2024, boasting advanced programming capabilities that surpass current leading models like Anthropic's Claude and OpenAI's GPT series [5] - Since January 2026, DeepSeek has published two technical papers introducing a new training method called "optimized residual connections (mHC)" and a biologically inspired "AI memory module (Engram)" [5] Industry Context - The introduction of the Engram module aims to improve knowledge retrieval and general reasoning, addressing inefficiencies in the Transformer architecture [5] - The support from Liang Wenfeng's private equity firm, which has achieved a 56.55% average return in 2025, has bolstered DeepSeek's research and development efforts [5]
腾讯研究院AI速递 20260122
腾讯研究院· 2026-01-21 16:01
Group 1 - DeepSeek's Model 1 has been discovered in the FlashMLA codebase, potentially indicating an upcoming release, featuring a 512-dimensional architecture and support for NVIDIA's Blackwell architecture [1] - Liquid AI has launched the open-source inference model LFM2.5-1.2B-Thinking, which operates on a liquid neural network architecture and requires only 900MB of memory on mobile devices, achieving a score of 88 on MATH-500 [2] - The xAI engineer revealed that AI is being tested as a "colleague" in the MacroHard project, achieving human speeds eight times faster, and the company is considering utilizing idle computing power from approximately 4 million Tesla vehicles in North America [3] Group 2 - Research indicates that models like DeepSeek-R1 can spontaneously form multi-role debate mechanisms, significantly improving accuracy through internal social dialogue [4][5] - Medical SAM3, a new model developed by the University of Central Florida, allows for expert-level segmentation in medical imaging using only text prompts, achieving an average accuracy increase from 11.9% to 73.9% across 33 datasets [6] - Anthropic's CEO predicts that AI will fully take over software engineering roles within 6-12 months, with a significant portion of entry-level jobs expected to disappear in the next 1-5 years [7] Group 3 - The Sequoia xbench team reported that top agents can handle over 60% of 104 daily tasks, indicating that foundational agent capabilities have become commoditized [8] - OpenAI's CFO discussed the maturation of multi-agent systems by 2026, emphasizing that AI bubbles should be measured by API call volumes rather than stock prices, with productivity increases of 27-33% for cutting-edge companies [9]
计算机行业周报:千问App接入阿里生态业务
Guoxin Securities Co., Ltd· 2026-01-21 13:25
Investment Rating - The report gives a "Positive" rating for the computer industry, expecting the industry index to outperform the market index by over 5% in the next six months [33]. Core Insights - The computer industry index rose by 3.82% from January 12 to January 16, outperforming the CSI 300 index by 4.39 percentage points, making it the top-performing sector among other industries [2][11]. - Key stocks that performed well include Tongda Hai with a 39.73% increase, Haohan Deep with a 30.57% increase, and Jiechuang Intelligent with a 28.95% increase. Conversely, *ST Lifang saw a decline of 33.66%, followed by Aerospace Information at -14.46% and Haixia Innovation at -13.40% [14][15]. - Significant developments include the announcement of the integration of Qianwen App into Alibaba's ecosystem, enabling AI-driven services for tasks like ordering food and booking flights [3][31]. Market Performance - The computer industry has a total of 335 listed companies, with 234 companies seeing a rise, accounting for 69.85% of the sector [14]. - The report highlights the performance of individual stocks, with notable gains and losses during the specified period [15]. Recent Developments - Elon Musk announced the open-sourcing of the latest recommendation algorithm for X, promising updates every four weeks [3]. - Apple and Google have entered a partnership where Google's Gemini will support Apple's AI initiatives, with Apple expected to pay around $1 billion annually for this technology [18][19]. - Meta's CEO Mark Zuckerberg announced the Meta Compute initiative, aiming to build a GW-level AI infrastructure over the next decade [21][22]. - The U.S. has relaxed export controls on NVIDIA's H200 chips to China, which is expected to restart shipments to Chinese customers [24].
需求太火爆!智谱AI因算力告急“限购”:GLM编程计划每日仅售20%,老用户优先
Hua Er Jie Jian Wen· 2026-01-21 13:22
Core Viewpoint - The rapid increase in user demand for the newly released GLM-4.7 language model has led to significant computational bottlenecks for Zhipu AI, prompting the company to implement emergency throttling measures to prioritize existing users' experience [1][2]. Company Summary - Zhipu AI announced that starting January 23, it will drastically reduce the daily new subscription limit for its programming assistant service "GLM Coding Plan" to 20% of its previous levels, ensuring that existing users' access is prioritized [1][2]. - The company has experienced frequent throttling errors and significant response time delays during peak hours due to the surge in user numbers, which it attributes to a phase of resource strain caused by rapid growth [1][2]. - The GLM Coding Plan is positioned as a competitor to Claude, and the company is in direct competition with leading firms like OpenAI and Anthropic [1][2]. Industry Summary - The implementation of throttling measures in response to user surges has become a common phenomenon in the rapidly growing AI industry, as seen previously with DeepSeek, which also limited API access due to server resource constraints [3]. - This "throttling" action highlights the temporary mismatch between the explosive growth in AI application demand and the pace of foundational computational infrastructure development [3]. - The computational bottleneck reflects strong end-user demand while revealing the operational challenges AI companies face in transitioning from technological breakthroughs to stable service delivery [3].
AI进化速递 | Meta 新AI团队已交付首批人工智能模型
Di Yi Cai Jing· 2026-01-21 12:49
Core Insights - The article highlights the launch of Shanghai Zhangjiang's first automated production line for robot joints, which accelerates the mass production of humanoid robots [1] Group 1: Industry Developments - The Ministry of Industry and Information Technology reports that AI has penetrated over 70% of business scenarios in leading smart factories [1] - The automated production line in Shanghai Zhangjiang aims to speed up the mass production of humanoid robots [1] - Beijing Renxing and Xiaowu Intelligence have formed a strategic partnership to promote the industrialization of embodied intelligence [1] Group 2: Technological Advancements - DeepSeek has unveiled its new model "MODEL1" [1] - The monthly active users of Keling AI have surpassed 12 million, with daily revenue increasing by approximately 30% compared to December of the previous year [1] - Meta's CTO announced that the new AI team has delivered its first batch of artificial intelligence models [1] Group 3: Investments and Collaborations - OpenAI has launched an educational program aimed at various countries [1] - NVIDIA has invested $150 million in the AI inference startup Baseten [1] - ServiceNow has entered into a three-year collaboration with OpenAI [1]
朱宁:2026中国经济增速放缓,但体量更大、更全球化
Di Yi Cai Jing· 2026-01-21 11:12
中国经济体量的持续增长,意味着即使增速更为温和,其在全球几乎所有经济领域中的份额也将越来越大。 外界普遍认为,2026年中国经济增长目标极有可能再次设定在"5%左右",相较于2025年,今年的经济增速或将进一步放缓。其中一个关键影响因素是出口 ——2025年中国经济曾受益于超预期的出口需求激增,而这一有利背景在未来12个月内恐难重现。 这里我需要澄清一个常见的误解,中国GDP增长目标放缓,并非因为决策者忽视经济增长,而是因为自主创新、国家安全以及可持续的高质量发展,已经取 代"GDP至上"的理念,成为首要政策目标。 人们需要意识到,即便是约5%的增速(大致2008年中国经济增长率的一半),2026年中国的整体经济规模相比当年已扩大4倍。 我们可以通过一个例子更直观地理解这个变化:2008年中国经济的体量跟德国相近,彼时德国是全球第四大经济体。而如今中国的GDP规模大约是当前德国 经济规模的4倍。即使增速放缓,中国对全球经济增长的贡献,依然约等于第二大和第三大贡献国——印度和美国的总和。 2024年全球名义GDP占比 步入2026年,中国经济释放出复杂多元的信号:一方面,DeepSeek、智谱AI等国内人工智能 ...
选择开源,杭州正在下注AI时代“最贵的投资”
2 1 Shi Ji Jing Ji Bao Dao· 2026-01-21 10:37
Core Viewpoint - Hangzhou is positioning itself as a leader in AI innovation by focusing on an open-source ecosystem, integrating private enterprises and tech companies to drive technological advancement and industrial upgrades [1][2]. Economic Goals - The expected economic growth target for Hangzhou in 2026 is set at 5% to 5.5%, with a focus on controlling the urban unemployment rate around 5% and maintaining a consumer price increase of approximately 2% [2]. - The city aims to achieve a retail sales growth of about 5%, surpassing 1 trillion yuan by 2026 [4]. AI Development Strategy - Hangzhou plans to establish itself as the leading city for AI innovation, with initiatives to create a hub for open-source large models and to support the development of high-end chips and foundational software [2][8]. - The city aims to cultivate over three internationally top-tier open-source foundational models by 2030 [2]. Education and Talent Development - To address its previous shortcomings in educational resources, Hangzhou will enhance the integration of education and technology, supporting the development of world-class research universities [3]. Industry Focus - The city will prioritize the development of the AI terminal industry, targeting a scale of 300 billion yuan by 2027, and aims to create 10 AI-themed industrial parks and attract over 1,000 open-source model ecosystem enterprises [5][6]. - By 2026, the city plans to increase the value added by core digital economy industries by 6.5% and raise R&D investment intensity to around 4.1%, with total R&D spending reaching 263.5 billion yuan [4][6]. Future Industry Development - Hangzhou will focus on emerging industries such as synthetic biology, aerospace, and quantum technology, establishing future industry pilot zones [7]. - The city aims to create a collaborative development framework across the entire AI industry chain, emphasizing key areas like embodied intelligence and intelligent driving [8]. Open-source Initiative - The open-source approach is seen as a critical strategy for overcoming barriers in AI development, allowing for broader application and integration of AI technologies across various sectors [9]. - The city plans to implement the "Hangzhou AI+" initiative, which includes opening 200 benchmark scenarios in AI and establishing national pilot bases in key fields [10].
DeepSeek新模型“MODEL1”曝光
Di Yi Cai Jing Zi Xun· 2026-01-21 09:05
Core Insights - The article discusses the emergence of a new model named "MODEL1" from DeepSeek, coinciding with the one-year anniversary of the DeepSeek-R1 release, indicating potential advancements in AI model architecture [2][6]. Group 1: Model Development - "MODEL1" has been referenced in the updated FlashMLA code on GitHub, suggesting it may represent a new model distinct from the existing "V32" architecture [2][3]. - There are differing opinions in the industry regarding whether "MODEL1" is a version 4 model or an advanced inference model, with some developers speculating it could be the ultimate version of the V3 series [2][5]. - Key technical differences between "MODEL1" and "V32" include variations in key-value (KV) cache layout, sparsity handling, and support for FP8 data format decoding, indicating targeted design for memory optimization and computational efficiency [5]. Group 2: Anticipated Release and Features - The structure of the model files suggests that "MODEL1" is nearing completion or inference deployment, awaiting final weight freezing and testing validation, which implies a forthcoming launch [5]. - There are expectations for DeepSeek to release its next flagship model, DeepSeek V4, in February, with preliminary tests indicating it may surpass other top models in programming capabilities [6]. - Recent technical papers from DeepSeek introduce new training methods and an AI memory module, hinting that these innovations may be integrated into the upcoming model [6]. Group 3: Industry Impact - The DeepSeek-R1 model has been recognized as the most praised model on Hugging Face, significantly lowering barriers in inference technology and production deployment, thus influencing the open-source strategy of major Chinese companies [9]. - Over the past year, Chinese AI models have seen increased downloads on Hugging Face, surpassing those from the U.S., indicating a shift in reliance on Chinese-developed open-source models within the global supply chain [9].