Workflow
DeepSeek
icon
Search documents
DeepSeek正式发布新模型,还透露国产AI芯片关键信息
Xuan Gu Bao· 2025-08-21 23:22
Group 1 - DeepSeek's latest V3.1 version utilizes UE8M0 FP8 Scale parameter precision, designed for the upcoming domestic chip release [1] - FP8 is a cutting-edge low-precision format for AI computing, significantly enhancing GPU performance and reducing memory usage for large language model training [1] - Domestic GPUs are rapidly developing, transitioning from "usable" to "user-friendly" stages, although they have not yet matched international products [1] Group 2 - Companies like Cambrian, Haiguang Information, and Huawei are leading the A-share computing chip market [3] - Moer Thread provides AI training and inference cards, with its latest GPU supporting FP8 precision, significantly boosting AI computing power [1] - Muxi offers C series GPUs for integrated training and inference, and N series GPUs focused on cloud AI inference, showcasing strong mixed-precision computing capabilities [2] Group 3 - The global GPU market is projected to reach 36,119.74 billion yuan by 2029, with China's GPU market expected to reach 13,635.78 billion yuan, increasing its global market share from 30.8% in 2024 to 37.8% in 2029 [2] - DeepSeek is driving the shift of AI applications from centralized cloud services to mass terminals, necessitating high-cost performance dedicated chips [2] - The domestic chip manufacturers and application enterprises are accelerating their integration with DeepSeek, anticipating a significant increase in domestic computing power by 2025 [2] Group 4 - Huawei's Ascend ecosystem includes companies like Tuo Wei Information, Digital China, and Huafeng Technology, enhancing mixed inference architecture and agent capabilities [4] - The upgraded model shows significant improvements in tool usage and intelligent agent tasks through Post-Training optimization [4] Group 5 - Related companies include Dingjie Zhizhi, Fanwei Network, and Kute Intelligent [5]
滴滴等多家网约车平台降低抽成;货车司机月均净收入10512元,超过网络主播;7月全社会用电量超1万亿度丨每经早参
Mei Ri Jing Ji Xin Wen· 2025-08-21 23:14
Group 1 - Didi Chuxing, Cao Cao Mobility, and T3 Mobility have announced a reduction in commission rates, with Didi and T3 lowering their maximum commission to 27% and Cao Cao reducing it to 22.5% [19] - Gaode Dache has also stated that it will work with partners to ensure that the commission rate for at least 80 ride-hailing platforms does not exceed 27% [19] - The average net income of truck drivers is reported to be 10,512 yuan per month, making it the highest among six new employment groups [10][11] Group 2 - The total electricity consumption in China reached 1.02 trillion kilowatt-hours in July, marking a historic first and a year-on-year increase of 8.6% [11] - This figure is equivalent to the total annual electricity consumption of ASEAN countries and has doubled compared to ten years ago [11] - The increase in electricity consumption is attributed to high temperatures and stable industrial production [11] Group 3 - The Hong Kong Stock Exchange is considering extending trading hours but emphasizes the need for caution due to the potential impact on the market [6] - The National Development and Reform Commission plans to conduct central frozen pork reserve storage to stabilize the pork market amid price fluctuations [7] - The State Council has approved a plan for the Jiangsu Free Trade Zone to promote the open innovation development of the biopharmaceutical industry [8] Group 4 - The National Foreign Exchange Administration is launching a pilot program for green foreign debt in 16 provinces and cities to support green finance development [9] - The CEO of Zhiyuan Robotics expects to complete a Series C financing round by the end of the year, aiming to attract more international investment [20] - Xiaopeng Motors' CEO He Xiaopeng has purchased 3.1 million shares at an average price of 80.49 HKD, signaling confidence in the company's future [21] Group 5 - Sinopec plans to repurchase A-shares worth between 5 billion and 10 billion yuan, which is expected to enhance earnings per share and boost stock prices [22] - WanTai Biologics has received approval for its nine-valent HPV vaccine, marking a new revenue growth point for the company [25] - NIO has launched the new ES8 model with a starting price of 416,800 yuan, indicating a strong market presence in the electric SUV segment [26]
2025《财富》中国科技50强:杭州“科技小龙”崭露头角,DeepSeek等引领创新
Sou Hu Cai Jing· 2025-08-21 21:27
Group 1 - The "2025 Fortune China Tech 50" list highlights Huawei, DeepSeek, and CATL as the top three companies, recognizing those that are "born in China and influence the world" [1] - Huawei continues to be a dominant player in the tech sector, while DeepSeek's rise is a notable highlight, achieving a score of 88.5 in the MMLU benchmark test with its DeepSeek-R1 model [1][3] - DeepSeek has surpassed 163 million monthly active users, positioning itself as a leader in the global AI-generated content application market [1] Group 2 - Emerging companies like Unitree Technology and Yunsen Technology are gaining recognition alongside traditional tech giants, with Unitree achieving a global sales volume of 18,000 units and a market share of 23% in quadruped robots [3] - Yunsen Technology focuses on humanoid and quadruped robots, with its "Mountain Cat" all-terrain robot being widely applied across various sectors [3] - The list's changes reflect the vigorous development of China's tech industry and indicate an intensifying future competition in the tech sector [5]
There is demand for Nvidia chips in China regardless of what Beijing says: Bernstein’s Stacy Rasgon
CNBC Television· 2025-08-21 18:50
Market Dynamics & Geopolitics - 中国公司被要求减少或停止购买英伟达H20芯片,转而依赖华为等本土选项[2][3] - 尽管中国有本土替代品,但由于英伟达在生态系统和软件兼容性方面的优势,中国市场对英伟达芯片仍有需求[14][15] - 深势科技(Deepseek)曾因华为芯片难以使用而延迟R2项目,表明其更倾向于使用英伟达产品[16] - 中国刺激本地芯片需求的举措是必然趋势,与“被冒犯”无关[13] Nvidia's Financials & Revenue Impact - 目前英伟达的财报数字中,来自中国的收入为零[6][10] - 上一季度,中国市场贡献了46亿美元的收入,约占总收入的近13%[7] - 市场对H20禁令解除后,中国市场恢复出货对英伟达总收入的影响存在分歧[8][9] - 即使可以自由销售,恢复供应链也需要时间,预计本季度不会有明显收入,下季度影响也很小,可能要到年底才能在数据上有所体现[10][11] Competitive Landscape - 华为的昇腾芯片在计算能力上不如英伟达,但价格更低,更本土化[3][4] - 中国AI初创公司DeepSeek同时使用华为和英伟达的芯片,表明本土芯片有一定竞争力[4] - 英伟达在性能方面有所限制,但其生态系统和软件兼容性具有优势[14]
腾讯研究院AI速递 20250822
腾讯研究院· 2025-08-21 16:01
Group 1 - Google launched the Pixel 10 series with four models, featuring the Tensor G5 chip and Gemini Nano model, emphasizing deep AI integration as a hallmark characteristic [1] - The new models include various AI functionalities such as Gemini Live voice assistant, Voice Translate for real-time speech translation, Nano Banana photo editor, and Camera Coach for photography guidance [1] - Pro Res Zoom supports up to 100x smart zoom, and Magic Cue intelligently extracts content from Gmail and calendar, marking the end of the traditional smartphone era according to Google [1] Group 2 - DeepSeek officially released the V3.1 model, utilizing a hybrid reasoning architecture that significantly enhances both thinking efficiency and agent capabilities [2] - The new model shows notable improvements in programming agent assessments and search agent evaluations, while reducing output tokens by 20%-50% without compromising performance [2] - The model is fully open-source, employing UE8M0 FP8 Scale parameter precision, with API upgrades supporting Anthropic API format and extending context to 128K [2] Group 3 - ByteDance's Seed team open-sourced three models: Seed-OSS-36B-Base (with and without synthetic data) and Seed-OSS-36B-Instruct [3] - The models were trained on 12 trillion tokens and are licensed under Apache-2.0, supporting a 512K ultra-long context window and flexible reasoning budget control [3] - The Instruct version achieved new state-of-the-art records in various open-source benchmark tests, particularly in MMLU-Pro, MATH, and AIME24 [3] Group 4 - The University of Hong Kong and Kuaishou's Keling team introduced Context as Memory technology, achieving long-term scene memory retention in video generation, comparable to Google's Genie 3 and released earlier [4] - This innovative technology uses historical generated context as "memory" and designs a memory retrieval mechanism based on camera trajectory, significantly enhancing computational efficiency [4] - Research indicates that video generation models can implicitly learn 3D priors without explicit 3D modeling, maintaining static scene memory within seconds [4] Group 5 - Baidu released the MuseSteamer video model 2.0, utilizing integrated Chinese audio-video generation technology to address the unnatural dialogue issue in AI video generation [5] - The new model offers four versions (turbo, pro, lite, and voiced), accurately matching Chinese lip movements, supporting emotional expression and dialects, and enabling static photos to speak [6] - This technology synchronizes sound and visuals during conception, eliminating the need for post-production matching, and employs a "multi-modal latent space planner" to significantly reduce video production costs and complexity [6] Group 6 - Tencent's Yuanbao integrated Tencent Video functionality, allowing users to view videos directly from search results during conversations with Yuanbao [7] - Users can search for films by title, receive personalized recommendations based on scene descriptions, and retrieve films they can't remember by vague memories [7] - In addition to searching and recommending, Yuanbao can engage users in discussions about film creation backgrounds, plot meanings, and genre styles, with direct links to watch related works [7] Group 7 - Boston Dynamics showcased a new video of the Atlas humanoid robot, demonstrating evolution based on the latest large behavior models (LBMs) for precise control in multi-tasking and language-driven operations [8] - The system consists of four components: collecting embodied behavior data through remote control, processing labeled data, training a unified neural network policy model, and evaluating the policy model through testing tasks [8] - The Atlas robot can now smoothly perform "repair station" tasks, including complex movement operations, dexterous grasping, and secondary gripping, intelligently responding to unexpected situations, advancing general AI robotics [8] Group 8 - OpenAI researchers stated that GPT-5's behavior design intentionally addresses "flattery issues," aiming to balance interactivity with healthy assistant attributes, with significant improvements in creative writing and programming capabilities [9] - As evaluation benchmarks become saturated, the future differentiation of models will primarily depend on actual use cases, with the team designing internal assessments based on real-world needs [9] - OpenAI's agent development strategy has evolved from ChatGPT to Deep Research and more complete functional agents, aiming to build systems capable of asynchronous task execution and maintaining cross-platform memory over time [9] Group 9 - Index Ventures' investment director emphasized that founder traits are more important than market size, as exceptional founders can expand small markets, as demonstrated by Adyen and Figma [10] - There are notable differences between American and European founders: American founders tend to have more global ambitions and fundraising capabilities, while European founders are more pragmatic but often limited by market fragmentation and insufficient capital [10] - For Europe to produce global AI giants, three core issues must be addressed: increasing capital density, accelerating market integration, and improving talent systems to retain top researchers and entrepreneurs [10]
广告收入缩水!百度动刀最大钱袋,核心搜索业务面临 AI 转型阵痛
Hua Xia Shi Bao· 2025-08-21 15:04
Core Viewpoint - Baidu's Q2 2025 financial report shows a decline in core revenue and online marketing income, highlighting the challenges faced by the company in adapting to the AI era while its traditional revenue sources diminish [1][2][4]. Group 1: Financial Performance - Baidu's core revenue for Q2 2025, excluding iQIYI, was 26.3 billion yuan, a year-on-year decrease of 2% [1]. - The net profit attributable to Baidu's core business was 7.4 billion yuan, representing a year-on-year increase of 35% [1]. - Online marketing revenue for the same period was 16.2 billion yuan, down 15% year-on-year, marking a continuous decline over five consecutive quarters [2][4]. Group 2: Business Transformation - Baidu is undergoing significant changes in its search business, with a major redesign of its search interface to incorporate AI-generated content, which now constitutes 64% of mobile search results [2]. - The company has shifted its marketing strategy by eliminating exclusive agency agreements in several cities, moving towards a service provider model to enhance competition and efficiency [4][5]. Group 3: AI Development and Competition - Baidu's non-online marketing revenue exceeded 10 billion yuan in Q2 2025, driven by its AI and cloud services, with a 34% year-on-year growth [1][6]. - The company is actively developing its next-generation Ernie model and has shifted from a closed-source to an open-source approach, aiming to capture market opportunities in the AI sector [6][7]. - Despite advancements in AI, Baidu faces increasing competition from other tech giants and emerging players, impacting its market position and user engagement [6][7].
DeepSeek官宣!新模型、新突破、新价格
Core Insights - DeepSeek officially released DeepSeek-V3.1, a large model featuring a hybrid reasoning architecture that supports both thinking and non-thinking modes, resulting in higher efficiency compared to its predecessor DeepSeek-R1-0528 [1] - The new model shows significant improvements in tool usage and agent tasks, with notable advancements in code repair assessments and complex task testing in command-line environments [1] - The release is seen as a step towards the "Agent era" in AI, with market predictions estimating the Chinese AI agent market to reach 6.9 billion yuan by 2025 and nearly 30 billion yuan by 2030 [1] Performance Enhancements - Testing results indicate that DeepSeek-V3.1's efficiency in thinking mode has improved significantly, achieving similar average performance to R1-0528 while reducing output token count by 20%-50% [2] - In non-thinking mode, the output length has been effectively controlled, which helps users manage costs [2] API Pricing Adjustments - Starting from September 6, 2023, DeepSeek will adjust its API pricing, removing the previous night-time discount. The new pricing will be 0.5 yuan per million input tokens (cache hit) / 4 yuan (cache miss), and 12 yuan per million output tokens [2] - The previous API pricing was 0.5 yuan per million input tokens (cache hit) / 2 yuan (cache miss), and 8 yuan per million output tokens [2] Technical Specifications - DeepSeek-V3.1 utilizes UE8M0 FP8 Scale parameter precision, which is designed for the upcoming generation of domestic chips [2] - Recent tests by the China Academy of Information and Communications Technology indicate that products deploying the DeepSeek model have achieved accuracy in language understanding and logical reasoning tasks comparable to foreign systems [3]
华为、DeepSeek、宇树科技,最强中国科技榜单来了!
Group 1: Core Companies and Industries - Notable companies such as Huawei, DeepSeek, CATL, Zhongzhong Group, Alibaba, Tencent, and BYD are recognized in the 2025 Fortune China Technology 50 list, with a focus on popular sectors like artificial intelligence, robotics, biomedicine, and green energy [3][9][10] - The robotics sector is highlighted with companies like Yushu Technology, Yundong Technology, and Luoshi Robotics, which are innovating in areas such as motion control, high-performance joint motors, and autonomous inspection robots [5][6] Group 2: Artificial Intelligence Developments - DeepSeek has gained significant attention, ranking in the top 10 for global open-source model downloads, with 163 million monthly active users as of June 2025, leading the AI-generated content application market [7] - ByteDance is also recognized for its substantial investment in AI, with a projected capital expenditure of 80 billion yuan in 2024, surpassing the combined total of Baidu, Alibaba, and Tencent [7] - Drip Technology, a provider of enterprise-level AI application solutions, ranks first in the Chinese market for its general enterprise operational decision-making model [8] Group 3: Biomedicine and Green Energy Innovations - The biopharmaceutical sector features companies like CSPC Pharmaceutical Group, which is developing over 200 innovative drug projects across various therapeutic areas, with over 50 new drugs expected to be submitted for approval by the end of 2028 [11] - Kangfang Biotech is noted for its comprehensive drug development system, with over 50 innovative drug candidates, 24 of which are in clinical stages, making it one of China's leading antibody drug developers [11] - Trina Solar has initiated a 49.9MW solar-storage integration project in the UK, which will supply power to over 16,500 households and reduce carbon emissions by nearly 15,000 tons annually [11] Group 4: Overall Industry Trends - The companies listed in the Fortune China Technology 50 are focusing on practical applications of large models in vertical sectors like finance and healthcare, optimizing efficiency while advancing robotics for high-risk tasks [12] - The emphasis is also on developing cleaner and more efficient energy solutions, promoting a harmonious relationship between humanity and nature [12]
DeepSeek-V3.1发布:更高思考效率、更强智能体能力
(原标题:DeepSeek-V3.1发布:更高思考效率、更强智能体能力) 21世纪经济报道记者 陈归辞 在DeepSeek-V3推出5个月后,DeepSeek-V3低调发布升级版模型DeepSeek-V3.1。 8月21日下午,DeepSeek 正式发布 DeepSeek-V3.1(简称"V3.1"),称其为"迈向 Agent 时代的第一 步"。8月19日晚间,DeepSeek 小助手于官方群内宣布线上模型版本已升级至V3.1,引发广泛关注,目 前 V3.1 在HuggingFace趋势榜排名已冲上第二。 据DeepSeek方面介绍,V3.1的升级主要包含三大变化:混合思考模式、更高的思考效率和更强的Agent (智能体)能力。 编程任务方面,DeepSeek测试结果显示,在代码修复测评 SWE 与命令行终端环境下的复杂任务 (Terminal-Bench)测试中,DeepSeek-V3.1 相比之前的 DeepSeek 系列模型有明显提高。 从业内实测反馈来看,V3.1在AiderPolyglot多语言编程测试中,拿下了71.6%的高分,超越了Claude 4 Opus和DeepSeek R1等模型。并且, ...
迈向智能体时代“第一步” DeepSeek-V3.1 发布
Xin Jing Bao· 2025-08-21 14:09
Core Viewpoint - DeepSeek officially released DeepSeek-V3.1, marking a significant step towards the "Agent era" with enhanced capabilities in reasoning and task performance [1] Group 1: Product Upgrade - The upgrade includes a mixed reasoning architecture that supports both thinking and non-thinking modes in a single model [1] - DeepSeek-V3.1-Think can provide answers in a shorter time compared to its predecessor, DeepSeek-R1-0528 [1] - The new model shows significant improvements in tool usage and intelligent agent tasks through Post-Training optimization, resulting in stronger agent capabilities [1] Group 2: User Experience - The official app and web model have been synchronized to DeepSeek-V3.1, allowing users to switch freely between thinking and non-thinking modes via a "deep thinking" button [1]