DeepSeek API

Search documents
DeepSeek 重磅发布!
Zheng Quan Shi Bao· 2025-08-21 15:05
Core Insights - DeepSeek officially released DeepSeek-V3.1 on August 21, 2023 [1] - The upgrade includes significant enhancements in search evaluation metrics, particularly in complex search tests and multidisciplinary expert-level challenges [3][4] Upgrade Features - The official app and web model have been upgraded to DeepSeek-V3.1, allowing users to switch between thinking and non-thinking modes via a "Deep Thinking" button [2][3] - DeepSeek API has also been upgraded, with deepseek-chat corresponding to non-thinking mode and deepseek-reasoner corresponding to thinking mode, expanding context to 128K [3] - The API Beta interface supports strict mode Function Calling to ensure outputs meet schema definitions [3] Performance Improvements - DeepSeek-V3.1 shows significant performance improvements over R1-0528 in complex search tests (browsecomp) and multidisciplinary expert-level problem tests (HLE) [3][4] - The model utilizes UE8M0 FP8 Scale parameter precision, with substantial adjustments made to the tokenizer and chat template, resulting in noticeable differences from DeepSeek-V3 [3] Market Reaction - DeepSeek concept stocks experienced a sharp rise in the market towards the end of the trading day [3]
DeepSeek,重磅发布!
Zheng Quan Shi Bao Wang· 2025-08-21 10:35
Core Insights - DeepSeek officially launched DeepSeek-V3.1 on August 21, featuring significant upgrades in its architecture and performance [1]. Group 1: Major Changes in DeepSeek-V3.1 - The new hybrid reasoning architecture allows the model to support both thinking and non-thinking modes simultaneously [2]. - Enhanced thinking efficiency enables DeepSeek-V3.1-Think to provide answers in a shorter time compared to DeepSeek-R1-0528 [2]. - Improved agent capabilities through post-training optimization have led to better performance in tool usage and agent tasks [2]. Group 2: API and User Experience Enhancements - The official app and web model have been upgraded to DeepSeek-V3.1, allowing users to switch freely between thinking and non-thinking modes via a "deep thinking" button [2]. - The DeepSeek API has also been upgraded, with deepseek-chat corresponding to non-thinking mode and deepseek-reasoner to thinking mode, expanding context to 128K [2]. - The API Beta interface now supports strict mode function calling to ensure outputs meet schema definitions [2]. Group 3: Performance Metrics - DeepSeek-V3.1 has shown significant improvements in multiple search evaluation metrics, outperforming R1-0528 in complex search tests requiring multi-step reasoning and expert-level multidisciplinary challenges [2]. Group 4: Technical Adjustments - DeepSeek-V3.1 utilizes UE8M0FP8Scale parameter precision and has made substantial adjustments to the tokenizer and chat template, resulting in noticeable differences from DeepSeek-V3 [3]. Group 5: Market Reaction - Following the announcement of DeepSeek-V3.1, DeepSeek concept stock Daily Interaction (300766) experienced a sharp rise in the late trading session [3].
DeepSeek-V3.1发布:更高效思考、更强Agent能力、更长上下文
生物世界· 2025-08-21 08:00
Core Insights - DeepSeek has officially released DeepSeek-V3.1, introducing a hybrid reasoning architecture that allows users to switch between "Deep Thinking" mode and "Non-Thinking" mode for enhanced interaction [2][3]. Group 1: Hybrid Reasoning Architecture - The "Deep Thinking" mode (DeepSeek-Reasoner) is designed for tasks requiring deep reasoning, such as mathematical calculations and complex logic analysis, providing higher reasoning efficiency [3]. - The "Non-Thinking" mode (DeepSeek-Chat) is tailored for everyday conversations and information queries, offering quicker responses [4]. - Users can easily switch modes via a "Deep Thinking" button on the official app and web interface, enhancing the user experience [5]. Group 2: Enhanced Agent Capabilities - DeepSeek-V3.1 has significantly improved tool usage and agent task performance through Post-Training optimization, resulting in fewer required iterations and higher efficiency in code repair and command line tasks [6]. - Benchmark results show that DeepSeek-V3.1 outperforms its predecessor, DeepSeek-R1-0528, in various tasks, including SWE-bench and Terminal-Bench, with scores of 66.0 and 31.3 respectively [7][8]. Group 3: Efficiency Improvements - The new version employs a thought chain compression training method, reducing output tokens by 20%-50% while maintaining performance levels comparable to DeepSeek-R1-0528, leading to faster response times and lower API call costs [9]. Group 4: API Upgrades and Model Availability - The DeepSeek API has been upgraded to support a context length of 128K, facilitating easier handling of long documents [10][12]. - The base and post-training models of DeepSeek-V3.1 are now open-sourced on platforms like Hugging Face and ModelScope, with a price adjustment for the API set to take effect on September 6, 2025 [11].
DeepSeek-V3.1正式发布,上下文均扩展为128K
Di Yi Cai Jing· 2025-08-21 07:19
Core Insights - DeepSeek has officially released the upgraded model DeepSeek-V3.1, which features a hybrid reasoning architecture that supports both thinking and non-thinking modes [1] - The new model, DeepSeek-V3.1-Think, demonstrates improved thinking efficiency, providing answers in a shorter time compared to its predecessor DeepSeek-R1-0528 [1] - Enhanced agent capabilities have been achieved through post-training optimization, significantly improving performance in tool usage and agent tasks [1] Pricing Adjustments - Starting from September 6, 2025, the pricing for API calls on the DeepSeek open platform will be adjusted according to a new price list, with the cancellation of night-time discounts [2] - Until September 6, 2025, all API services will continue to be billed under the original pricing policy [4]
潞晨科技官宣停用DeepSeek背后:创始人受指责,投资人很无奈
创业邦· 2025-03-04 03:02
Core Viewpoint - The article discusses the recent decision by Lu Chen Technology to suspend its DeepSeek API service, primarily due to cost considerations, despite DeepSeek's high theoretical profit margin of 545% [1][2][3]. Group 1: Cost Considerations - Lu Chen Technology's decision to halt DeepSeek API access is largely attributed to the high costs associated with providing stable service, which smaller MaaS providers struggle to manage compared to larger cloud companies [6][9]. - The theoretical profit margin of 545% reported by DeepSeek is based on a scenario where user demand is maximized, which is not typical for standard MaaS products that require significantly more resources to maintain stable output [3][4]. - The cost of providing DeepSeek services is exacerbated by the need for redundant computing resources to handle unpredictable user demand, leading to higher operational costs for smaller providers [9][10]. Group 2: Industry Impact - The suspension of DeepSeek API services by Lu Chen Technology reflects broader challenges faced by smaller MaaS companies in the wake of DeepSeek's competitive pricing and open-source initiatives, which threaten their business models [10]. - As DeepSeek continues to open-source its technology, many third-party MaaS providers are finding it increasingly difficult to maintain a competitive edge, leading to a potential disruption in the industry [10]. - The article highlights that numerous companies across various sectors, including technology, finance, and government, have integrated DeepSeek, indicating its widespread influence and the potential risks for smaller players in the market [8].
突发!潞晨科技宣布将暂停DeepSeek API服务,时间在一周后
证券时报· 2025-03-01 23:43
Core Viewpoint - Lu Chen Technology announced the discontinuation of DeepSeek API services, citing high operational costs and potential losses for users [2][5]. Group 1: Company Overview - Lu Chen Technology focuses on "liberating AI productivity" and has a team from prestigious institutions like UC Berkeley and Stanford [6][7]. - The company offers distributed software systems, large-scale AI platforms, and enterprise-level cloud computing solutions [7]. Group 2: Financial Performance - Lu Chen Technology has shown strong growth, with over 100,000 users and 2,476 paying customers, including four Fortune 500 companies [7]. - The company expects revenue to reach 77 million RMB in 2024, 150 million RMB in 2025, and 300 million RMB in 2026 [7]. Group 3: DeepSeek API and Cost Analysis - The CEO of Lu Chen Technology indicated that the full version of DeepSeek-R1 could lead to significant losses for users, estimating a monthly loss of 400 million RMB if output reaches 100 billion tokens [2]. - DeepSeek's theoretical cost and profit margin were disclosed, showing a potential profit margin of 545% based on a daily revenue of $562,027 against a cost of $87,072 [6].
DeepSeek宣布:活动正式收官
21世纪经济报道· 2025-02-28 08:46
Core Insights - DeepSeek's "Open Source Week" has successfully concluded, showcasing its commitment to transparency and collaboration in the AI field [1][7]. Group 1: Open Source Projects - The "Open Source Week" launched five projects from February 24 to February 28, covering various aspects of computing, communication, and storage [3]. - On February 24, the first open-source library, FlashMLA, was released, optimized for Hopper GPU, focusing on variable-length sequences and is now in production [4]. - On February 25, DeepEP was announced for public access, designed for MoE model training and inference, enabling efficient all-to-all communication and supporting low-precision operations [4]. - On February 26, DeepGEMM was open-sourced, a library for FP8 general matrix multiplication, featuring fine-grained scaling and supporting both standard and MoE group GEMM [5]. - On February 27, two tools (DualPipe and EPLB) and a performance analysis dataset were released, along with detailed explanations of parallel computing optimization techniques [5]. - On February 28, the release of 3FS was announced, which serves as an accelerator for all DeepSeek data access [6]. Group 2: API and Pricing Adjustments - DeepSeek reopened its API recharge function on February 25 after a 19-day suspension, accompanied by a structural adjustment in pricing [9]. - The pricing for the DeepSeek-chat based on the V3 model is set at 2 yuan per million input tokens and 8 yuan per million output tokens, while the DeepSeek-reasoner based on the R1 model is priced at 4 yuan per million input tokens and 16 yuan per million output tokens [9]. - On February 26, a peak-shifting discount pricing strategy was introduced, with significant reductions during specific hours, offering V3 at 50% off and R1 at 25% off [10]. Group 3: Market Impact - According to CITIC Securities, DeepSeek's open-source initiatives are expected to catalyze the AI+ theme, enhancing AI penetration across various industries and increasing demand for computing power [7].
速递|大模型价格战再升级,DeepSeek降价最高达75%
Z Finance· 2025-02-27 11:36
Group 1 - DeepSeek announced a significant reduction in API calling prices during off-peak hours, with R1 and V3 model APIs seeing price cuts of 75% and 50% respectively [1] - The off-peak hours defined by DeepSeek cover the daytime in Europe and the US, indicating a strategic pricing approach to attract developers [1] - This pricing strategy follows a trend initiated last year, which sparked a price war in the AI model market, particularly after the release of DeepSeek's V2 model [1] Group 2 - The recent price cuts by DeepSeek have caused significant reactions in both domestic and international AI industries, highlighting the competitive landscape [1] - Following the launch of its AI assistant, DeepSeek's pricing strategy has prompted responses from competitors like OpenAI and Google, who have also adjusted their pricing [1]
特斯拉市值跌破1万亿美元!百度斥资21亿美元收购YY直播业务!微信测试版支持电脑上收红包!DeepSeek重新开放API充值!
新浪财经· 2025-02-26 00:47
Group 1 - Tesla's stock price dropped over 8%, resulting in a market value loss of approximately $89.2 billion, bringing its total market value below $1 trillion [2][3][4] - Major technology stocks experienced declines, with Nvidia and Google down over 2%, and Microsoft and Meta down over 1% [4] - Chinese concept stocks mostly rose, with the Nasdaq China Golden Dragon Index increasing by 0.58%, driven by significant gains in Li Auto and XPeng [4] Group 2 - Baidu announced a $2.1 billion acquisition of YY Live from JOYY, with plans to invest the returned funds into cloud and AI infrastructure [7][8] - WeChat's Windows 4.0.2 test version now supports receiving red envelopes on PC, although sending red envelopes from PC is not yet available [9][11] - DeepSeek reopened its API recharge service, with updated pricing for token inputs and outputs, following a previous halt due to server resource constraints [12][14]