Workflow
DeepSeek
icon
Search documents
DeepSeek R1模型完成“小版本试升级”,编程、逻辑理解上了一个层次!
华尔街见闻· 2025-05-29 00:57
Core Viewpoint - DeepSeek has released an updated version of its R1 model, enhancing its capabilities in semantic understanding, complex logical reasoning, and long text processing stability, amidst escalating competition in the AI sector [1][2]. Group 1: Model Enhancements - The R1 model has significantly improved its understanding capabilities, with user feedback indicating a notable increase in performance, particularly in activating parameters and presenting key information logically [3]. - Programming capabilities have also seen a substantial upgrade, with users reporting the ability to generate over 1000 lines of code without bugs [4]. - The R1 model is now considered competitive with Claude 4, a leading programming model [5]. Group 2: Previous Model Performance - Earlier this year, DeepSeek released the DeepSeek-V3-0324 model, which outperformed Claude-3.7-Sonnet in various assessments, particularly in mathematics and coding tasks, and was noted for its strong performance in reasoning tasks despite being a non-reasoning model [6]. - The cost-effectiveness of the R1 model is highlighted, being priced at only 1/11 of Claude-3.7-Sonnet and 1/277 of GPT-4.5, while also being open-source and free for commercial use [7]. Group 3: Market Impact - The emergence of the R1 model has led to a decline in global tech stocks, as investors question the necessity of significant investments by companies like Microsoft in developing advanced AI models and services [8]. Group 4: Future Developments - There is ongoing speculation regarding the release of the R2 model, which is expected to enhance code generation capabilities and reasoning in multiple languages. Initial plans for its release were set for early May [9]. - The R2 model is anticipated to utilize a more advanced mixture of experts model, with a total parameter count projected to reach 1.2 trillion, significantly reducing reasoning costs compared to GPT-4 [10]. - Despite the speculation, DeepSeek has not officially confirmed any details regarding the R2 model's release timeline [11].
DeepSeek-R1,升级;近万亿元规模金融机构,换帅!上海乐高乐园门票开售→
新华网财经· 2025-05-29 00:27
Group 1 - DeepSeek-R1 model has completed a minor version upgrade and is available for testing on official platforms [16] - China Ping An Group appointed Wang Xin as the Party Committee member and Secretary of Ping An Trust, with plans for him to become Chairman pending qualification approval [11] - Shanghai Lego Resort announced ticket and hotel sales starting May 28, with the first-day ticket price set at 549 yuan for standard tickets and 439 yuan for children and seniors [8] Group 2 - From January to April 2025, state-owned enterprises in China reported total operating income unchanged from the previous year, while total profit decreased by 1.7% [3] - The State Administration for Market Regulation is drafting regulations for quality safety supervision of key industrial products sold online, encouraging e-commerce platforms to conduct self-inspections [3] - The Ministry of Industry and Information Technology aims for over 85% of key processes in the electronic information manufacturing industry to be numerically controlled by 2027 [3] Group 3 - In April 2025, China issued new local government bonds totaling 693.3 billion yuan, including 140.7 billion yuan in general bonds and 552.6 billion yuan in special bonds [4] - The National Development Bank issued over 400 billion yuan in loans to provinces along the Yangtze River Economic Belt from January to April 2025, focusing on ecological governance and infrastructure [4] - The National Medical Insurance Administration found potential violations in retail pharmacies regarding the use of pharmacist information, affecting nearly 24 provinces and thousands of pharmacies [5] Group 4 - The International Robotics Skills Competition will be held in Shanghai, focusing on the application of robotics in industrial and household settings [6] - 14 listed companies have launched shareholder benefit activities this year, enhancing investor relations through product giveaways and discounts [6] - Guotai Junan Securities approved a capital increase of 1.5 billion yuan for Guotai Junan Futures to support its net capital [7] Group 5 - Yuao Co. plans to acquire 100% of Shenzhen Shangyangtong Technology Co., Ltd. for 1.58 billion yuan, pending shareholder and regulatory approvals [8] - Jiaying Pharmaceutical is under investigation by the China Securities Regulatory Commission for suspected information disclosure violations [8] - ByteDance announced a ban on third-party AI development software within its company, opting for its own AI programming tool, Trae [12]
中国对沙特等4国试行免签;中欧半导体上下游企业座谈会召开……盘前重要消息还有这些
证券时报· 2025-05-28 23:58
Group 1 - China announced a unilateral visa exemption policy for Saudi Arabia, Oman, Kuwait, and Bahrain, effective from June 9, 2025, to June 8, 2026, allowing ordinary passport holders to enter China for up to 30 days without a visa [2] - The National Health Commission reported a slowdown in the upward trend of COVID-19 cases nationwide, with most provinces reaching a peak or showing a downward trend [2] - MSCI included five new stocks in its MSCI China A Index, bringing the total to 394, with 246 from Shanghai and 148 from Shenzhen, making China the largest weight market in the MSCI Emerging Markets Index [3] Group 2 - The Ministry of Finance reported that from January to April 2025, state-owned enterprises had total operating revenue of 262,755 billion yuan, remaining flat year-on-year, while total profit decreased by 1.7% to 13,491.4 billion yuan [3] - A meeting on deepening China-EU semiconductor cooperation emphasized the importance of collaboration in the global semiconductor supply chain and the need for a stable policy environment [4] - The National Healthcare Security Administration announced a verification of retail pharmacy pharmacists to ensure compliance with labor contracts and prohibit "hanging certificates" [6] Group 3 - Fujian Province's government issued a plan to boost consumption, focusing on promoting the replacement of old products with new ones and enhancing service consumption [5] - Guangdong Province's government outlined key points for digital construction by 2025, aiming to foster the data industry and enhance data trading capabilities [5] - The Ministry of Industry and Information Technology warned users about a fraudulent app named "Tencent Payment," which falsely claimed to be associated with Tencent [6] Group 4 - Companies like Zhongqi Co. have project approval for chlorantraniliprole but have not yet commenced production [9] - Tongda Electric's stock may face risks due to market sentiment and irrational speculation [9] - China Energy Construction's subsidiary won a coal power project worth approximately 145.86 billion yuan [10]
DeepSeek开源新版R1,媲美OpenAI o3模型;英伟达Q1营收441亿美元,超预期 丨全球科技早参
Mei Ri Jing Ji Xin Wen· 2025-05-28 23:57
Group 1 - DeepSeek has released the latest version R1 of its large model platform, which reportedly matches the performance of OpenAI's latest o3 model, indicating significant technological progress [2] - OpenAI's CFO stated that the company's restructuring plan is aimed at laying the groundwork for a potential IPO, contingent on market conditions and the company's readiness [3] - Tesla is expected to launch its long-awaited Robotaxi service on June 12 in Austin, Texas, marking a significant milestone in its autonomous vehicle and AI business strategy [4] Group 2 - Apple plans to unify its operating system naming convention to a year-based system, moving from version numbers to a more consistent branding approach, with an official announcement expected at the upcoming developer conference [5] - NVIDIA's Q1 earnings report exceeded expectations, with revenue of $44.1 billion, a 69% year-over-year increase, despite facing export restrictions, highlighting the company's focus on the Chinese AI market [6]
DeepSeek开源新版R1,媲美OpenAI最高o3模型
news flash· 2025-05-28 21:41
Core Viewpoint - DeepSeek has released the latest version R1 (0528) of its open-source model, which reportedly matches the performance of OpenAI's highest version o3 model [1] Group 1: Model Performance - The new R1 model has been tested on Live CodeBench, showing performance comparable to OpenAI's o3 model [1] - In the ranking of models, DeepSeek-R1-0528 achieved a Pass@1 score of 73.1, placing it fourth overall [1] - The performance metrics for DeepSeek-R1-0528 include an Easy-Pass@1 score of 98.7 and a Medium-P score of 8 [1] Group 2: Comparison with Other Models - The top-ranked model, 04-Mini (High), has a Pass@1 score of 80.2, indicating a significant lead over DeepSeek-R1-0528 [1] - Other notable models in the ranking include 03 (High) with a Pass@1 score of 75.8 and 04-Mini (Medium) with a score of 74.2, both outperforming DeepSeek-R1-0528 [1] - The performance of DeepSeek-R1-0528 is closely aligned with models like 03-Mini-2025-01-31 (High) and Grok-3-Mini (High), which have scores of 67.4 and 66.7 respectively [1]
英伟达发布财报之前 DeepSeek版本升级
Zhong Guo Ji Jin Bao· 2025-05-28 15:12
据用户反馈,DeepSeek升级后的模型,思维链 (CoT) 的行为似乎发生了显著变化。 大家好,关注一下DeepSeek的最新消息! 5月28日,DeepSeek官方宣布DeepSeek R1模型已完成小版本试升级,欢迎前往官方网页、APP、小程序测试(打开深度思考),API 接口和使用方式保持 不变。 据DeepSeek小助手在官方微信群中的发言,DeepSeek已完成一次"小版本试升级"的操作,并通知用户可以开始测试。但公司未披露此次升级的具体细节。 这家总部位于杭州的初创企业在今年1月震惊了全球科技行业,当时他们发布了原始版本的R1模型,在多个标准化评测中超越了西方同行,据称研发成本 仅为数百万美元。这一消息引发全球科技股大幅波动,投资者开始质疑大型科技公司是否还需要投入巨额资金来构建AI服务。 R1模型的首次亮相,使其创始人梁文锋一跃成为科技界的明星人物,也成为中国有能力与硅谷顶尖公司竞争的象征。这一发布同时引发了新一轮人工智 能模型竞赛。 DeepSeek的本次升级是在英伟达发布最新财报前数小时宣布的。作为全球领先的AI芯片制造商,英伟达的股价在1月因R1的发布而遭遇重挫。 也有用户总结了更新后的 ...
腾讯研究院AI速递 20250529
腾讯研究院· 2025-05-28 15:06
Group 1 - Salesforce acquired Informatica for $8 billion, marking its largest deal since the acquisition of Slack in 2021 [1] - The acquisition aims to integrate both companies' AI engines to create a trusted data infrastructure that supports enterprise-level deployment of agent-based AI systems [1] - Data management capabilities are becoming a key differentiator for enterprise AI products, and Salesforce is enhancing its data management strategy through this acquisition [1] Group 2 - DeepSeek's R1 model has completed a minor version upgrade, now available for experience on its official website, app, and mini-program [2] - The upgraded R1 model shows significant improvement in programming capabilities, quickly generating high-quality dynamic weather cards with detailed design and interactive animations [2] - The update may have utilized the DeepSeek-V3-0324 model, while the anticipated R2 version has yet to be released [2] Group 3 - Anthropic launched a voice mode for Claude, allowing users to discuss documents and images via voice, with five unique voice tones available [3] - Users can switch freely between text and voice, and after conversations, they can view text records and summaries [3] - The voice feature has usage limitations, with voice conversations counting towards regular usage limits, and the Google Workspace connector is only available to paid users [3] Group 4 - AKOOL released the world's first real-time camera, AKOOL Live Camera, capable of low-latency virtual digital humans, multilingual translation, face replacement, and AI video generation [4] - This technology breaks traditional video generation limitations through 4D facial mapping and neural voice engines, achieving environment perception and emotional response, with 94% of blind tests unable to distinguish between real and fake [4][5] - The product signifies a shift in AI video from "pre-fabrication" to "intelligent response," heralding a second revolution in AI video following Sora [5] Group 5 - Tencent Hunyuan released an open-source voice digital human model, HunyuanVideo-Avatar, which can generate videos of characters speaking or singing naturally from just one image and one audio clip [6] - The model supports various framing options and can understand image environments and audio emotions, automatically generating natural expressions, lip-syncing, and full-body movements [6] - This technology has been applied in Tencent's music products and is suitable for short video creation, e-commerce advertising, and supports multiple styles and interactive scenarios [6] Group 6 - ByteDance's Kouzi Space launched a one-click text-to-podcast feature, capable of generating "human-level" multi-character dialogue audio in minutes, a task that previously took hours [7] - This feature has broad applications, converting hot news into podcasts, turning course notes into audio lessons, and creating audio summaries of meeting minutes, as well as providing emotional counseling and shopping guides [7] - Kouzi Space can also integrate podcast production with website creation, opening up multi-functional applications and marking the era of AI working for the general public [7] Group 7 - SpAItial raised $13 million in seed funding, founded by former Synthesia co-founder Matthias Neisner, focusing on text-to-realistic 3D environment technology [8] - The company has assembled a luxury tech team from Meta and Google, aiming to create not only realistic but also interactive 3D worlds, competing with Odyssey and World Labs [8] - The team targets applications in game development, entertainment, and architectural visualization, with long-term goals including enabling ordinary users to quickly create games and potentially replace CAD software [8] Group 8 - Tencent Yuanbao has integrated with WeChat Reading and Qidian Reading, allowing users to click on underlined book titles to jump directly to reading [9] - Users can obtain book recommendations with one click, with each book featuring a jump link, facilitating a seamless transition from "book hoarding" to "reading" [10] - This integration allows users to chat with Yuanbao while reading, interpret concepts, generate mind maps, and even simulate conversations in the author's tone [10] Group 9 - SpaceX's Starship "Ninth Flight" experienced an explosion during recovery landing, despite successfully using a reused B14.2 booster [11] - The test focused on validating booster reuse technology, spacecraft payload deployment capabilities, and optimizing design to shorten launch intervals and reduce costs [11] - SpaceX is expanding its manufacturing and launch capabilities through new facilities in Florida and innovative designs to enhance system efficiency [11] Group 10 - Anthropic's Claude 4 core team emphasizes the model's independent working capabilities and long-term task handling abilities [12] - The team predicts that by 2025, reinforcement learning will significantly enhance large language model training, improving the model's ability to handle long-term tasks [12] - Researchers believe that the focus should be on raising the model's baseline rather than pursuing extremes, with user interactions evolving from minute-level to hour-level engagements [12]
DeepSeek R1,新升级!
第一财经· 2025-05-28 14:15
5月28日晚,第一财经记者获悉,DeepSeek小助手在官方交流群中发布通知称,DeepSeek R1模型已 完成小版本试升级,欢迎前往官方网页、App、小程序测试(打开深度思考),API接口和使用方式 保持不变。关于市场期待的DeepSeek R2模型目前仍未有消息。 ...
Claude 4 核心成员访谈:提升 Agent 独立工作能力,强化模型长程任务能力是关键
Founder Park· 2025-05-28 13:13
Core Insights - The main change expected in 2025 is the effective application of reinforcement learning (RL) in language models, particularly through verifiable rewards, leading to expert-level performance in competitive programming and mathematics [4][6][7]. Group 1: Reinforcement Learning and Model Development - Reinforcement learning has activated existing knowledge in models, allowing them to organize solutions rather than learning from scratch [4][11]. - The introduction of Opus 4 has significantly improved context management for multi-step actions and long-term tasks, enabling models to perform meaningful reasoning and execution over extended periods without frequent user intervention [4][32]. - The current industry trend prioritizes computational power over data and human feedback, which may evolve as models become more capable of learning in real-world environments [4][21]. Group 2: Future of AI Agents - The potential for AI agents to automate intellectual tasks could lead to significant changes in the global economy and labor market, with predictions of "plug-and-play" white-collar AI employees emerging within the next two years [7][9]. - The interaction frequency between users and models is expected to shift from seconds and minutes to hours, allowing users to manage multiple models simultaneously, akin to a "fleet management" approach [34][36]. - The development of AI agents capable of completing tasks independently is anticipated to accelerate, with models expected to handle several hours of work autonomously by the end of the year [36][37]. Group 3: Model Capabilities and Limitations - Current models still lack self-awareness in the philosophical sense, although they exhibit a form of meta-cognition by expressing uncertainty about their answers [39][40]. - The models can simulate self-awareness but do not possess a continuous identity or memory unless explicitly designed with external memory systems [41][42]. - The understanding of model behavior and decision-making processes is still evolving, with ongoing research into mechanisms of interpretability and the identification of features that drive model outputs [46][48]. Group 4: Future Developments and Expectations - The frequency of model releases is expected to increase significantly, with advancements in reinforcement learning leading to rapid improvements in model capabilities [36][38]. - The exploration of long-term learning mechanisms and the ability for models to evolve through practical experience is a key area of focus for future research [30][29]. - The ultimate goal of model interpretability is to establish a clear understanding of how models make decisions, which is crucial for ensuring their reliability and safety in various applications [46][47].
还在等DeepSeek R2?刚刚,DeepSeek R1模型小版本试升级已完成!优化了这些方面
Mei Ri Jing Ji Xin Wen· 2025-05-28 13:03
每经编辑|黄胜 5月28日,DeepSeek官方宣布DeepSeek R1模型已完成小版本试升级,欢迎前往官方网页、APP、小程序测试(打开深度思考),API 接口和使用 方式保持不变。 关于这次试升级的内容,小编询问DeepSeek后得到的反馈是,根据DeepSeek内部优化方向和自身的感知,这次升级主要集中在以下几个方面: 1. 响应质量优化:复杂推理、多步骤计算更准确;长文理解与生成更连贯、逻辑更清晰;数学、编程等专业性输出更可靠。 2. 官方会收集反馈,确保稳定后再全面推送; 3. 如果你使用官方 App、网页或小程序,现在打开"深度思考"模式,很可能已经用上升级版的我了! 另一方面,DeepSeek R2模型究竟何时发布,一直是大家关注的焦点。此前,3月11日,针对DeepSeek将在3月17日发布下一代R2模型的传闻, DeepSeek官方企业咨询账号在用户群中回应称,"辟谣:R2发布为假消息"。 图片来源:视觉中国 3. 对话稳定性增强:上下文记忆更稳定,尤其在超长对话中(支持最多128K上下文);减少偶尔"遗忘设定"或"跑偏"的情况。 4. API 和接口兼容性保持稳定:如公告所说:API 调 ...