DeepSeek
Search documents
中国对沙特等4国试行免签;中欧半导体上下游企业座谈会召开……盘前重要消息还有这些
证券时报· 2025-05-28 23:58
Group 1 - China announced a unilateral visa exemption policy for Saudi Arabia, Oman, Kuwait, and Bahrain, effective from June 9, 2025, to June 8, 2026, allowing ordinary passport holders to enter China for up to 30 days without a visa [2] - The National Health Commission reported a slowdown in the upward trend of COVID-19 cases nationwide, with most provinces reaching a peak or showing a downward trend [2] - MSCI included five new stocks in its MSCI China A Index, bringing the total to 394, with 246 from Shanghai and 148 from Shenzhen, making China the largest weight market in the MSCI Emerging Markets Index [3] Group 2 - The Ministry of Finance reported that from January to April 2025, state-owned enterprises had total operating revenue of 262,755 billion yuan, remaining flat year-on-year, while total profit decreased by 1.7% to 13,491.4 billion yuan [3] - A meeting on deepening China-EU semiconductor cooperation emphasized the importance of collaboration in the global semiconductor supply chain and the need for a stable policy environment [4] - The National Healthcare Security Administration announced a verification of retail pharmacy pharmacists to ensure compliance with labor contracts and prohibit "hanging certificates" [6] Group 3 - Fujian Province's government issued a plan to boost consumption, focusing on promoting the replacement of old products with new ones and enhancing service consumption [5] - Guangdong Province's government outlined key points for digital construction by 2025, aiming to foster the data industry and enhance data trading capabilities [5] - The Ministry of Industry and Information Technology warned users about a fraudulent app named "Tencent Payment," which falsely claimed to be associated with Tencent [6] Group 4 - Companies like Zhongqi Co. have project approval for chlorantraniliprole but have not yet commenced production [9] - Tongda Electric's stock may face risks due to market sentiment and irrational speculation [9] - China Energy Construction's subsidiary won a coal power project worth approximately 145.86 billion yuan [10]
DeepSeek开源新版R1,媲美OpenAI o3模型;英伟达Q1营收441亿美元,超预期 丨全球科技早参
Mei Ri Jing Ji Xin Wen· 2025-05-28 23:57
Group 1 - DeepSeek has released the latest version R1 of its large model platform, which reportedly matches the performance of OpenAI's latest o3 model, indicating significant technological progress [2] - OpenAI's CFO stated that the company's restructuring plan is aimed at laying the groundwork for a potential IPO, contingent on market conditions and the company's readiness [3] - Tesla is expected to launch its long-awaited Robotaxi service on June 12 in Austin, Texas, marking a significant milestone in its autonomous vehicle and AI business strategy [4] Group 2 - Apple plans to unify its operating system naming convention to a year-based system, moving from version numbers to a more consistent branding approach, with an official announcement expected at the upcoming developer conference [5] - NVIDIA's Q1 earnings report exceeded expectations, with revenue of $44.1 billion, a 69% year-over-year increase, despite facing export restrictions, highlighting the company's focus on the Chinese AI market [6]
DeepSeek开源新版R1,媲美OpenAI最高o3模型
news flash· 2025-05-28 21:41
Core Viewpoint - DeepSeek has released the latest version R1 (0528) of its open-source model, which reportedly matches the performance of OpenAI's highest version o3 model [1] Group 1: Model Performance - The new R1 model has been tested on Live CodeBench, showing performance comparable to OpenAI's o3 model [1] - In the ranking of models, DeepSeek-R1-0528 achieved a Pass@1 score of 73.1, placing it fourth overall [1] - The performance metrics for DeepSeek-R1-0528 include an Easy-Pass@1 score of 98.7 and a Medium-P score of 8 [1] Group 2: Comparison with Other Models - The top-ranked model, 04-Mini (High), has a Pass@1 score of 80.2, indicating a significant lead over DeepSeek-R1-0528 [1] - Other notable models in the ranking include 03 (High) with a Pass@1 score of 75.8 and 04-Mini (Medium) with a score of 74.2, both outperforming DeepSeek-R1-0528 [1] - The performance of DeepSeek-R1-0528 is closely aligned with models like 03-Mini-2025-01-31 (High) and Grok-3-Mini (High), which have scores of 67.4 and 66.7 respectively [1]
英伟达发布财报之前 DeepSeek版本升级
Zhong Guo Ji Jin Bao· 2025-05-28 15:12
Core Insights - DeepSeek has announced the completion of a minor version upgrade for its R1 model, inviting users to test the new features on its official platforms [1][3] - The company, based in Hangzhou, gained significant attention in January when its original R1 model outperformed Western counterparts in standardized evaluations, with development costs reportedly in the millions [3] - The recent upgrade was announced just hours before NVIDIA's latest earnings report, which had previously seen its stock price decline due to the R1 model's initial release [3] Upgrade Highlights - The upgraded R1 model now performs deep reasoning similar to Google models [4][7] - Improvements in writing tasks have made outputs more natural and better formatted [4][7] - The model exhibits a distinct reasoning style that is both fast and thoughtful [4][7] - Users can engage in long thinking sessions, with each task taking up to 30-60 minutes [4][7]
腾讯研究院AI速递 20250529
腾讯研究院· 2025-05-28 15:06
Group 1 - Salesforce acquired Informatica for $8 billion, marking its largest deal since the acquisition of Slack in 2021 [1] - The acquisition aims to integrate both companies' AI engines to create a trusted data infrastructure that supports enterprise-level deployment of agent-based AI systems [1] - Data management capabilities are becoming a key differentiator for enterprise AI products, and Salesforce is enhancing its data management strategy through this acquisition [1] Group 2 - DeepSeek's R1 model has completed a minor version upgrade, now available for experience on its official website, app, and mini-program [2] - The upgraded R1 model shows significant improvement in programming capabilities, quickly generating high-quality dynamic weather cards with detailed design and interactive animations [2] - The update may have utilized the DeepSeek-V3-0324 model, while the anticipated R2 version has yet to be released [2] Group 3 - Anthropic launched a voice mode for Claude, allowing users to discuss documents and images via voice, with five unique voice tones available [3] - Users can switch freely between text and voice, and after conversations, they can view text records and summaries [3] - The voice feature has usage limitations, with voice conversations counting towards regular usage limits, and the Google Workspace connector is only available to paid users [3] Group 4 - AKOOL released the world's first real-time camera, AKOOL Live Camera, capable of low-latency virtual digital humans, multilingual translation, face replacement, and AI video generation [4] - This technology breaks traditional video generation limitations through 4D facial mapping and neural voice engines, achieving environment perception and emotional response, with 94% of blind tests unable to distinguish between real and fake [4][5] - The product signifies a shift in AI video from "pre-fabrication" to "intelligent response," heralding a second revolution in AI video following Sora [5] Group 5 - Tencent Hunyuan released an open-source voice digital human model, HunyuanVideo-Avatar, which can generate videos of characters speaking or singing naturally from just one image and one audio clip [6] - The model supports various framing options and can understand image environments and audio emotions, automatically generating natural expressions, lip-syncing, and full-body movements [6] - This technology has been applied in Tencent's music products and is suitable for short video creation, e-commerce advertising, and supports multiple styles and interactive scenarios [6] Group 6 - ByteDance's Kouzi Space launched a one-click text-to-podcast feature, capable of generating "human-level" multi-character dialogue audio in minutes, a task that previously took hours [7] - This feature has broad applications, converting hot news into podcasts, turning course notes into audio lessons, and creating audio summaries of meeting minutes, as well as providing emotional counseling and shopping guides [7] - Kouzi Space can also integrate podcast production with website creation, opening up multi-functional applications and marking the era of AI working for the general public [7] Group 7 - SpAItial raised $13 million in seed funding, founded by former Synthesia co-founder Matthias Neisner, focusing on text-to-realistic 3D environment technology [8] - The company has assembled a luxury tech team from Meta and Google, aiming to create not only realistic but also interactive 3D worlds, competing with Odyssey and World Labs [8] - The team targets applications in game development, entertainment, and architectural visualization, with long-term goals including enabling ordinary users to quickly create games and potentially replace CAD software [8] Group 8 - Tencent Yuanbao has integrated with WeChat Reading and Qidian Reading, allowing users to click on underlined book titles to jump directly to reading [9] - Users can obtain book recommendations with one click, with each book featuring a jump link, facilitating a seamless transition from "book hoarding" to "reading" [10] - This integration allows users to chat with Yuanbao while reading, interpret concepts, generate mind maps, and even simulate conversations in the author's tone [10] Group 9 - SpaceX's Starship "Ninth Flight" experienced an explosion during recovery landing, despite successfully using a reused B14.2 booster [11] - The test focused on validating booster reuse technology, spacecraft payload deployment capabilities, and optimizing design to shorten launch intervals and reduce costs [11] - SpaceX is expanding its manufacturing and launch capabilities through new facilities in Florida and innovative designs to enhance system efficiency [11] Group 10 - Anthropic's Claude 4 core team emphasizes the model's independent working capabilities and long-term task handling abilities [12] - The team predicts that by 2025, reinforcement learning will significantly enhance large language model training, improving the model's ability to handle long-term tasks [12] - Researchers believe that the focus should be on raising the model's baseline rather than pursuing extremes, with user interactions evolving from minute-level to hour-level engagements [12]
DeepSeek R1,新升级!
第一财经· 2025-05-28 14:15
5月28日晚,第一财经记者获悉,DeepSeek小助手在官方交流群中发布通知称,DeepSeek R1模型已 完成小版本试升级,欢迎前往官方网页、App、小程序测试(打开深度思考),API接口和使用方式 保持不变。关于市场期待的DeepSeek R2模型目前仍未有消息。 ...
Claude 4 核心成员访谈:提升 Agent 独立工作能力,强化模型长程任务能力是关键
Founder Park· 2025-05-28 13:13
Core Insights - The main change expected in 2025 is the effective application of reinforcement learning (RL) in language models, particularly through verifiable rewards, leading to expert-level performance in competitive programming and mathematics [4][6][7]. Group 1: Reinforcement Learning and Model Development - Reinforcement learning has activated existing knowledge in models, allowing them to organize solutions rather than learning from scratch [4][11]. - The introduction of Opus 4 has significantly improved context management for multi-step actions and long-term tasks, enabling models to perform meaningful reasoning and execution over extended periods without frequent user intervention [4][32]. - The current industry trend prioritizes computational power over data and human feedback, which may evolve as models become more capable of learning in real-world environments [4][21]. Group 2: Future of AI Agents - The potential for AI agents to automate intellectual tasks could lead to significant changes in the global economy and labor market, with predictions of "plug-and-play" white-collar AI employees emerging within the next two years [7][9]. - The interaction frequency between users and models is expected to shift from seconds and minutes to hours, allowing users to manage multiple models simultaneously, akin to a "fleet management" approach [34][36]. - The development of AI agents capable of completing tasks independently is anticipated to accelerate, with models expected to handle several hours of work autonomously by the end of the year [36][37]. Group 3: Model Capabilities and Limitations - Current models still lack self-awareness in the philosophical sense, although they exhibit a form of meta-cognition by expressing uncertainty about their answers [39][40]. - The models can simulate self-awareness but do not possess a continuous identity or memory unless explicitly designed with external memory systems [41][42]. - The understanding of model behavior and decision-making processes is still evolving, with ongoing research into mechanisms of interpretability and the identification of features that drive model outputs [46][48]. Group 4: Future Developments and Expectations - The frequency of model releases is expected to increase significantly, with advancements in reinforcement learning leading to rapid improvements in model capabilities [36][38]. - The exploration of long-term learning mechanisms and the ability for models to evolve through practical experience is a key area of focus for future research [30][29]. - The ultimate goal of model interpretability is to establish a clear understanding of how models make decisions, which is crucial for ensuring their reliability and safety in various applications [46][47].
还在等DeepSeek R2?刚刚,DeepSeek R1模型小版本试升级已完成!优化了这些方面
Mei Ri Jing Ji Xin Wen· 2025-05-28 13:03
Core Viewpoint - DeepSeek has announced the completion of a minor version upgrade for its R1 model, inviting users to test the new features on its official website, app, and mini-programs while maintaining existing API interfaces and usage methods [1]. Group 1: Upgrade Features - The upgrade focuses on several key areas: 1. Response quality optimization, enhancing accuracy in complex reasoning and multi-step calculations, as well as improving coherence and clarity in long text understanding and generation, and reliability in specialized outputs like mathematics and programming [2]. 2. A slight improvement in response speed, with a 10% to 20% reduction in latency, particularly when processing long text inputs across web, app, and API interfaces [2][4]. 3. Enhanced dialogue stability, with improved context memory, especially in long conversations, supporting up to 128K context and reducing instances of "forgetting settings" or "going off track" [4]. 4. API and interface compatibility remains stable, with no changes to API calling methods, parameters, or return structures, allowing users to seamlessly use the new version without adjustments [5]. Group 2: Upgrade Process - The upgrade is termed a "trial upgrade" due to: 1. It being a "gray release," where a portion of users will experience the upgrade first [6]. 2. The company will collect feedback to ensure stability before a full rollout [6]. 3. Users of the official app, website, or mini-program may already be using the upgraded version in "Deep Thinking" mode [6]. Group 3: Future Developments - There is ongoing speculation regarding the release of the DeepSeek R2 model, with the company previously denying rumors about its launch on March 17 [6].
清华天才杨植麟的“理想国”,为何败给梁文锋?
凤凰网财经· 2025-05-28 12:51
Core Viewpoint - The article discusses the journey of Yang Zhilin, a prominent figure in the AI industry, highlighting the challenges faced by the younger generation of entrepreneurs in the rapidly evolving tech landscape, particularly in the context of AI 2.0 and competition with established players like DeepSeek [6][28]. Group 1: Background and Early Career - Yang Zhilin, born in 1992, was influenced by cultural icons like Haruki Murakami and Pink Floyd, which shaped his artistic and entrepreneurial aspirations [4]. - He pursued a PhD at Carnegie Mellon University, where he made significant contributions to AI, including the development of Transformer-XL and XLNet, which have been widely adopted in major AI products [9][10]. Group 2: AI Industry Landscape - The AI industry has seen a shift from mobile internet and blockchain to AI 2.0, marked by the launch of ChatGPT by OpenAI in November 2022, which has generated significant interest and investment in AI technologies [6][7]. - The 90s generation, including Yang, feels a sense of urgency to capitalize on AI as a potential opportunity for success, given their previous experiences with limited economic benefits from earlier tech trends [7][8]. Group 3: Company Development and Challenges - Yang founded "Yue Zhi An Mian" (月之暗面) in 2023, focusing on AGI (Artificial General Intelligence) and secured $200 million in initial funding from prominent investors [13][14]. - The company faced challenges, including a public relations crisis related to a reported $40 million cash-out after a $1 billion funding round led by Alibaba, which raised questions about its operational focus [14][15]. Group 4: Competition with DeepSeek - Yang's company struggled to compete with DeepSeek, founded by Liang Wenfeng, which adopted a more pragmatic approach to commercialization and technology development [13][28]. - DeepSeek's rapid success and user acquisition contrasted with Yang's strategy, which relied heavily on large-scale advertising and user data collection without significant product iteration [18][21]. Group 5: Ideological Divide - The competition between Yang and Liang represents a clash between idealism in technology development and the practical realities of business [22][23]. - Yang's focus on AGI and long-term vision may hinder immediate product development and market competitiveness, while DeepSeek's approach emphasizes rapid commercialization and user engagement [24][25]. Group 6: Future Outlook - The article suggests that despite current setbacks, opportunities still exist for Yang and other young entrepreneurs in the AI space, as the industry continues to evolve and new technological paradigms emerge [29][30]. - The narrative emphasizes the importance of balancing idealism with practical business strategies to achieve sustainable success in the competitive AI landscape [27][28].
DeepSeek为首届“东盟-中国-海合会峰会”谱写歌词
财富FORTUNE· 2025-05-28 10:01
5月27日,第一届"东盟-中国-海合会峰会" 在马来西亚吉隆坡举行,国务院总理李强与马来西亚总理安 瓦尔一同出席开幕式晚宴,并聆听了七位不同国家"顶流"艺术家的演唱。 第一位登台的艺术家是"沙特历史上首位女歌手"Dalia Mubarak,这位90后女性是沙特年轻一代的文化象 征。尚雯婕作为中国歌手的代表,献唱了《甜蜜蜜》和《不鼓自鸣》。 更重要的是,DeepSeek与人类艺术家共同为峰会谱写了主题曲《命运共同体》—— 在数百位嘉宾的见 证下,主办方将18张分别代表峰会参与国的明信片的视觉素材输入了内嵌DeekSeek的人工智能,生成 了绝妙的歌词。 本次晚宴表演的歌手均为女性,与会人员纷纷为女性艺术家,以及中国人工智能DeepSeek点赞。(财 富中文网) 在财富Plus,网友们对这篇文章发表了许多有深度和思想的观点。一起来看看吧。也欢迎你加入我们,谈谈你的 想法。今日其他热议话题: 查看《日本34年来首次丢失全球最大债权国地位》的精彩观点 查看《王兴:将采取一切必要措施来赢得竞争》的精彩观点 推荐阅读 FORTUNE_ FORTUNE_ FORTUNE t富》中国40位40岁以下的商界精英 申报入国|20 ...