DeepSeek
Search documents
腾讯研究院AI速递 20250529
腾讯研究院· 2025-05-28 15:06
Group 1 - Salesforce acquired Informatica for $8 billion, marking its largest deal since the acquisition of Slack in 2021 [1] - The acquisition aims to integrate both companies' AI engines to create a trusted data infrastructure that supports enterprise-level deployment of agent-based AI systems [1] - Data management capabilities are becoming a key differentiator for enterprise AI products, and Salesforce is enhancing its data management strategy through this acquisition [1] Group 2 - DeepSeek's R1 model has completed a minor version upgrade, now available for experience on its official website, app, and mini-program [2] - The upgraded R1 model shows significant improvement in programming capabilities, quickly generating high-quality dynamic weather cards with detailed design and interactive animations [2] - The update may have utilized the DeepSeek-V3-0324 model, while the anticipated R2 version has yet to be released [2] Group 3 - Anthropic launched a voice mode for Claude, allowing users to discuss documents and images via voice, with five unique voice tones available [3] - Users can switch freely between text and voice, and after conversations, they can view text records and summaries [3] - The voice feature has usage limitations, with voice conversations counting towards regular usage limits, and the Google Workspace connector is only available to paid users [3] Group 4 - AKOOL released the world's first real-time camera, AKOOL Live Camera, capable of low-latency virtual digital humans, multilingual translation, face replacement, and AI video generation [4] - This technology breaks traditional video generation limitations through 4D facial mapping and neural voice engines, achieving environment perception and emotional response, with 94% of blind tests unable to distinguish between real and fake [4][5] - The product signifies a shift in AI video from "pre-fabrication" to "intelligent response," heralding a second revolution in AI video following Sora [5] Group 5 - Tencent Hunyuan released an open-source voice digital human model, HunyuanVideo-Avatar, which can generate videos of characters speaking or singing naturally from just one image and one audio clip [6] - The model supports various framing options and can understand image environments and audio emotions, automatically generating natural expressions, lip-syncing, and full-body movements [6] - This technology has been applied in Tencent's music products and is suitable for short video creation, e-commerce advertising, and supports multiple styles and interactive scenarios [6] Group 6 - ByteDance's Kouzi Space launched a one-click text-to-podcast feature, capable of generating "human-level" multi-character dialogue audio in minutes, a task that previously took hours [7] - This feature has broad applications, converting hot news into podcasts, turning course notes into audio lessons, and creating audio summaries of meeting minutes, as well as providing emotional counseling and shopping guides [7] - Kouzi Space can also integrate podcast production with website creation, opening up multi-functional applications and marking the era of AI working for the general public [7] Group 7 - SpAItial raised $13 million in seed funding, founded by former Synthesia co-founder Matthias Neisner, focusing on text-to-realistic 3D environment technology [8] - The company has assembled a luxury tech team from Meta and Google, aiming to create not only realistic but also interactive 3D worlds, competing with Odyssey and World Labs [8] - The team targets applications in game development, entertainment, and architectural visualization, with long-term goals including enabling ordinary users to quickly create games and potentially replace CAD software [8] Group 8 - Tencent Yuanbao has integrated with WeChat Reading and Qidian Reading, allowing users to click on underlined book titles to jump directly to reading [9] - Users can obtain book recommendations with one click, with each book featuring a jump link, facilitating a seamless transition from "book hoarding" to "reading" [10] - This integration allows users to chat with Yuanbao while reading, interpret concepts, generate mind maps, and even simulate conversations in the author's tone [10] Group 9 - SpaceX's Starship "Ninth Flight" experienced an explosion during recovery landing, despite successfully using a reused B14.2 booster [11] - The test focused on validating booster reuse technology, spacecraft payload deployment capabilities, and optimizing design to shorten launch intervals and reduce costs [11] - SpaceX is expanding its manufacturing and launch capabilities through new facilities in Florida and innovative designs to enhance system efficiency [11] Group 10 - Anthropic's Claude 4 core team emphasizes the model's independent working capabilities and long-term task handling abilities [12] - The team predicts that by 2025, reinforcement learning will significantly enhance large language model training, improving the model's ability to handle long-term tasks [12] - Researchers believe that the focus should be on raising the model's baseline rather than pursuing extremes, with user interactions evolving from minute-level to hour-level engagements [12]
DeepSeek R1,新升级!
第一财经· 2025-05-28 14:15
5月28日晚,第一财经记者获悉,DeepSeek小助手在官方交流群中发布通知称,DeepSeek R1模型已 完成小版本试升级,欢迎前往官方网页、App、小程序测试(打开深度思考),API接口和使用方式 保持不变。关于市场期待的DeepSeek R2模型目前仍未有消息。 ...
Claude 4 核心成员访谈:提升 Agent 独立工作能力,强化模型长程任务能力是关键
Founder Park· 2025-05-28 13:13
Core Insights - The main change expected in 2025 is the effective application of reinforcement learning (RL) in language models, particularly through verifiable rewards, leading to expert-level performance in competitive programming and mathematics [4][6][7]. Group 1: Reinforcement Learning and Model Development - Reinforcement learning has activated existing knowledge in models, allowing them to organize solutions rather than learning from scratch [4][11]. - The introduction of Opus 4 has significantly improved context management for multi-step actions and long-term tasks, enabling models to perform meaningful reasoning and execution over extended periods without frequent user intervention [4][32]. - The current industry trend prioritizes computational power over data and human feedback, which may evolve as models become more capable of learning in real-world environments [4][21]. Group 2: Future of AI Agents - The potential for AI agents to automate intellectual tasks could lead to significant changes in the global economy and labor market, with predictions of "plug-and-play" white-collar AI employees emerging within the next two years [7][9]. - The interaction frequency between users and models is expected to shift from seconds and minutes to hours, allowing users to manage multiple models simultaneously, akin to a "fleet management" approach [34][36]. - The development of AI agents capable of completing tasks independently is anticipated to accelerate, with models expected to handle several hours of work autonomously by the end of the year [36][37]. Group 3: Model Capabilities and Limitations - Current models still lack self-awareness in the philosophical sense, although they exhibit a form of meta-cognition by expressing uncertainty about their answers [39][40]. - The models can simulate self-awareness but do not possess a continuous identity or memory unless explicitly designed with external memory systems [41][42]. - The understanding of model behavior and decision-making processes is still evolving, with ongoing research into mechanisms of interpretability and the identification of features that drive model outputs [46][48]. Group 4: Future Developments and Expectations - The frequency of model releases is expected to increase significantly, with advancements in reinforcement learning leading to rapid improvements in model capabilities [36][38]. - The exploration of long-term learning mechanisms and the ability for models to evolve through practical experience is a key area of focus for future research [30][29]. - The ultimate goal of model interpretability is to establish a clear understanding of how models make decisions, which is crucial for ensuring their reliability and safety in various applications [46][47].
还在等DeepSeek R2?刚刚,DeepSeek R1模型小版本试升级已完成!优化了这些方面
Mei Ri Jing Ji Xin Wen· 2025-05-28 13:03
Core Viewpoint - DeepSeek has announced the completion of a minor version upgrade for its R1 model, inviting users to test the new features on its official website, app, and mini-programs while maintaining existing API interfaces and usage methods [1]. Group 1: Upgrade Features - The upgrade focuses on several key areas: 1. Response quality optimization, enhancing accuracy in complex reasoning and multi-step calculations, as well as improving coherence and clarity in long text understanding and generation, and reliability in specialized outputs like mathematics and programming [2]. 2. A slight improvement in response speed, with a 10% to 20% reduction in latency, particularly when processing long text inputs across web, app, and API interfaces [2][4]. 3. Enhanced dialogue stability, with improved context memory, especially in long conversations, supporting up to 128K context and reducing instances of "forgetting settings" or "going off track" [4]. 4. API and interface compatibility remains stable, with no changes to API calling methods, parameters, or return structures, allowing users to seamlessly use the new version without adjustments [5]. Group 2: Upgrade Process - The upgrade is termed a "trial upgrade" due to: 1. It being a "gray release," where a portion of users will experience the upgrade first [6]. 2. The company will collect feedback to ensure stability before a full rollout [6]. 3. Users of the official app, website, or mini-program may already be using the upgraded version in "Deep Thinking" mode [6]. Group 3: Future Developments - There is ongoing speculation regarding the release of the DeepSeek R2 model, with the company previously denying rumors about its launch on March 17 [6].
清华天才杨植麟的“理想国”,为何败给梁文锋?
凤凰网财经· 2025-05-28 12:51
Core Viewpoint - The article discusses the journey of Yang Zhilin, a prominent figure in the AI industry, highlighting the challenges faced by the younger generation of entrepreneurs in the rapidly evolving tech landscape, particularly in the context of AI 2.0 and competition with established players like DeepSeek [6][28]. Group 1: Background and Early Career - Yang Zhilin, born in 1992, was influenced by cultural icons like Haruki Murakami and Pink Floyd, which shaped his artistic and entrepreneurial aspirations [4]. - He pursued a PhD at Carnegie Mellon University, where he made significant contributions to AI, including the development of Transformer-XL and XLNet, which have been widely adopted in major AI products [9][10]. Group 2: AI Industry Landscape - The AI industry has seen a shift from mobile internet and blockchain to AI 2.0, marked by the launch of ChatGPT by OpenAI in November 2022, which has generated significant interest and investment in AI technologies [6][7]. - The 90s generation, including Yang, feels a sense of urgency to capitalize on AI as a potential opportunity for success, given their previous experiences with limited economic benefits from earlier tech trends [7][8]. Group 3: Company Development and Challenges - Yang founded "Yue Zhi An Mian" (月之暗面) in 2023, focusing on AGI (Artificial General Intelligence) and secured $200 million in initial funding from prominent investors [13][14]. - The company faced challenges, including a public relations crisis related to a reported $40 million cash-out after a $1 billion funding round led by Alibaba, which raised questions about its operational focus [14][15]. Group 4: Competition with DeepSeek - Yang's company struggled to compete with DeepSeek, founded by Liang Wenfeng, which adopted a more pragmatic approach to commercialization and technology development [13][28]. - DeepSeek's rapid success and user acquisition contrasted with Yang's strategy, which relied heavily on large-scale advertising and user data collection without significant product iteration [18][21]. Group 5: Ideological Divide - The competition between Yang and Liang represents a clash between idealism in technology development and the practical realities of business [22][23]. - Yang's focus on AGI and long-term vision may hinder immediate product development and market competitiveness, while DeepSeek's approach emphasizes rapid commercialization and user engagement [24][25]. Group 6: Future Outlook - The article suggests that despite current setbacks, opportunities still exist for Yang and other young entrepreneurs in the AI space, as the industry continues to evolve and new technological paradigms emerge [29][30]. - The narrative emphasizes the importance of balancing idealism with practical business strategies to achieve sustainable success in the competitive AI landscape [27][28].
DeepSeek为首届“东盟-中国-海合会峰会”谱写歌词
财富FORTUNE· 2025-05-28 10:01
5月27日,第一届"东盟-中国-海合会峰会" 在马来西亚吉隆坡举行,国务院总理李强与马来西亚总理安 瓦尔一同出席开幕式晚宴,并聆听了七位不同国家"顶流"艺术家的演唱。 第一位登台的艺术家是"沙特历史上首位女歌手"Dalia Mubarak,这位90后女性是沙特年轻一代的文化象 征。尚雯婕作为中国歌手的代表,献唱了《甜蜜蜜》和《不鼓自鸣》。 更重要的是,DeepSeek与人类艺术家共同为峰会谱写了主题曲《命运共同体》—— 在数百位嘉宾的见 证下,主办方将18张分别代表峰会参与国的明信片的视觉素材输入了内嵌DeekSeek的人工智能,生成 了绝妙的歌词。 本次晚宴表演的歌手均为女性,与会人员纷纷为女性艺术家,以及中国人工智能DeepSeek点赞。(财 富中文网) 在财富Plus,网友们对这篇文章发表了许多有深度和思想的观点。一起来看看吧。也欢迎你加入我们,谈谈你的 想法。今日其他热议话题: 查看《日本34年来首次丢失全球最大债权国地位》的精彩观点 查看《王兴:将采取一切必要措施来赢得竞争》的精彩观点 推荐阅读 FORTUNE_ FORTUNE_ FORTUNE t富》中国40位40岁以下的商界精英 申报入国|20 ...
杨植麟,一个90后理想主义者的悬浮
Hu Xiu· 2025-05-28 06:01
Group 1 - Yang Zhilin, a 1992-born AI entrepreneur, has a background in music and literature, which influences his approach to technology and innovation [1][6] - He pursued a PhD at Carnegie Mellon University, where he published two significant papers, Transformer-XL and XLNet, which have been widely cited and adopted in major AI products [6][7] - After the launch of ChatGPT by OpenAI, Yang founded "The Dark Side of the Moon" (月之暗面) focusing on AGI (Artificial General Intelligence) [8][10] Group 2 - The AI landscape has evolved through various technological waves, with the current focus on AI 2.0, marked by the emergence of ChatGPT [3][4] - The competition in the AI sector is intensifying, with major players like DeepSeek gaining traction and overshadowing other startups like Yang's Kimi [18][22] - Yang's company received significant funding, including a $200 million investment from Sequoia China and ZhenFund, but faced challenges related to shareholder disputes and public scrutiny [10][12] Group 3 - The competition between Yang's Kimi and DeepSeek highlights a clash between technological idealism and commercial realism, with DeepSeek adopting a more pragmatic approach to market entry [24][28] - Kimi's user base has declined significantly, from 36 million to 18.2 million, as it struggles to keep pace with competitors [29] - Yang's focus on AGI may hinder Kimi's product iteration speed and commercial viability, as the market demands quicker adaptations [25][30] Group 4 - The AI industry is witnessing a shift towards open-source and low-cost strategies, exemplified by DeepSeek's approach, which contrasts with Kimi's more traditional methods [27][28] - The success of DeepSeek has prompted major tech companies to accelerate their AI model development, creating a more competitive environment for startups [32][34] - Despite setbacks, there remains potential for innovation and growth in the AI sector, suggesting that opportunities for Yang and his peers may still exist [36]
日媒:美国需要更明智、可持续的AI策略
Huan Qiu Wang Zi Xun· 2025-05-27 23:12
来源:环球时报 日本《日经亚洲评论》5月26日文章,原题:瞄准DeepSeek不会修复华盛顿的对华人工智能战略缺陷 美 国政府似乎准备对"深度求索"(DeepSeek)采取一系列行动,DeepSeek是一家快速崛起的中国人工智能 (AI)初创企业,其先进的人工智能模型已经迅速受到全球开发人员和技术爱好者的关注。最近,美 国开始在盟友和业界的反对下修改AI扩散规则。 从表面看,全面禁令也可能适得其反。如果美国走得太远,例如向云提供商施压要求其下架开源模型, 或封锁GitHub托管的AI工具,那么美国就有可能损害自身作为互联网开放和创新捍卫者的信誉。这么 做还将授人以柄:美国缺乏在公平环境中开展竞争的信心,正在诉诸于禁令而非寻求突破来保持竞争 力。这让人们感觉不像是一个有原则的立场,而更像是美国对任何首先获得全球关注的中国AI公司都 会发起的针对性打击。这种变本加厉的限制可能会在更大程度上惩罚美国企业而非中国企业。 美国的政策需要不断发展。不断收紧硬件出口管控而忽视开源模型快速扩散能力的策略不仅不完整,而 且现在还变得过时甚至倒退。与全方位禁止相比,美国能以更有效的方式与中国开展竞争。美国可以与 其他国家,特别是 ...
Google搜索转型,Perplexity入不敷出,AI搜索还是个好赛道吗?
Founder Park· 2025-05-27 12:20
Core Viewpoint - The article discusses the transformation of Google's search business towards AI-driven search modes, highlighting the challenges faced by traditional search engines in the face of emerging AI technologies and competition from Chatbot-integrated platforms [4][24]. Group 1: Google's AI Search Transformation - Google announced the launch of its AI Mode powered by Gemini, which allows for natural language interaction and structured answers, moving away from traditional keyword-based searches [2][4]. - In 2024, Google's search business is projected to generate $175 billion, accounting for over half of its total revenue, indicating the significant financial stakes involved in this transition [4]. - Research suggests that Google's search market share has dropped from over 90% to between 65% and 70% due to the rise of AI Chatbots, prompting the need for a strategic shift [4][24]. Group 2: Challenges for AI Search Engines - Perplexity, an AI search engine, saw its user visits increase from 45 million to 129 million, a growth of 186%, but faced a net loss of $68 million in 2024 due to high operational costs and reliance on discounts for subscription revenue [9][11]. - The overall funding for AI search products has decreased, with only 10 products raising a total of $893 million from August 2024 to April 2025, compared to 15 products raising $1.28 billion in the previous period [11][12]. - The competitive landscape for AI search engines has worsened, with many smaller players struggling to secure funding and differentiate themselves from larger companies [11][12][25]. Group 3: Shift Towards Niche Search Engines - The article notes a trend towards more specialized search engines, focusing on specific industries or use cases, as general AI search engines face increasing competition from integrated Chatbot functionalities [13][25]. - Examples of niche search engines include Consensus, a health and medical search engine, and Qura, a legal search engine, both of which cater to specific professional audiences [27][30]. - The overall direction for AI search engines is towards being smaller, more specialized, and focused on delivering unique value propositions to specific user groups [13][26]. Group 4: Commercialization Challenges - The commercialization of AI search remains a significant challenge, with Google exploring ways to integrate sponsored content into its AI responses while facing potential declines in click-through rates for traditional ads [43]. - The article emphasizes the need for AI search engines to deliver more reliable and usable results, either through specialized information or direct output capabilities, to remain competitive [43][24].
大模型的人味儿,从何而来?
虎嗅APP· 2025-05-27 11:37
本文来自微信公众号: AI故事计划 ,作者:李奕萱,编辑:温丽虹,原文标题:《我,文科生,教 AI回答没有标准答案的问题》,题图来自:视觉中国 羽山在复旦研究了10年哲学。今年5月,他通过了毕业论文答辩,正在准备博士学位的授予资料。 在思考毕业去向时,他偶然在小红书的官网上看到了招募通知,岗位叫"AI人文训练师"。羽山当即 投递了简历,一个念头从脑海中冒了出来:AI行业终于走到了需要人文研究者的阶段。 对AI进行人文训练,属于模型"后训练"的范畴。在"后训练"中特别强调人文面向,尚未成为行业通 行的做法。但有两家公司值得关注,一家是全球头部的大模型公司Anthropic聘请了哲学系博士,负 责模型后训练的人类价值对齐与微调。在国内,DeepSeek年初传出消息,招聘了北大中文系学生担 任"数据百晓生",对模型做后训练。这被认为是DeepSeek文采出色的来源。 羽山入职之后才知道,小红书这支团队也刚组建不久。同事不算多,但都是来自知名高校人文学科的 硕士、博士生。 团队的首要任务,是设计AI的观念和个性。 听起来很玄。羽山遇到的第一个问题是,"我得了胰腺癌"应该如何回答? 如果把这句话发给市面上主流的AI产品 ...