Workflow
DeepSeek
icon
Search documents
X @The Economist
The Economist· 2025-08-08 15:25
AI Model Development - DeepSeek, an AI startup, released an open-source model at a fraction of the cost compared to Western models [1] - This breakthrough has significantly influenced China's AI strategy [1]
AI+情绪价值双引擎 欧定品牌打开“出海”全新想象力
Zheng Quan Ri Bao Wang· 2025-08-08 09:43
欧定希望AI能力可以帮助品牌在海外市场取得成功。在现有数据基础上,欧定在海内外回收十万件衬 衫,通过AI算法模型不断调整,致力于为海外用户提供更合体满意的定制设计方案。 朱家勇表示,欧定选择科技作为出海名片,是因为有像华为、DeepSeek这样的公司让世界认识到中国 的"科技"实力。所以欧定也选择用科技打开市场,让AI这样的新质生产力帮助品牌在海外建立起品牌信 任和用户忠诚。"这种信任感,也是中企'出海'获取新增量的起点。" 本报讯(记者袁传玺)8月8日,2025(第十九届)中国品牌节正式开幕。在"AI引擎,全球共振"高峰对话环 节,欧定头等舱高弹衬衫品牌创始人朱家勇受邀与中国传媒大学广告与品牌学院院长赵新利、中央电视 台品牌顾问李光斗、匹克品牌创始人许景南等嘉宾同台论道,探寻中国品牌出海新航道。 会上,朱家勇结合欧定作为新一代男装品牌的出海实践,分享了如何以技术创新+情绪价值双轮驱动, 打破中国服装企业低价出海路径,进军全球中高端市场的经验与思考。 朱家勇在中国品牌节对话环节中表示,中国服装公司在海外依然普遍处于"拼利润"阶段。"以Shein、 Temu等平台为代表的热销商品价格区间仅为9至24美元,比ZA ...
OpenAI o3封王,4比0横扫马斯克Grok 4,全球大模型对抗赛完美收官
3 6 Ke· 2025-08-08 09:29
Group 1 - OpenAI's o3 won the inaugural Kaggle AI Chess Championship by decisively defeating the favorite Grok 4 with a score of 4-0, marking a significant achievement in AI competition [1][11] - The tournament was hosted by Google's Kaggle platform, aiming to evaluate large models' critical thinking, strategic planning, and adaptability in a complex game environment [4][25] - The participating AI players included top models from OpenAI, xAI, Google, Anthropic, DeepSeek, and Moonshot, showcasing a competitive landscape [4][6] Group 2 - The competition rules were designed to challenge AI models by prohibiting the use of professional chess engines and requiring decisions to be made based on the models' reasoning capabilities [6][12] - The semifinals featured a notable match between Grok 4 and Google’s Gemini Pro, with Grok narrowly winning 3-2, while o3 easily defeated o4 mini 4-0 [6][18] - The final match saw Grok 4 making critical mistakes early on, allowing o3 to maintain a clear and stable strategy throughout the game [9][12] Group 3 - The championship victory established o3 as the "undefeated champion," having not lost a single game throughout the tournament [11][17] - Magnus Carlsen, the world chess champion, commented on the performance levels of the AI models, suggesting that o3's skill was comparable to a 1200 rating, while Grok 4 was around 800, indicating a significant gap from top human players [21][23] - Kaggle plans to use the AI chess tournament as a continuous evaluation standard, with future expansions into other complex games like Go and simulation games [25][20]
大模型落地企业端:开源闭源之争未终结 | 海斌访谈
Di Yi Cai Jing· 2025-08-08 08:53
Core Insights - The industry application of large models is expected to experience explosive growth in the first half of 2025, with companies like Alibaba, Jiyue Xingchen, and Baidu leading the commercialization efforts [1][3] - Open-source models have gained popularity in China, but the competition between open-source and closed-source models continues as companies seek to implement large models in specific industries [1][7] Group 1: Company Performance - Yaxin Technology has capitalized on the initial wave of large model applications, reporting a revenue of 26 million yuan in AI model application and delivery for the first half of 2025, a staggering 76-fold increase year-on-year [3] - Yaxin Technology has signed contracts worth 70 million yuan, marking a 78-fold increase compared to the previous year, and is collaborating with major cloud providers to develop industry-specific large model solutions [3] - Jiyue Xingchen aims to achieve a commercial revenue of 1 billion yuan this year, focusing on both foundational models and applications, with significant partnerships in the mobile phone and automotive sectors [4] Group 2: Market Dynamics - The demand for large models is more pronounced in the enterprise sector compared to individual consumers, as a 10% efficiency improvement can significantly impact market competitiveness for businesses [5] - The open-source model offers free access but lacks the support of original manufacturers, which can slow down iteration speed compared to closed-source models [8] - Many enterprises prefer private deployment of large models for data protection, but this approach can lead to slow iteration and high costs, as companies often struggle to achieve successful implementation [8][9] Group 3: Competitive Landscape - The competition between open-source and closed-source models is affecting business models, with some companies like Jiyue Xingchen suggesting that certain business models, such as customized delivery, may be unsustainable [9][10] - The pricing war initiated by major companies has significantly reduced the cost of APIs, making it challenging for startup companies to rely on token-based revenue models [9][10]
赚大钱没那么容易了
Hu Xiu· 2025-08-08 06:55
Core Insights - The current investment landscape reflects a nostalgia for the "golden era" of mobile internet, with many investors feeling they missed out on significant opportunities during that time [1][2] - The investment community is urged to move beyond this nostalgia and recognize that every era presents unique opportunities, even if they differ from past experiences [2][12] Group 1: Investment Trends - The investment cycle in China tends to present new opportunities approximately every three years, suggesting that current investors should remain open to emerging trends [2][10] - The rise of generative AI and embodied intelligence is reshaping the investment landscape, with significant capital flowing into these sectors despite the inherent risks [4][5] - The investment community is increasingly focused on long-term partnerships with companies, moving from a short-term profit mindset to a more sustainable investment approach [7][10] Group 2: Market Dynamics - The current market is characterized by a concentration of capital in a few high-profile sectors, leading to a "winner-takes-all" scenario where most funds are directed towards a limited number of opportunities [5][10] - The investment cycle has lengthened, particularly in hard tech sectors like AI and robotics, requiring patience and a long-term vision from investors [6][9] - The trend of "patient capital" is emerging as a response to the challenges faced in the current investment environment, emphasizing the importance of supporting companies through their growth phases [10][12] Group 3: Future Outlook - There is a belief among some investors that a new wave of opportunities, particularly in AI, could surpass the previous mobile internet boom, although this remains to be validated [12][13] - The increasing barriers to entry in the investment space suggest that achieving high returns will become more challenging, necessitating a shift in investor mindset [16] - The evolving landscape is prompting a reevaluation of the role of venture capitalists, with a focus on creating social value alongside financial returns [17]
消息称百度计划8月底前发布AI推理新模型,未来几个月推文心5.0
Feng Huang Wang· 2025-08-08 06:33
Core Insights - Baidu plans to launch a new inference model by the end of August 2025 to handle more complex tasks and compete with companies like DeepSeek and OpenAI [1] - The company is set to release an updated version of its core foundational model, named Ernie 5.0, in the coming months [1] - Baidu's Ernie 4.5, released in March, is touted as the "strongest" model, featuring significant improvements in multimodal understanding, text, and logical reasoning, outperforming GPT-4.5 in several tests while offering API call prices at only 1% of GPT-4.5 [1] - The Ernie X1 model is designed to compete with DeepSeek-R1, supporting multimodal and multi-tool capabilities, with API call prices approximately half that of R1 [1]
GPT-5登场!国产大模型“扎堆上新”,DeepSeek得加速了
Hua Xia Shi Bao· 2025-08-08 05:04
Core Insights - OpenAI has officially launched its new flagship AI model, GPT-5, marking a significant step towards achieving general artificial intelligence (AGI) [2] - The release emphasizes practical applications rather than technical specifications, showcasing improvements in programming, creative writing, and health consultation capabilities [3][5] - The launch of GPT-5 has heightened expectations for competing models, particularly DeepSeek's upcoming R2 model, which has faced delays [2][8] Group 1: GPT-5 Features and Performance - GPT-5 has shown significant enhancements in three key areas: programming, creative writing, and health consultation, with capabilities such as creating responsive websites and identifying potential health issues [3][5] - OpenAI has not disclosed the model parameters, focusing instead on the model's ability to integrate into various real-world applications [3][5] - The model is available in four versions: GPT-5, GPT-5 mini, GPT-5 nano, and GPT-5 chat, with different usage limits and subscription options for consumers [5][6] Group 2: Market Impact and Competition - Following the release of GPT-5, OpenAI's dominance in the AI model market is expected to strengthen, as evidenced by ChatGPT's leading position in user traffic [7][8] - DeepSeek, despite being a previous leader, has seen a decline in user engagement and is under pressure to release its R2 model to remain competitive [8][10] - Other companies in the industry are rapidly launching new models, indicating a highly competitive landscape where DeepSeek must accelerate its development to keep pace [9][10]
全球大模型进化的下一个方向,OpenAI的GPT-5做出来了
3 6 Ke· 2025-08-08 03:57
Core Insights - OpenAI has launched GPT-5, which is described as a significant advancement over its predecessor models, providing capabilities akin to conversing with an expert in various fields [2][3] - GPT-5 consists of two models: a long-thinking version and a high-efficiency version, which can switch automatically based on user queries [3] - Performance benchmarks indicate that GPT-5 outperforms GPT-4, with hallucination rates reduced by six times [3] - The cost of inference for GPT-5 has significantly decreased, with token output reduced by 50%-80% compared to previous models [10] Company Performance - OpenAI remains the leading AI startup globally, with a valuation of $300 billion and cumulative funding exceeding $79.7 billion as of August 2023 [11] - ChatGPT has 180 million daily active users and 5 million paid enterprise users, with 20 million paid individual users as of April 2023 [11] - OpenAI is projected to achieve an annual recurring revenue (ARR) of $12 billion in 2023, representing over 80% year-on-year growth [13] Competitive Landscape - OpenAI faces increasing competition from companies like Google, Anthropic, and xAI in the U.S. market, and from Chinese companies like Alibaba and DeepSeek in the Chinese market [14] - Despite its advantages, OpenAI has received criticism for not meeting public expectations regarding performance improvements with frequent model iterations [14] - OpenAI's valuation is 4.9 times that of its closest competitor, Anthropic, which has an estimated valuation of $61.5 billion [13] Market Trends - The AI application explosion, particularly in the area of Agents, is expected to be a significant trend by 2025, with predictions indicating that 33% of enterprise software will include Agents by 2028 [18] - GPT-5's advancements in multi-modal capabilities and Agent tool usage are seen as crucial for addressing current limitations in AI applications [19] - The competition in the large model space is intensifying, with rapid iterations and updates occurring among major tech companies [21][26] Future Outlook - The release of GPT-5 is anticipated to trigger a new round of competition among tech companies to develop stronger models and acquire larger computational resources [26] - Key areas of focus for future AI development include enhancing multi-modal reasoning, video generation capabilities, and the ability to handle complex multi-step tasks [20][27] - The ongoing race in the large model sector suggests that any performance advantage is temporary, necessitating continuous innovation and adaptation [28]
当中国极客们不再仰望硅谷:本土科技偶像的时代来了 | 深网
Jin Shi Shu Ju· 2025-08-07 12:06
出品丨深网·腾讯新闻小满工作室 "春节后我注意到一个变化,"强脑科技(BrainCo)创始人韩璧丞说,"很多同事的电脑屏保,悄然从马斯克换成了DeepSeek创始人梁文锋。" DeepSeek的爆火,让梁文锋的故事刷屏。低调、谦逊和不事张扬的形象,在流量的狂欢中淬炼出独特光芒,却意外成就了这个时代最稀缺的英雄叙事: 用技术纯粹地改变世界。跟着梁文锋一起爆火的,还有杭州的"六小龙"。 位于余杭区文一西路的强脑科技参展区每日都人潮涌动——投资人、供应商、各地产业基金、政府部门纷至沓来;滨江区宇树科技大厦前同样排起长 队……无一例外,几家公司的门口都有一条温馨提示:"没有付费参访服务!谨防受骗!" 来源:深网腾讯新闻 图: 创业初期,韩璧丞在波士顿的地下室做了一个机器人,可通过意识来对它控制。 文丨薛芳编辑丨张睿 半年光阴流转,这场"科技朝圣热",还在持续。涌动的人群,他们朝圣的圣殿正在从硅谷转向本土。昔日,硅谷的光芒从惠普车库的微光燃起,直至 OpenAI 推开星际之门;而今,本土技术英雄登上了偶像神坛。 身处风暴中心的韩璧丞,把自己剥离于聚光灯之外。"我80%的时间仍在产品研发上,"他语气笃定。正是这份专注,使 ...
DeepSeek的GRPO会导致模型崩溃?看下Qwen3新范式GSPO
机器之心· 2025-08-07 09:42
Core Viewpoint - The article discusses the evolution of reinforcement learning techniques in the post-training phase of large language models (LLMs), highlighting the introduction of Group Sequence Policy Optimization (GSPO) as a solution to the instability issues associated with Group Relative Policy Optimization (GRPO) [2][10][31]. Group 1: Training Phases and Techniques - The training of large language models typically consists of two phases: pre-training and post-training, where the latter focuses on improving the model's understanding and execution of human instructions [1]. - The post-training phase employs reinforcement learning, with initial methods like Reinforcement Learning from Human Feedback (RLHF) being time-consuming and costly due to reliance on human annotators [2][3]. Group 2: Innovations and Comparisons - DeepSeek introduced an automated approach to RLHF, significantly reducing costs and improving efficiency by allowing the model to learn through reward signals rather than manual evaluations [2]. - The DeepSeek team proposed the Group Relative Policy Optimization (GRPO) algorithm, which they believe is more effective than the Proximal Policy Optimization (PPO) used by OpenAI in ChatGPT [3][5]. Group 3: Issues with GRPO - The Qwen team identified serious stability issues with GRPO, particularly due to its reliance on token-level importance sampling, which can lead to high variance and training instability [10][11][12]. - The instability arises from the incorrect application of importance sampling weights at the token level, which can accumulate high variance in long sequences, exacerbating the training challenges [15][16][17]. Group 4: Introduction of GSPO - To address the issues with GRPO, the Qwen team proposed the Group Sequence Policy Optimization (GSPO), which utilizes sequence-level importance sampling to enhance training stability [10][22][31]. - GSPO's design mitigates the accumulation of variance seen in token-level sampling, leading to improved training efficiency and stability [23][24]. Group 5: Experimental Evidence and Advantages - Experimental results demonstrated that GSPO outperformed GRPO in various tasks, showcasing better scalability and efficiency in training [20][30]. - The Qwen team highlighted that GSPO simplifies the training of Mixture-of-Experts (MoE) models by eliminating the need for auxiliary strategies like Routing Replay, which were necessary for GRPO to achieve stable convergence [25][27][30].