Scaling Laws(规模定律)

Search documents
DeepSeek开源引领AI普惠化浪潮
Wind万得· 2025-03-02 22:40
Core Insights - The article discusses the rapid advancements in AI, particularly focusing on DeepSeek's open-source strategy and the release of OpenAI's GPT-4.5 model, highlighting the competitive landscape in the AI large model sector [1][9]. Group 1: DeepSeek's Open-Source Strategy - DeepSeek, established in 2023, has released several products, including DeepSeek R1, which offers performance comparable to leading closed-source models at a significantly lower training cost of approximately $557.6 million [2][5]. - The open-source initiative by DeepSeek, including the release of code libraries like FlashMLA and DeepEP, aims to lower the development barrier for AI models and enhance computational efficiency [5][6]. - The performance of DeepSeek R1 has led to a rapid user growth of 100 million within just seven days post-launch, marking it as the fastest-growing AI application globally [7]. Group 2: Global AI Large Model Progress - The AI large model sector is experiencing significant growth, with DeepSeek's low-cost models challenging existing players like Kimi, which saw only a 28% increase in active users compared to DeepSeek's 750% growth [7]. - OpenAI's GPT-4.5, released on February 28, 2025, is touted as the largest and most knowledgeable chat model to date, with a high cost structure that raises questions about its performance relative to its price [9][10]. - The competitive landscape is shifting, with DeepSeek's open-source approach prompting other companies, including OpenAI, to consider similar strategies to maintain market relevance [13]. Group 3: AI Large Model Investment Dynamics - The emergence of low-cost, high-performance models like those from DeepSeek is reshaping investment dynamics, allowing smaller firms to enter the market and focus on innovation rather than heavy capital investment [14][15]. - The article notes a trend where investment focus is shifting from infrastructure to application scenarios, with significant funding opportunities in vertical applications such as finance and healthcare [15]. - Recent funding events in the AI large model sector indicate a growing interest, with several companies securing substantial investments, reflecting the market's evolving landscape [16][17].
对话香港大学马毅:“如果相信只靠 Scaling Laws 就能实现 AGI,你该改行了”
晚点LatePost· 2024-06-04 10:05
文丨程曼祺 编辑丨宋玮 黄俊杰 当大部分人都相信一件事或趋势时,不同意的人可以选择沉默,也可以大声说出来。前者是少数派中的多数派,后者少数派中的少数派。 马毅就是一个少数派中的少数派。 自 2000 年从伯克利大学博士毕业以来,马毅先后任职于伊利诺伊大学香槟分校(UIUC)、微软亚研院、上海科技大学、伯克利大学和香港大 学,现担任香港大学计算机系主任和数据科学研究院院长。 他最早将 "压缩感知" 技术应用于计算机视觉领域,在人脸识别、物体分类等任务上产生了巨大影响。 知名 AI 学者李飞飞是马毅在 UIUC 时参与招聘的第一个华人助理教授,ResNet 一作何恺明是马毅在微软亚研院负责视觉组时招的第一个新员 工。 少数派中的少数派。 马毅公开表达时直言不讳。AI 业界惊叹于 GPT 等大模型的威力,担心 AI 可能毁灭人类,如图灵奖得主杰弗里·辛顿(Geoffrey Hinton) 和 OpenAI 发起者之一伊隆·马斯克(Elon Musk)就多次将 AI 类比为原子弹,呼吁监管。 "说现在的 AI 危险的人,要么是无知,要么是别有目的。" 马毅在 twitter 上回应 AI 威胁论。 强烈的观点来自他对 ...