Workflow
Scaling Laws
icon
Search documents
2024 到 2025,《晚点》与闫俊杰的两次访谈,记录一条纯草根 AI 创业之路
晚点LatePost· 2026-01-09 02:38
Core Insights - MiniMax aims to contribute significantly to the improvement of AI in the industry, focusing on grassroots AI entrepreneurship despite challenges ahead [3][4] - The company has set ambitious goals for 2024 and 2025, including achieving technical capabilities comparable to GPT-4 and increasing user scale tenfold [4][36] - MiniMax emphasizes the importance of creating AI products that serve ordinary people, rather than focusing solely on large clients [5][9] Group 1: Company Vision and Strategy - MiniMax's vision is to create AI that is accessible to everyone, encapsulated in the phrase "Intelligence with everyone" [5][51] - The company believes that AGI should be a product used daily by ordinary people, rather than a powerful tool for a select few [9][51] - MiniMax's approach involves a dual focus on both technology and product development from the outset, contrary to the belief that startups should prioritize one over the other [14][15] Group 2: Technical Development and Challenges - The company has adopted a mixed expert (MoE) model for its large-scale AI, which is seen as a gamble compared to the more stable dense models used by competitors [10][20] - MiniMax faced significant challenges during the development of its MoE model, including multiple failures and the need for iterative learning [11][19] - The company recognizes that improving model performance is crucial and that many advancements come from the model itself rather than product features [19][34] Group 3: Market Position and Competition - MiniMax believes that the AI industry will see multiple companies capable of producing models similar to GPT-4, indicating a competitive landscape [41][37] - The company asserts that relying solely on funding for growth is not sustainable and emphasizes the importance of serving users and generating revenue [37][38] - MiniMax aims to differentiate itself by focusing on technical innovation and product development rather than merely increasing user numbers [57] Group 4: Future Outlook and Industry Trends - The company anticipates that the AI landscape will evolve rapidly, with significant advancements in model capabilities and user engagement [41][56] - MiniMax acknowledges the importance of open-sourcing technology to accelerate innovation and improve its technical brand [54][56] - The company is committed to continuous improvement in both technology and user experience, aiming to adapt to changing market demands [28][36]
KAN作者刘子鸣:AI还没等到它的「牛顿」
机器之心· 2026-01-02 05:00
大家新年快乐!今天和大家分享 KAN 作者刘子鸣最新发布的一篇博客。 过去的一年,我们见证了 Scaling Laws 持续发力,模型能力不断刷新天花板。虽然 AI 社区从未停止对可解释性的探索,但在工程进展如此迅猛的当下,我们对模 型内部机制的理解,似乎总是慢了半拍。 刘子鸣在博客中,借用科学史提出了一个发人深省的观点:如果参照物理学的发展史, 今天的 AI 可能还远未在这个时代的「牛顿力学」时刻,而是仍处于「第谷 (Tycho)时代」, 一个拥有大量观测和实验,却尚未来得及系统性总结规律的早期阶段。 我们拥有海量的实验数据和强大的模型,却缺乏对底层现象的系统性梳理。他指出,为了追求短期性能指标,AI 领域跳过了「理解」这一关键步骤,这实际上是 在背负高昂的「认知债务」。 机器之心编辑部 更为矛盾的是,当前的学术发表机制往往偏爱「完美的故事」或「巨大的性能提升」,导致大量像「第谷的观测记录」那样碎片化但极具价值的「AI 现象学」工 作被忽视。 为此,刘子鸣呼吁建立一种「平易近人的现象学」: 不以即时应用为导向,回归到用 Toy Model(玩具模型)进行可控的、多视角的假设驱动探索。 他宣布将身 体力行,通 ...
OpenAI,65倍,8300亿美元
Ge Long Hui· 2025-12-20 11:39
SpaceX将IPO的热度还没过,OpenAI又整了个大活: 计划在新一轮融资中募集1000亿美元。 若能筹得目标金额,OpenAI的估值可将飙升至8300亿美元。 而在两天以前,这个数字是5000亿美元。 短短48小时内,涨了3300亿美元…… 这就是OpenAI。 2023年你觉得290亿美元的估值很贵,2024年你觉得1570亿美元的估值是泡沫…… 当2025年底面对8300亿美元的估值时,又能说出什么呢? 01 65倍"溢价" 传统的SaaS公司估值通常看市销率或市盈率。 根据Techloy及WSJ的报道,OpenAI在2025年的预计营收约为127亿美元。 巴克莱银行测算,这项技术落地后,GPT-6的训练效率将提升10倍,但前期需要430亿美元的算力储 备,这正是千亿融资的核心用途之一。 就像SpaceX的火箭,虽然烧钱,但一旦成功脱离地心引力,投资人是真的愿意买账。 按8300亿美元估值算,其市销率高达65倍。 作为对比,在SaaS最疯狂的2021年,Snowflake的市销率也就是50-80倍左右,现在大部分成熟的SaaS公 司已经回落到10-15倍区间。 那么,山姆奥特曼凭什么让投资人接受65 ...
AI 价值链-Google Gemini 3 Pro、Claude Opus 4.5、Grok 4.1 与 DeepSeek 3.2…… 谁才是真正的领导者?这意味着什么
2025-12-12 02:19
Summary of Key Points from the Conference Call Industry Overview - The conference call discusses the U.S. semiconductor and internet industries, focusing on the AI value chain and the competition among leading AI models: Google Gemini 3 Pro, Claude Opus 4.5, Grok 4.1, and DeepSeek 3.2 [1][2][3]. Core Insights and Arguments - **Model Performance Comparison**: - Gemini 3 Pro and Claude Opus 4.5 are viewed as closely matched, while skepticism surrounds DeepSeek's claim to leadership. All three models have published benchmarks that favor their performance, but third-party benchmarking is still ongoing [3][4][14]. - Early results indicate that Gemini and Claude are neck and neck, with Grok 4.1 outperforming GPT-5 [3][14]. - **Scaling Laws**: - The scaling laws for AI models remain intact, suggesting renewed confidence among AI labs and their investors to expand AI infrastructure. Continued access to superior compute resources and unique data is essential for scaling [4][15]. - **OpenAI's Challenges**: - OpenAI is reportedly lagging behind its competitors, facing issues such as disappointing GPT-5 performance, failed pre-training runs, and significant talent departures. This situation raises concerns about its future leadership in the AI space [6][18][19]. - **Compute Infrastructure**: - The competition between GPUs and TPUs is highlighted, with concerns about Nvidia's market position. The defining theme is compute scarcity, which benefits both GPU and ASIC technologies [7][20][22]. - **Market Dynamics**: - There is a potential shift from model benchmarking to product adoption and monetization, as evidenced by Gemini's inability to displace ChatGPT despite superior performance [8][21]. Important but Overlooked Content - **DeepSeek's Position**: - DeepSeek's ability to quickly follow leading models raises concerns about the sustainability of frontier model economics if model improvement slows down. However, current model improvements are still strong [5][17]. - **Investment Implications**: - Nvidia (NVDA) is rated as outperforming with a target price of $275, citing a significant datacenter opportunity. Broadcom (AVGO) is also rated outperforming with a target price of $400, driven by a strong AI trajectory. AMD (AMD) is rated market perform with a target price of $200, contingent on OpenAI's success [10][11][12]. - **Consumer Behavior**: - OpenAI's large user base, with 800 million monthly active users, may provide a competitive moat despite its current challenges. The sticky nature of consumer behavior in technology could offer OpenAI some breathing room [18][19]. - **Future Monitoring**: - Investors are advised to closely monitor developments in the AI space, particularly regarding OpenAI's performance and the broader implications for the semiconductor and AI infrastructure markets [19][21]. This summary encapsulates the key points discussed in the conference call, providing insights into the competitive landscape of AI models, the challenges faced by leading companies, and the implications for investors in the semiconductor and AI sectors.
对话AI“老炮”邹阳:AGI不是你该关心的,现在的技术足够改变世界
3 6 Ke· 2025-12-09 12:28
距离ChatGPT的横空出世,已经过去了三年。 三年时间,市场对于AI的热情并没有丝毫褪去,反而赌注越下越重。但在资本持续加码的喧嚣背后,盲目的兴奋感正 在被一种具体的焦虑所取代:这波AI浪潮的终点到底在哪?什么才是落地的正确姿势? 在大多数人还在仰望AGI(通用人工智能)的顶峰,或者迷失在对话框里的时候,未来式智能联合创始人兼COO邹阳 给出了一个冷静且独特的判断。 "沿着大语言模型这条路大概率不能登顶AGI。不过这不重要我也不关心。重要的是,在半山腰足够改变世界。" 在邹阳看来,半山腰的技术能力,已经具备了全部产品落地产生价值的可能性。AI真正的战场不在于做一个陪聊的聊 天机器人,而在于潜入产业流程,成为企业里那80%重复、高频、规则与判断聚合的脑力工作的"外接大脑"。 这种判断力,源自邹阳在AI行业深耕多年的体感。 邹阳算得上经历了AI 从 1.0 走向 2.0 的完整周期。从魅族 AI 实验室,到搜狗语音交互技术中心,再到阿里巴巴达摩 院,他的职业路径几乎与行业演进同步。 在达摩院期间,他负责的智能语音语义产品线连续多年在国内AI 云服务市场占据第一,属于那批最早把技术推向大规 模商用的人。 他见过人 ...
Ilya辟谣Scaling Law终结论
AI前线· 2025-11-30 05:33
Core Insights - The era of relying solely on scaling resources to achieve breakthroughs in AI capabilities may be over, as stated by Ilya Sutskever, former chief scientist of OpenAI [2] - Current AI technologies can still produce significant economic and social impacts, even without further breakthroughs [5] - The consensus among experts is that achieving Artificial General Intelligence (AGI) may require more breakthroughs, particularly in continuous learning and sample efficiency, likely within the next 20 years [5] Group 1 - Ilya Sutskever emphasized that the belief in "bigger is better" for AI development is diminishing, indicating a shift back to a research-driven era [16][42] - The current models exhibit a "jaggedness" in performance, excelling in benchmarks but struggling with real-world tasks, highlighting a gap in generalization capabilities [16][20] - The focus on scaling has led to a situation where the number of companies exceeds the number of novel ideas, suggesting a need for innovative thinking in AI research [60] Group 2 - The discussion on the importance of emotional intelligence in humans was compared to the value function in AI, suggesting that emotions play a crucial role in decision-making processes [31][39] - Sutskever pointed out that the evolution of human capabilities in areas like vision and motor skills provides a strong prior knowledge that current AI lacks [49] - The potential for rapid economic growth through the deployment of advanced AI systems was highlighted, with the caveat that regulatory mechanisms could influence this growth [82]
Nvidia says its GPUs are a 'generation ahead' of Google's AI chips
CNBC· 2025-11-25 18:29
Core Viewpoint - Nvidia asserts that its technology remains a generation ahead of the industry, despite concerns regarding competition from Google's AI chips [1][2]. Company Position - Nvidia claims its chips are more flexible and powerful than ASIC chips, such as Google's TPUs, emphasizing that its latest generation, known as Blackwell, offers greater performance, versatility, and fungibility [3]. - Nvidia holds over 90% of the market for artificial intelligence chips with its graphics processors, although Google's in-house chips have gained attention as a potential alternative [4]. Market Dynamics - Nvidia's shares fell 3% following reports that Meta, a key customer, might partner with Google to utilize its tensor processing units for data centers [2]. - Google recently launched Gemini 3, a state-of-the-art AI model trained on its TPUs, which has been well-received [5]. Industry Trends - Nvidia's CEO Jensen Huang noted that the theory of "scaling laws" in AI development, which suggests that using more chips and data leads to more powerful AI models, remains valid and will drive further demand for Nvidia's chips and systems [6].
Janus Henderson's Denny Fish on AI: We'll continue to see models ‘leapfrogging each other'
Youtube· 2025-11-25 18:23
Core Insights - The technology sector is currently in a competitive race towards achieving artificial general intelligence, with major players like Google, Meta, Anthropic, OpenAI, and Microsoft all striving to advance their models [2] - Recent earnings reports from Nvidia and the launch of Gemini 3 indicate that scaling laws in GPU and TPU demand remain strong, suggesting continued growth in infrastructure needs for AI training [2][4] - The market may experience fluctuations, but the long-term outlook for infrastructure development remains positive, with hyperscalers increasing capital expenditure expectations [6] Industry Dynamics - The competition among tech companies is expected to lead to continuous advancements in AI models, with each iteration showing significant improvements [2] - Investors should be aware that a slowdown in model advancements could signal diminishing scaling laws, which would impact infrastructure demand [4] - The construction of data centers is subject to natural limitations, such as permitting and power availability, making the current infrastructure build more prolonged compared to past tech booms [6] Market Behavior - The market's reaction to earnings reports can be unpredictable, often reflecting prior expectations rather than the actual results [5] - Despite short-term market fluctuations, the demand for GPUs and TPUs remains high, with significant interest from multiple buyers for each unit produced [6]
The Industry Reacts to Gemini 3...
Matthew Berman· 2025-11-20 02:14
Google dropped Gemini 3 24 hours ago and the industry has been reacting strongly. It is definitely the best model on the planet and I'm going to show you all of the industry reactions right now. First is from Artificial Analysis, the company that runs independent benchmarks against all of the top models. And yes, Gemini 3 is number one. Here's what they have to say. For the first time, Google has a leading language model and it debuts with a threepoint buffer between the second best model GPT 5.1%. And a lo ...
Amazon, Meta, Microsoft, and Google are gambling $320 billion on AI infrastructure. The payoff isn't there yet
Business Insider· 2025-10-07 08:20
Investment and Infrastructure - The Trump administration prioritizes infrastructure development to support the AI revolution, with significant investments expected from major tech companies [1] - Meta plans to invest $600 billion in AI infrastructure by 2028, while OpenAI and Oracle are set to invest $500 billion in a project called Stargate [1] - Amazon anticipates spending over $30 billion on capital expenditures in the next two quarters [1] Economic Impact and Concerns - The business case for AI remains untested, raising concerns about whether revenue from AI products will justify the increasing expenditures [2] - The current spending on AI infrastructure and software has contributed more to GDP growth than consumer spending [8] - There are fears of a potential bubble in the tech sector, with the Nasdaq up 19% this year despite concerns [7] Data Center Growth - An investigation revealed that there are 1,240 data centers in the US, marking a nearly fourfold increase since 2010 [3] - Major energy users like Amazon, Meta, Microsoft, and Google are projected to spend an estimated $320 billion on capital expenditures this year, primarily for AI infrastructure [4] Future Projections and Challenges - Bain estimates that by 2030, annual capital expenditures will reach $500 billion, requiring companies to generate $2 trillion in annual revenue to justify the spending [23] - OpenAI's CFO stated the company expects to triple its revenue to about $13 billion this year, while agreeing to pay Oracle $60 billion annually for data center capacity [24] Financing and Investment Strategies - Companies are increasingly turning to non-traditional financing methods to fund their data center expansions, with Meta raising $29 billion from various investment firms [33] - The structured-credit market is being utilized to finance the data center boom, with developers packaging rental income into bonds for further investment [35] Industry Comparisons and Historical Context - The current AI infrastructure boom is being compared to historical projects like the Apollo space program and the railroad system, highlighting its scale and ambition [9][10] - Past overinvestments in industries like railroads led to significant financial crises, raising concerns about the sustainability of current AI investments [15][30]