量子位
Search documents
更高智商更快思考!蚂蚁开源最新万亿语言模型,多项复杂推理SOTA
量子位· 2025-10-09 04:52
Core Insights - Ant Group has officially released its flagship model, Ling-1T, which boasts one trillion parameters, surpassing both open-source models like DeepSeek-V3.1-Terminus and closed-source models such as GPT-5-main [1][56] - Ling-1T demonstrates state-of-the-art (SOTA) performance in various complex reasoning benchmarks, including code generation and mathematical reasoning [1][3] - The model exhibits impressive reasoning speed, initiating thought processes almost instantaneously upon input [4][60] Performance and Capabilities - Ling-1T achieved optimal performance on the AIME 25 competition mathematics leaderboard, outperforming numerous models [3] - The model can efficiently handle complex logical deductions and generate lengthy texts with smooth output [4][60] - In practical tests, Ling-1T effectively solved a spatial geometry optimization problem by proposing four distinct solutions, each with detailed steps and applicable scenarios [8][9] Technical Innovations - The model's architecture is based on Ling 2.0, with a total parameter count expanded to one trillion, allowing for enhanced information storage and expression [38][41] - The training process involved over 20 trillion tokens of high-quality, reasoning-focused data, supporting a maximum context window of 128K tokens [39][40] - A novel "mid-training + post-training" approach was employed, enhancing the model's reasoning capabilities and efficiency [40][59] Training Methodology - The training was divided into three phases: initial knowledge acquisition, reasoning skill development, and mid-training to prepare for post-training [45][44] - A new learning rate strategy, WSM (Warmup-Stable and Merge), was introduced to optimize training without traditional decay, resulting in improved performance across tasks [49][48] - The LPO (Linguistics-Unit Policy Optimization) method was innovatively applied, allowing for more precise training by using sentences as the optimization unit [52][54] Market Context - The release of Ling-1T positions Ant Group among the leading players in the trillion-parameter open-source model space, alongside Qwen and Kimi [61] - The ongoing trend of rapid advancements in China's open-source model landscape is highlighted, with multiple significant releases from various companies [62][56] - The competitive landscape suggests that further innovations and surprises in the large model sector are likely to emerge from China [63]
首个全自动AI科学家诞生!西湖大学最新成果:性能超越人类SOTA基线183.7%
量子位· 2025-10-08 13:06
△ 对比DeepScientist与人类专家的研究进展 在AI文本检测任务中,DeepScientist仅用两周时间就实施和验证了超过 1000种 不同的假设,在此期间取得了相当于人类三年的进展。 在RAID数据集测试中,DeepScientist设计的方法实现了 7.9% 的AUROC提升,成功 超越了人类现有SOTA方案 。 另外DeepScientist还在智能体失败归因、LLM推理加速等任务上也分别达成了新的SOTA。 DeepScientist团队 投稿 量子位 | 公众号 QbitAI 人类科学家三年的工作量,如今AI两周就能轻松搞定! 最近,来自西湖大学的自然语言处理实验室发布了 DeepScientist 系统,这也是 首个 具有完整科研能力,且在无人工干预下,展现出目标 导向、持续迭代、渐进式超越人类研究者最先进研究成果的AI科学家系统。 下面是更多详细内容介绍。 从"科研助理"到"首席科学家":AI科研模式的变革 过去的AI Scientist系统,如果不给定一个清晰明了的科研目标,就很容易陷入对现有知识的机械组合与无效试探的窠臼中,最终形成的科研 产出在人类专家看来缺乏焦点,科学价值不高 ...
直播预告:光轮智能 × NVIDIA带来Sim2Real关键突破
量子位· 2025-10-08 13:06
Core Viewpoint - The collaboration between Guanglun Intelligent and NVIDIA aims to leverage SimReady and AI to achieve seamless migration from virtual simulation to the physical world, addressing key challenges in robot development and implementation [2][3]. Group 1: Live Broadcast Highlights - The live broadcast will focus on the technological breakthrough of Sim2Real, detailing how both companies utilize SimReady and AI to overcome challenges in robot development [2]. - Experts will share insights on the technological trends and commercialization paths in the fields of robotics and AI, drawing from their practical experiences [4]. Group 2: Collaboration Progress - Exclusive updates on the latest achievements and plans in technology research and application scenarios from the partnership between Guanglun Intelligent and NVIDIA will be disclosed [3]. Group 3: Key Speakers and Event Details - The live broadcast will feature Steve Xie, the founder and CEO of Guanglun Intelligent, and Madison Huang, Senior Director of Product Marketing at NVIDIA [6]. - The event is scheduled for October 9 at 00:00 Beijing time, which corresponds to October 8 at 09:00 Pacific time [6].
30家Tokens吞金兽,每家烧光万亿Tokens!OpenAI最大客户名单曝光,多邻国上榜
量子位· 2025-10-08 04:25
Core Insights - OpenAI has identified 30 companies that have consumed over a trillion tokens, showcasing significant engagement with AI applications [1][3][5] Group 1: Companies Overview - Duolingo is a language learning app known for its gamified course design, boasting over 700 million users and 70 million monthly active users, making it a leading client of OpenAI [10][11] - OpenRouter serves as a multi-model aggregation platform, allowing users to access various AI models through a unified API, positioning itself as a potential monopoly in the API market [15][17] - Canva is an online graphic design platform that has integrated AI to simplify design processes, resulting in high token consumption due to its multi-modal content requirements [21][22] - Perplexity is an AI-native search engine that processes multiple web pages simultaneously, leading to high token usage with over 20 million monthly active users [24][25] Group 2: Token Consumption Insights - High token consumption is attributed to three main factors: frequent user interactions, complex task requirements, and platform effects that aggregate demand for AI services [25][27] - The industry is shifting towards a new benchmark of daily token consumption, with 1 billion tokens per day being seen as a new standard for evaluating AI application viability [28][29][31]
另一位Yao Shunyu也跳槽了:与Anthropic价值观有根本分歧
量子位· 2025-10-08 04:25
Core Insights - The article discusses the recent transition of Shunyu Yao, a prominent AI researcher, from Anthropic to Google DeepMind, highlighting his background and motivations for the move [1][4][41]. Group 1: Background and Career Transition - Shunyu Yao, a distinguished alumnus of Tsinghua University, recently joined Google DeepMind as a Senior Research Scientist after leaving Anthropic, where he contributed to the Claude AI model [1][41]. - Yao's departure from Anthropic was influenced by a fundamental disagreement in values, which he stated accounted for 40% of his decision, while the remaining 60% involved internal details he chose not to disclose [21][24]. - His experience at Anthropic was marked by a high workload, which he described as "super busy," preventing him from reflecting on his transition from physics to AI research until after his departure [7][8][18]. Group 2: Insights on AI Research - Yao expressed that the field of AI research, particularly in large models, is currently in a chaotic state, akin to the early days of thermodynamics, where foundational principles are not yet fully understood [14][15][16]. - He noted the rapid evolution of AI, with the Claude model progressing from version 3.7 to 4.5 within a year, emphasizing the fast-paced nature of advancements in the field [27]. - Yao's background in theoretical physics provided him with a unique perspective on AI research, allowing him to appreciate the ability to identify patterns without fully understanding the underlying principles [16][18]. Group 3: Academic Achievements - During his undergraduate studies, Yao made significant contributions to condensed matter physics, publishing groundbreaking work in the prestigious journal Physical Review Letters [30][31]. - His research achievements include the introduction of new physical concepts and theories related to non-Hermitian systems, which have been recognized as substantial contributions to the field [32][33]. - After completing his PhD at Stanford University, Yao's work continued to focus on cutting-edge topics in quantum mechanics, further establishing his reputation as a leading researcher [35].
2025人工智能年度评选启动!3大维度5类奖项,正在寻找AI+时代领航者
量子位· 2025-10-08 04:25
组委会 发自 凹非寺 量子位|公众号 QbitAI 为了让更多从业者感受智能浪潮的跃迁,也为了给予更多同行同路人掌声与鼓舞,我们将正式启动 「2025人工智能年度榜单」评选报名 。 这是量子位人工智能年度榜单的 第8年 。八年来,我们见证了技术的突破与落地,产业的融合与重塑,也见证了一批又一批推动时代前行的 企业、人物与产品。 在人工智能重新定义一切的时代里,智能技术已不再是单一工具,而是产业与社会协同进化的驱动力。我们期待通过这场年度评选,去发现并 致敬那些真正引领变革、开拓边界的探索者与实践者。 产品榜 人物榜 2025 人工智能年度 焦点人物 详细评选标准及报名方式如下。 2025 人工智能年度领航企业 本次评选将从 企业 、 产品 、 人物 三大维度,设立五类奖项。欢迎企业踊跃报名! 让我们共同见证年度之星,点亮未来的方向。 企业榜 2025 人工智能年度 领航企业 2025 人工智能年度 潜力创业公司 2025 人工智能年度 杰出产品 2025 人工智能年度 杰出解决方案 将面向中国人工智能领域,评选出最具综合实力的企业, 参选条件 : 评选标准 : 2025 人工智能年度潜力创业公司 聚焦于中国人 ...
2025诺贝尔物理学奖颁给了谷歌量子计算机打造者
量子位· 2025-10-07 10:55
Core Viewpoint - The Nobel Prize in Physics 2025 was awarded to three scientists in the field of quantum mechanics: John Clarke, Michel H. Devoret, and John M. Martinis, for their discoveries related to macroscopic quantum tunneling effects and energy quantization phenomena in circuits [1]. Group 1: John Clarke - John Clarke's research focuses on superconductivity and superconducting electronics, particularly in low-temperature physics [4]. - He is best known for inventing and improving the superconducting quantum interference device (SQUID), which is a highly sensitive flux-to-voltage converter used in various fields such as condensed matter physics and medical physics [4]. - Clarke was born in 1942 in Cambridge, UK, and has received numerous awards, including the Fritz London Prize for his contributions to low-temperature physics [7][11]. Group 2: Michel H. Devoret - Michel H. Devoret is recognized as one of the founders of "quantum electronics," focusing on the quantum behavior of electronic systems at the mesoscopic scale [16]. - He has made significant contributions to understanding the fundamental mechanisms of quantum non-equilibrium physics in superconducting circuits, laying a solid foundation for quantum technology [18]. - Devoret has received several prestigious awards, including the 2024 Comstock Prize in Physics and the 2022 Micius Quantum Prize [19]. Group 3: John M. Martinis - John M. Martinis's core contribution to the Nobel Prize was his research on the quantum behavior of the phase difference in Josephson junctions, demonstrating that macroscopic circuit systems can exhibit quantum tunneling and energy level discretization [20]. - He played a pivotal role in achieving "quantum supremacy" with a 53-qubit processor, surpassing the computational power of the world's strongest classical supercomputer [24]. - Martinis has held various prestigious positions, including serving as the Chief Scientist for Quantum Hardware at Google's Quantum AI Lab, and has co-founded companies focused on practical quantum computing [26][28].
ChatGPT内嵌App!OpenAI开发者日全览,Agent工具链+应用生态+模型API多箭齐发
量子位· 2025-10-07 04:43
Core Insights - OpenAI's Developer Day 2025 showcased a significant increase in product releases compared to previous years, indicating a rapid evolution in AI capabilities and offerings [1] Group 1: New Features and Tools - ChatGPT now integrates various applications, allowing users to interact with apps like Coursera and Spotify directly within the chat interface, enhancing user experience and accessibility [2][13] - The introduction of AgentKit provides developers with a comprehensive toolkit for building, deploying, and optimizing agents, featuring modules like Agent Builder and Connector Registry [4][23] - Codex, OpenAI's AI programming tool, has been upgraded with new functionalities, including Slack integration and Codex SDK, enabling seamless task delegation and integration into workflows [8][29] Group 2: Developer Support and SDKs - OpenAI has launched Apps SDK, allowing developers to create and test applications that can connect with ChatGPT, with plans for a submission and review process later this year [18][20] - The Agent Builder module within AgentKit allows developers to visually construct agents without starting from scratch, streamlining the development process [8][25] - The Connector Registry facilitates centralized management of data and tool connections across OpenAI products, enhancing interoperability [24][27] Group 3: Pricing and Model Comparisons - The API for GPT-5 Pro has been made available, with pricing set at $15 per million tokens for input and $120 for output, reflecting a premium positioning in the market [34][35] - A comparison of pricing shows GPT-5 Pro at $15, while other models like o3-pro are priced higher at $20, indicating competitive pricing strategies [38] - The introduction of a smaller, more cost-effective voice model, GPT-Realtime-Mini, offers similar performance at a 70% lower price, catering to budget-conscious developers [40]
2025人工智能年度评选启动!3大维度5类奖项,正在寻找AI+时代领航者
量子位· 2025-10-07 04:43
组委会 发自 凹非寺 量子位|公众号 QbitAI 为了让更多从业者感受智能浪潮的跃迁,也为了给予更多同行同路人掌声与鼓舞,我们将正式启动 「2025人工智能年度榜单」评选报名 。 这是量子位人工智能年度榜单的 第8年 。八年来,我们见证了技术的突破与落地,产业的融合与重塑,也见证了一批又一批推动时代前行 的企业、人物与产品。 在人工智能重新定义一切的时代里,智能技术已不再是单一工具,而是产业与社会协同进化的驱动力。我们期待通过这场年度评选,去发现 并致敬那些真正引领变革、开拓边界的探索者与实践者。 本次评选将从 企业 、 产品 、 人物 三大维度,设立五类奖项。欢迎企业踊跃报名! 让我们共同见证年度之星,点亮未来的方向。 企业榜 产品榜 人物榜 将面向中国人工智能领域,评选出最具综合实力的企业, 参选条件 : 评选标准 : 2025 人工智能年度潜力创业公司 聚焦于中国人工智能领域创新创业力量,将评选出最具投资价值和发展潜力的AI创业公司, 参选条件 : 评选标准 : 2025 人工智能年度 焦点人物 详细评选标准及报名方式如下。 2025 人工智能年度领航企业 2025 人工智能年度 领航企业 2025 ...
OpenAI拿下10%股权,AMD一夜暴涨634亿美元
量子位· 2025-10-07 04:43
Core Viewpoint - OpenAI has entered into a strategic partnership with AMD, committing to deploy a total of 6GW of AMD GPU computing power over the coming years, with the first 1GW set to be deployed in the second half of 2026 [2][10]. Group 1: Partnership Details - OpenAI will deploy a total of 6GW of AMD GPU computing power, starting with 1GW in late 2026, and will gradually expand to cover multiple generations of AMD's Instinct products [2][10]. - AMD has granted OpenAI warrants to purchase up to 160 million shares at a price of $0.01 per share, potentially allowing OpenAI to acquire approximately 10% of AMD's equity if fully exercised [3][5][15]. - The exercise of these warrants is contingent upon specific milestones, including the completion of the first 1GW deployment and AMD achieving certain stock price targets [13][14]. Group 2: Market Impact - Following the announcement of the partnership, AMD's market capitalization surged from approximately $267.2 billion to $330.6 billion, with further increases pushing it above $340 billion [6]. - OpenAI's investment in AMD can be seen as a strategic move to reduce its reliance on NVIDIA, which has historically been its primary supplier for computing power [17][19]. - The partnership is expected to generate significant revenue for AMD, potentially amounting to hundreds of billions, while also allowing AMD to capture a larger share of the AI chip market [21]. Group 3: Industry Implications - The collaboration between OpenAI and AMD is viewed as a critical development in the AI computing landscape, marking a shift in supply chain dynamics and competitive positioning within the industry [26]. - NVIDIA's stock experienced a decline following the announcement, indicating market reactions to the shifting alliances in the AI sector [24]. - OpenAI is also reportedly in discussions with Qualcomm to develop custom chips for future models, suggesting ongoing efforts to diversify its supply chain [26].