量子位
Search documents
算力越高收入越多!OpenAI率先验证AI商业Scaling Law:最新收入200亿美元
量子位· 2026-01-20 01:34
Core Viewpoint - OpenAI's revenue has significantly increased, with annual recurring revenue (ARR) rising from $2 billion to $20 billion over two years, indicating a strong growth trajectory despite high operational costs [2][12]. Revenue and Growth - OpenAI's ARR has surged to $20 billion, reflecting a tenfold increase in revenue projected from 2023 to 2025, alongside a 9.5-fold increase in computing power [2][13]. - The relationship between computing power and revenue is emphasized, where increased investment in computing drives research and model capabilities, leading to higher revenue, which in turn supports further investment [9][12]. Comparison with Competitors - In comparison to a competitor (Claude's parent company), OpenAI's computing power and ARR are significantly larger, with projections showing a growth from 0.2 GW and $2 billion in 2023 to 1.9 GW and over $20 billion by 2025 [14][17]. Operational Costs - OpenAI's operational costs are substantial, with an estimated $7 billion spent on computing resources in 2024, primarily through cloud services from Microsoft [21][22]. - The company is also investing heavily in building its own AI data centers, indicating a long-term strategy to manage costs and enhance capabilities [18][19]. Business Model and Future Plans - OpenAI's business model is evolving, with the introduction of advertising aimed at providing decision support in commercial scenarios, alongside subscription services and API usage [27][30]. - The company plans to launch its first hardware product in the second half of 2026, which is expected to further integrate into its revenue-computing cycle [33][34].
定位大模型「作弊」神经回路!新研究首次揭示:虚假奖励如何精准激活第18-20层记忆
量子位· 2026-01-20 01:34
Core Insights - The article discusses the phenomenon of "Spurious Rewards" in large language models (LLMs) and how they can enhance accuracy even with false reward signals during training [1][2] - It highlights the concept of "Perplexity Paradox," where models show decreased perplexity for answers but increased perplexity for questions, indicating a trade-off between general understanding and specific memorization [3][6] Group 1: Key Findings - The research team identified that the model's internal memory shortcuts are activated by false RLVR, leading to a more efficient retrieval of contaminated knowledge rather than genuine learning [1][6] - The critical memory nodes are located in layers 18-20, which serve as functional anchors for retrieving memorized answers [10][20] - The study utilized various analytical methods, including Path Patching and Jensen-Shannon Divergence (JSD), to pinpoint the layers responsible for memory retrieval and structural adaptation [9][15] Group 2: Mechanisms and Dynamics - The research demonstrated that the model's decision-making process occurs at layers 18-20, where it chooses between reasoning paths and memory shortcuts [23] - The introduction of Neural ODEs allowed the team to model the continuous evolution of hidden states, confirming that separation forces peak at the critical layers [21] - The team successfully manipulated memory retrieval by scaling the activation of specific neurons, demonstrating a dose-dependent relationship in memory retrieval accuracy [25][30] Group 3: Implications and Future Directions - The findings provide new tools for evaluating RLVR effectiveness, suggesting that improvements may be illusory if they stem from memory activation circuits [36] - The research opens new avenues for detecting data contamination through internal neural activation patterns, moving beyond traditional statistical methods [38] - It proposes controllable methods for reducing reliance on contaminated knowledge without retraining the model, paving the way for new techniques in reasoning and decontamination [39]
ChatGPT强行上马广告,因为OpenAI真的太烧钱
量子位· 2026-01-19 09:30
Core Viewpoint - OpenAI is facing a financial crisis, prompting the introduction of advertising in ChatGPT as a potential solution to generate revenue and avoid bankruptcy [7][15][51]. Financial Situation - OpenAI is projected to run out of funds within 18 months, with reports indicating a possible acquisition by larger companies like Microsoft or Amazon [7][15]. - The company raised $40 billion in funding last year, but its expenses are significantly higher, with projected annual burn rates exceeding $8 billion in 2025 and reaching $40 billion by 2028 [10][13]. - OpenAI's revenue for the previous year was only $20 billion, highlighting a substantial financial gap compared to its expenditures [15]. - The AI industry is estimated to have a funding shortfall of at least $800 billion, exacerbating OpenAI's financial challenges [15][16]. Advertising Strategy - OpenAI plans to test advertising in the free version of ChatGPT, marking a shift from a subscription-based revenue model to include advertising income [26][28]. - The ads will be labeled as "sponsored content" and will not affect the objectivity of ChatGPT's responses [27][29]. - OpenAI anticipates generating "low billions" in revenue from advertising by 2026, with plans to scale this income source over time [22][28]. Business Model Expansion - The introduction of advertising is part of OpenAI's broader commercial strategy, which includes subscription services and API usage-based billing [25][41]. - OpenAI's CFO emphasized that the business model should expand in line with the value created by its intelligence [36]. - Future revenue growth is expected to come from various sources, including subscriptions, API usage, and potential new pricing models as AI technology evolves [41]. User Engagement and Growth - OpenAI's weekly and daily active user metrics are at all-time highs, driven by a cycle of investment in computing power, research, and product development [43][44]. - The company is experiencing a 9.5 times increase in computing power from 2023 to 2025, with revenue growth projected to match this increase [46][55].
AI的尽头,是电工(doge)
量子位· 2026-01-19 09:30
Group 1 - The core viewpoint of the article highlights the increasing demand for electricians in the AI era, with an estimated annual shortage of about 81,000 electricians in the U.S. from 2024 to 2034, leading to a projected 9% growth in employment for electricians over the next decade, significantly higher than the average for all occupations [2][3][4] - The surge in job openings is primarily driven by data centers, which are creating a substantial demand for electricians and other blue-collar workers, including plumbers and HVAC technicians [6][8] - Major tech companies are significantly increasing their hiring in the energy sector, with a 34% year-on-year increase in recruitment for 2024, maintaining high levels into 2025, and overall hiring in this sector is approximately 30% higher than before the release of ChatGPT in 2022 [9][10][11][12] Group 2 - The shortage of electricians is exacerbated by a long-standing lack of skilled labor in the construction industry, with many young people being encouraged to pursue white-collar jobs instead of trades, leading to a gap as experienced workers retire [24][28][30] - Training for electricians is becoming more rigorous, with companies preferring to hire fully trained workers rather than apprentices, which further complicates the supply-demand imbalance [34][36] - Tech companies like Google are proactively addressing the shortage by funding training programs to enhance the skills of existing electricians and train new apprentices, aiming to increase the overall workforce by about 70% by 2030 [36][37] Group 3 - The article discusses the critical energy demands of AI and data centers, emphasizing that the lack of electricity supply is becoming a more pressing issue than chip shortages, with predictions that China's electricity output will reach three times that of the U.S. by 2026 [40][50] - The future of AI development is increasingly tied to energy availability, including the need for infrastructure such as transformers and cooling systems, indicating a collective effort across the industry is necessary [48][49]
ChatGPT强行上马广告,因为OpenAI真的很烧钱
量子位· 2026-01-19 07:00
Core Viewpoint - OpenAI is facing a financial crisis, prompting the introduction of advertising in ChatGPT as a potential solution to generate revenue and avoid bankruptcy [7][15][51]. Financial Situation - OpenAI is projected to run out of funds within 18 months, with reports indicating a possible acquisition by larger companies like Microsoft or Amazon [7][15]. - The company raised a record $40 billion in funding last year, but its expenses are significantly high, with projected annual burn rates exceeding $8 billion in 2025 and reaching $40 billion by 2028 [10][13]. - OpenAI's revenue for the previous year was only $20 billion, highlighting a substantial financial gap compared to its expenditures [15]. - The AI industry is estimated to have an $800 billion funding shortfall, exacerbating OpenAI's financial challenges [15][16]. Advertising Strategy - OpenAI plans to test advertising in the free version of ChatGPT, marking a shift from a subscription-based revenue model to include advertising income [26][28]. - The ads will be labeled as "sponsored content" and will not affect the objectivity of ChatGPT's responses [27][29]. - OpenAI anticipates generating "low billions" in revenue from advertising by 2026, with plans to scale this income source over time [22][28]. Business Model Expansion - The introduction of advertising is part of OpenAI's broader commercial strategy, which includes subscription services and API usage-based billing [25][41]. - OpenAI's CFO emphasized that the business model should expand in line with the value created by its intelligence [36]. - Future revenue growth is expected to come from various sources, including subscriptions, API usage, and potential new pricing models as AI technology advances [41][42]. User Engagement and Growth - OpenAI's weekly and daily active user metrics are at all-time highs, driven by a cycle of investment in computing power, research, and product development [43][44]. - The company expects a 9.5-fold increase in computing power from 2023 to 2025, with revenue growth projected to match this increase [46][55].
哈工大系闯出人形机器人黑马:成立不到一年,全栈开源3m/s原型机,小米商汤都投了
量子位· 2026-01-19 07:00
Core Viewpoint - Roboparty has launched a fully open-source bipedal humanoid robot prototype, "Roboto_Original," aiming to revolutionize the humanoid robot development industry through collaborative innovation and shared resources [2][10]. Group 1: Open Source Initiative - The open-source release includes not only software code but also hardware schematics, EBOM material lists, supplier information, and a comprehensive knowledge base to facilitate development [5][10]. - The goal is to create a reproducible, verifiable, and modifiable open-source framework, addressing the industry's long-standing pain points of high development barriers and lack of standardization [6][9][10]. Group 2: Technical Specifications - The "Roboto_Original" prototype has a running speed of up to 3 m/s, positioning it among the leading open-source humanoid robots globally [4][24]. - The robot's hardware features a height of 1.2m and a weight of 30kg, with detailed design documents available to lower the barriers for hardware development and replication [12][14]. Group 3: Software and Control - The project has released full control code covering core modules for imitation, perception, and navigation, allowing developers to leverage extensive motion capture data [16]. - The AMP control algorithm enhances the robot's walking and running capabilities, ensuring natural movement and stability, which is crucial for real-world applications [26][27]. Group 4: Engineering and Collaboration - Roboparty has established a knowledge base for hands-on learning in humanoid robotics, focusing on practical issues like walking stability and production costs [21][36]. - The initiative aims to shift the industry from isolated trial-and-error approaches to collaborative breakthroughs, fostering a community-driven development environment [22][30]. Group 5: Industry Impact and Funding - The project has secured millions in seed funding from notable investors, indicating strong market interest and validation of its technological approach [29]. - Roboparty aims to reduce development costs by 80%, making humanoid robotics more accessible and scalable across various industries [32][31].
45年数论猜想被GPT-5.2 Pro独立完成证明,陶哲轩:没犯任何错误
量子位· 2026-01-19 07:00
Core Viewpoint - The article discusses the significant achievement of OpenAI's GPT-5.2 Pro in independently proving a mathematical conjecture known as the Erdős problem, specifically the 281st problem from the Erdős problem collection, which had remained unsolved for 45 years [2][4][5]. Group 1: Proof and Validation - The proof was verified by Fields Medalist Terence Tao, who described it as "the clearest first-class result contributed by AI to date" [3]. - The proof utilized concepts from ergodic theory and combinatorial mathematics, specifically leveraging the Birkhoff theorem and avoiding common pitfalls such as limit exchanges and quantifier order errors [9][15][12]. - Tao translated the proof into combinatorial language, confirming its validity and establishing that the proof is indeed correct [16][17]. Group 2: Alternative Solutions - An unexpected discovery was made by a user named KoishiChan, who pointed out that a simpler solution to the problem exists, utilizing two theorems established in 1936 and 1966 [18]. - The first theorem is the density convergence theorem co-proven by Harold Davenport and Paul Erdős in 1936, and the second is Rogers' theorem from a 1966 publication [19]. - This raises questions about why Erdős himself did not recognize the proximity of the solution when he proposed the problem in 1980 [20]. Group 3: AI's Success Rate and Future Implications - Following the announcement, various AI models were tested for their ability to validate the proof, with Gemini 3 Pro confirming its correctness [24]. - However, Tao cautioned that the true success rate of AI tools in solving such problems is likely skewed due to reporting biases, with only about 1% to 2% of attempts yielding positive results [30]. - Despite this low success rate, the existence of over 600 unsolved problems in the Erdős collection suggests that AI contributions could still be significant [31].
真·开外挂!MIT新研究:架构0改动,让大模型解锁千万级上下文
量子位· 2026-01-19 03:48
Core Insights - The article discusses a new method called Recursive Language Model (RLM) developed by MIT CSAIL for processing long texts, addressing the issue of context decay in large models [1][5][11] - RLM allows top models like GPT-5 and Qwen-3 to handle super long texts with millions of tokens without modifying their architecture [2][23] Summary by Sections Context Decay Issue - Large models struggle with context decay, where the performance declines as the text length increases, leading to a loss of memory for earlier information [5][6] - Current mainstream solutions include context compression, retrieval-augmented generation (RAG), and architectural optimizations [7][10] RLM Methodology - RLM outsources context processing to an interactive Python environment, enabling models to programmatically break down tasks and process them as needed [4][13][15] - The model initiates a Python REPL environment, storing long prompts as string variables and performing operations like keyword filtering and logical decomposition [14] Performance Metrics - RLM has demonstrated the ability to effectively handle over 10 million tokens, significantly surpassing the native context window of models like GPT-5 [16] - In complex long text tasks, RLM showed substantial improvements, achieving F1 scores of 58.00% and 23.11% for GPT-5 and Qwen-3, respectively, in the OOLONG-Pairs task [16] - For the BrowseComp-Plus multi-document reasoning task, RLM (GPT-5) achieved a correct rate of 91.33%, outperforming other long text processing methods [16] Cost Efficiency - RLM's cost at the 50th percentile is competitive with other long text processing solutions, indicating a favorable cost-performance ratio in most regular task scenarios [19] - However, at the 95th percentile, RLM's costs can spike due to its dynamic reasoning process, which increases API call frequency based on task complexity [20][21]
零样本&少样本横扫12个工业医疗数据集:西门子×腾讯优图新研究精准定位缺陷,检测精度新SOTA丨AAAI 2026
量子位· 2026-01-19 03:48
Core Insights - The article discusses the development of AdaptCLIP, a universal visual anomaly detection framework that aims to improve performance in industrial quality inspection and medical imaging by leveraging the capabilities of the CLIP model while addressing its limitations in zero-shot and few-shot scenarios [2][4]. Group 1: Challenges in Anomaly Detection - Traditional models for defect detection require extensive labeled data, making them less effective in real-world scenarios where data is scarce [1][3]. - The core challenge in anomaly detection is the need for models to generalize across domains while accurately identifying subtle anomalies with minimal target domain data [3][4]. Group 2: AdaptCLIP Framework - AdaptCLIP introduces a lightweight adaptation approach by adding three adapters to the CLIP model without altering its core structure, enabling it to perform both image-level anomaly classification and pixel-level anomaly segmentation [5][6]. - The framework employs an alternating learning strategy, optimizing visual and textual representations separately to enhance performance in zero-shot anomaly detection [20][21]. Group 3: Key Innovations - The visual adapter fine-tunes CLIP's output tokens to better align with the anomaly detection task, significantly improving pixel-level localization capabilities [15][18]. - The text adapter eliminates the need for manually designed prompts by learning optimized embeddings for "normal" and "anomalous" classes, thus reducing dependency on prompt engineering [16][18]. Group 4: Experimental Results - AdaptCLIP achieved an average image-level AUROC of 86.2% across multiple industrial datasets in zero-shot scenarios, outperforming existing methods [31]. - In medical imaging tasks, AdaptCLIP demonstrated an average pixel-level AUPR of 48.7% and an average image-level AUROC of 90.7%, indicating superior performance compared to other approaches [31][32]. Group 5: Efficiency and Scalability - The model introduces approximately 0.6 million additional trainable parameters under zero-shot conditions, significantly lower than competing methods that can exceed 10.7 million parameters [32][37]. - AdaptCLIP maintains a reasonable inference time of about 162 ms per image at a resolution of 518x518, balancing detection accuracy with deployment efficiency [32][37].
量子位编辑作者招聘
量子位· 2026-01-19 03:48
编辑部 发自 凹非寺 量子位 | 公众号 QbitAI AI热潮还在汹涌,但如果你还不知道如何参与……那为什么不来 量子位 呢? 我们是一家以 追踪AI新进展 为核心的内容平台,经过8年积累,目前拥有顶流影响力,广泛且备受认可的产业资源,以及时代风口的最佳观 测和学习生态位。 目前,我们有 三大方向 岗位招聘,希望你是 (或者能成为) 这三个方向的内容专家: 岗位均为全职,工作地点:北京中关村。 岗位面向: 加入我们,你可以获得: 以下是岗位详情: 所有岗位不同能力层级职位均在开放,欢迎结合个人履历和经验申请。 AI产业方向 岗位职责: AI产业方向 :关注基建层创新,包含芯片、AI Infra、云计算; AI财经方向 :关注AI领域创投和财报,跟踪产业链资本动向; AI产品方向 :关注AI在应用和硬件终端方向的进展。 AI财经商业方向 岗位职责: 任职要求: AI产品方向 岗位职责: 社招:覆盖编辑、主笔、主编各个层级,按能力匹配岗位; 校招:应届毕业生,接受实习且可转正。 站在AI浪潮之巅 :第一时间接触和了解AI领域最新技术和产品,构建完整的AI认知体系。 玩转AI新工具 :将各种AI新技术、新工具应用 ...