Workflow
量子位
icon
Search documents
AI的尽头,是电工(doge)
量子位· 2026-01-19 09:30
Group 1 - The core viewpoint of the article highlights the increasing demand for electricians in the AI era, with an estimated annual shortage of about 81,000 electricians in the U.S. from 2024 to 2034, leading to a projected 9% growth in employment for electricians over the next decade, significantly higher than the average for all occupations [2][3][4] - The surge in job openings is primarily driven by data centers, which are creating a substantial demand for electricians and other blue-collar workers, including plumbers and HVAC technicians [6][8] - Major tech companies are significantly increasing their hiring in the energy sector, with a 34% year-on-year increase in recruitment for 2024, maintaining high levels into 2025, and overall hiring in this sector is approximately 30% higher than before the release of ChatGPT in 2022 [9][10][11][12] Group 2 - The shortage of electricians is exacerbated by a long-standing lack of skilled labor in the construction industry, with many young people being encouraged to pursue white-collar jobs instead of trades, leading to a gap as experienced workers retire [24][28][30] - Training for electricians is becoming more rigorous, with companies preferring to hire fully trained workers rather than apprentices, which further complicates the supply-demand imbalance [34][36] - Tech companies like Google are proactively addressing the shortage by funding training programs to enhance the skills of existing electricians and train new apprentices, aiming to increase the overall workforce by about 70% by 2030 [36][37] Group 3 - The article discusses the critical energy demands of AI and data centers, emphasizing that the lack of electricity supply is becoming a more pressing issue than chip shortages, with predictions that China's electricity output will reach three times that of the U.S. by 2026 [40][50] - The future of AI development is increasingly tied to energy availability, including the need for infrastructure such as transformers and cooling systems, indicating a collective effort across the industry is necessary [48][49]
ChatGPT强行上马广告,因为OpenAI真的很烧钱
量子位· 2026-01-19 07:00
Core Viewpoint - OpenAI is facing a financial crisis, prompting the introduction of advertising in ChatGPT as a potential solution to generate revenue and avoid bankruptcy [7][15][51]. Financial Situation - OpenAI is projected to run out of funds within 18 months, with reports indicating a possible acquisition by larger companies like Microsoft or Amazon [7][15]. - The company raised a record $40 billion in funding last year, but its expenses are significantly high, with projected annual burn rates exceeding $8 billion in 2025 and reaching $40 billion by 2028 [10][13]. - OpenAI's revenue for the previous year was only $20 billion, highlighting a substantial financial gap compared to its expenditures [15]. - The AI industry is estimated to have an $800 billion funding shortfall, exacerbating OpenAI's financial challenges [15][16]. Advertising Strategy - OpenAI plans to test advertising in the free version of ChatGPT, marking a shift from a subscription-based revenue model to include advertising income [26][28]. - The ads will be labeled as "sponsored content" and will not affect the objectivity of ChatGPT's responses [27][29]. - OpenAI anticipates generating "low billions" in revenue from advertising by 2026, with plans to scale this income source over time [22][28]. Business Model Expansion - The introduction of advertising is part of OpenAI's broader commercial strategy, which includes subscription services and API usage-based billing [25][41]. - OpenAI's CFO emphasized that the business model should expand in line with the value created by its intelligence [36]. - Future revenue growth is expected to come from various sources, including subscriptions, API usage, and potential new pricing models as AI technology advances [41][42]. User Engagement and Growth - OpenAI's weekly and daily active user metrics are at all-time highs, driven by a cycle of investment in computing power, research, and product development [43][44]. - The company expects a 9.5-fold increase in computing power from 2023 to 2025, with revenue growth projected to match this increase [46][55].
哈工大系闯出人形机器人黑马:成立不到一年,全栈开源3m/s原型机,小米商汤都投了
量子位· 2026-01-19 07:00
Core Viewpoint - Roboparty has launched a fully open-source bipedal humanoid robot prototype, "Roboto_Original," aiming to revolutionize the humanoid robot development industry through collaborative innovation and shared resources [2][10]. Group 1: Open Source Initiative - The open-source release includes not only software code but also hardware schematics, EBOM material lists, supplier information, and a comprehensive knowledge base to facilitate development [5][10]. - The goal is to create a reproducible, verifiable, and modifiable open-source framework, addressing the industry's long-standing pain points of high development barriers and lack of standardization [6][9][10]. Group 2: Technical Specifications - The "Roboto_Original" prototype has a running speed of up to 3 m/s, positioning it among the leading open-source humanoid robots globally [4][24]. - The robot's hardware features a height of 1.2m and a weight of 30kg, with detailed design documents available to lower the barriers for hardware development and replication [12][14]. Group 3: Software and Control - The project has released full control code covering core modules for imitation, perception, and navigation, allowing developers to leverage extensive motion capture data [16]. - The AMP control algorithm enhances the robot's walking and running capabilities, ensuring natural movement and stability, which is crucial for real-world applications [26][27]. Group 4: Engineering and Collaboration - Roboparty has established a knowledge base for hands-on learning in humanoid robotics, focusing on practical issues like walking stability and production costs [21][36]. - The initiative aims to shift the industry from isolated trial-and-error approaches to collaborative breakthroughs, fostering a community-driven development environment [22][30]. Group 5: Industry Impact and Funding - The project has secured millions in seed funding from notable investors, indicating strong market interest and validation of its technological approach [29]. - Roboparty aims to reduce development costs by 80%, making humanoid robotics more accessible and scalable across various industries [32][31].
45年数论猜想被GPT-5.2 Pro独立完成证明,陶哲轩:没犯任何错误
量子位· 2026-01-19 07:00
Core Viewpoint - The article discusses the significant achievement of OpenAI's GPT-5.2 Pro in independently proving a mathematical conjecture known as the Erdős problem, specifically the 281st problem from the Erdős problem collection, which had remained unsolved for 45 years [2][4][5]. Group 1: Proof and Validation - The proof was verified by Fields Medalist Terence Tao, who described it as "the clearest first-class result contributed by AI to date" [3]. - The proof utilized concepts from ergodic theory and combinatorial mathematics, specifically leveraging the Birkhoff theorem and avoiding common pitfalls such as limit exchanges and quantifier order errors [9][15][12]. - Tao translated the proof into combinatorial language, confirming its validity and establishing that the proof is indeed correct [16][17]. Group 2: Alternative Solutions - An unexpected discovery was made by a user named KoishiChan, who pointed out that a simpler solution to the problem exists, utilizing two theorems established in 1936 and 1966 [18]. - The first theorem is the density convergence theorem co-proven by Harold Davenport and Paul Erdős in 1936, and the second is Rogers' theorem from a 1966 publication [19]. - This raises questions about why Erdős himself did not recognize the proximity of the solution when he proposed the problem in 1980 [20]. Group 3: AI's Success Rate and Future Implications - Following the announcement, various AI models were tested for their ability to validate the proof, with Gemini 3 Pro confirming its correctness [24]. - However, Tao cautioned that the true success rate of AI tools in solving such problems is likely skewed due to reporting biases, with only about 1% to 2% of attempts yielding positive results [30]. - Despite this low success rate, the existence of over 600 unsolved problems in the Erdős collection suggests that AI contributions could still be significant [31].
真·开外挂!MIT新研究:架构0改动,让大模型解锁千万级上下文
量子位· 2026-01-19 03:48
Core Insights - The article discusses a new method called Recursive Language Model (RLM) developed by MIT CSAIL for processing long texts, addressing the issue of context decay in large models [1][5][11] - RLM allows top models like GPT-5 and Qwen-3 to handle super long texts with millions of tokens without modifying their architecture [2][23] Summary by Sections Context Decay Issue - Large models struggle with context decay, where the performance declines as the text length increases, leading to a loss of memory for earlier information [5][6] - Current mainstream solutions include context compression, retrieval-augmented generation (RAG), and architectural optimizations [7][10] RLM Methodology - RLM outsources context processing to an interactive Python environment, enabling models to programmatically break down tasks and process them as needed [4][13][15] - The model initiates a Python REPL environment, storing long prompts as string variables and performing operations like keyword filtering and logical decomposition [14] Performance Metrics - RLM has demonstrated the ability to effectively handle over 10 million tokens, significantly surpassing the native context window of models like GPT-5 [16] - In complex long text tasks, RLM showed substantial improvements, achieving F1 scores of 58.00% and 23.11% for GPT-5 and Qwen-3, respectively, in the OOLONG-Pairs task [16] - For the BrowseComp-Plus multi-document reasoning task, RLM (GPT-5) achieved a correct rate of 91.33%, outperforming other long text processing methods [16] Cost Efficiency - RLM's cost at the 50th percentile is competitive with other long text processing solutions, indicating a favorable cost-performance ratio in most regular task scenarios [19] - However, at the 95th percentile, RLM's costs can spike due to its dynamic reasoning process, which increases API call frequency based on task complexity [20][21]
零样本&少样本横扫12个工业医疗数据集:西门子×腾讯优图新研究精准定位缺陷,检测精度新SOTA丨AAAI 2026
量子位· 2026-01-19 03:48
Core Insights - The article discusses the development of AdaptCLIP, a universal visual anomaly detection framework that aims to improve performance in industrial quality inspection and medical imaging by leveraging the capabilities of the CLIP model while addressing its limitations in zero-shot and few-shot scenarios [2][4]. Group 1: Challenges in Anomaly Detection - Traditional models for defect detection require extensive labeled data, making them less effective in real-world scenarios where data is scarce [1][3]. - The core challenge in anomaly detection is the need for models to generalize across domains while accurately identifying subtle anomalies with minimal target domain data [3][4]. Group 2: AdaptCLIP Framework - AdaptCLIP introduces a lightweight adaptation approach by adding three adapters to the CLIP model without altering its core structure, enabling it to perform both image-level anomaly classification and pixel-level anomaly segmentation [5][6]. - The framework employs an alternating learning strategy, optimizing visual and textual representations separately to enhance performance in zero-shot anomaly detection [20][21]. Group 3: Key Innovations - The visual adapter fine-tunes CLIP's output tokens to better align with the anomaly detection task, significantly improving pixel-level localization capabilities [15][18]. - The text adapter eliminates the need for manually designed prompts by learning optimized embeddings for "normal" and "anomalous" classes, thus reducing dependency on prompt engineering [16][18]. Group 4: Experimental Results - AdaptCLIP achieved an average image-level AUROC of 86.2% across multiple industrial datasets in zero-shot scenarios, outperforming existing methods [31]. - In medical imaging tasks, AdaptCLIP demonstrated an average pixel-level AUPR of 48.7% and an average image-level AUROC of 90.7%, indicating superior performance compared to other approaches [31][32]. Group 5: Efficiency and Scalability - The model introduces approximately 0.6 million additional trainable parameters under zero-shot conditions, significantly lower than competing methods that can exceed 10.7 million parameters [32][37]. - AdaptCLIP maintains a reasonable inference time of about 162 ms per image at a resolution of 518x518, balancing detection accuracy with deployment efficiency [32][37].
李飞飞的World Labs联手光轮智能,具身智能进入评测驱动时代!
量子位· 2026-01-19 03:48
Core Viewpoint - The collaboration between World Labs, led by Fei-Fei Li, and Guanglun Intelligent, a leading synthetic data company, aims to address the long-standing issue of "scalable evaluation" in the field of embodied intelligence, marking the entry into an evaluation-driven era for this technology [1][2][3]. Group 1: Companies Involved - World Labs is founded by Fei-Fei Li, a prominent figure in AI, known for her work on ImageNet and as a former chief AI scientist at Google Cloud [4][5]. - Guanglun Intelligent is recognized as a hot company in the embodied intelligence infrastructure sector, having established a strong partnership with NVIDIA and contributing to the development of simulation systems [54][55]. Group 2: Technological Innovations - World Labs is set to launch its first product, Marble, by the end of 2025, which can generate high-fidelity 3D worlds from minimal input [8][9]. - Marble aims to provide a visualized world model, allowing users to create and export 3D environments efficiently, thus serving as a productivity tool for visual effects and game developers [15][16]. Group 3: Challenges in Evaluation - The rapid advancement of models in embodied intelligence has outpaced existing benchmarks, creating a need for new evaluation methods [20][22]. - Traditional evaluation methods are inadequate for assessing the capabilities of embodied intelligence, necessitating the use of simulation as a scalable solution [29][30]. Group 4: Strategic Collaboration - The partnership between World Labs and Guanglun Intelligent is crucial for developing a comprehensive evaluation framework that integrates environment generation and physical interaction [37][49]. - Guanglun Intelligent's role is to provide the necessary physical assets and evaluation loops, ensuring that the simulated environments can support real physical interactions [49][50]. Group 5: Future Directions - The collaboration signifies a pivotal moment in the embodied intelligence sector, as it transitions into an evaluation-driven era, with the potential to shape research directions and identify technological bottlenecks [71][72][76]. - The establishment of robust evaluation standards, such as RoboFinals, highlights the industry's shift towards scalable and credible assessment frameworks for advanced robotic models [63][64].
量子位编辑作者招聘
量子位· 2026-01-19 03:48
编辑部 发自 凹非寺 量子位 | 公众号 QbitAI AI热潮还在汹涌,但如果你还不知道如何参与……那为什么不来 量子位 呢? 我们是一家以 追踪AI新进展 为核心的内容平台,经过8年积累,目前拥有顶流影响力,广泛且备受认可的产业资源,以及时代风口的最佳观 测和学习生态位。 目前,我们有 三大方向 岗位招聘,希望你是 (或者能成为) 这三个方向的内容专家: 岗位均为全职,工作地点:北京中关村。 岗位面向: 加入我们,你可以获得: 以下是岗位详情: 所有岗位不同能力层级职位均在开放,欢迎结合个人履历和经验申请。 AI产业方向 岗位职责: AI产业方向 :关注基建层创新,包含芯片、AI Infra、云计算; AI财经方向 :关注AI领域创投和财报,跟踪产业链资本动向; AI产品方向 :关注AI在应用和硬件终端方向的进展。 AI财经商业方向 岗位职责: 任职要求: AI产品方向 岗位职责: 社招:覆盖编辑、主笔、主编各个层级,按能力匹配岗位; 校招:应届毕业生,接受实习且可转正。 站在AI浪潮之巅 :第一时间接触和了解AI领域最新技术和产品,构建完整的AI认知体系。 玩转AI新工具 :将各种AI新技术、新工具应用 ...
全球首个负载100斤的真实持续干活机器人,来自银河通用
量子位· 2026-01-19 01:00
Core Viewpoint - The Galbot S1, a fully autonomous heavy-duty embodied intelligent robot, has been deployed in the production line of CATL, marking a significant advancement in the industrial application of embodied intelligence [1][8][56] Group 1: Product Features and Capabilities - Galbot S1 has a maximum continuous operational load capacity of 50 kilograms, breaking industry limits and addressing the heavy load requirements in industrial settings [2][9] - The robot features a pioneering fully autonomous, zero-remote operation "embodied handling model," which utilizes pure visual perception to operate in complex factory environments without relying on static paths or pre-set routes [5][17] - The design of Galbot S1 aligns with industrial standards, featuring IP54 protection against dust and water, and a maximum working height of 2.3 meters, making it suitable for various industrial environments [15] Group 2: Industry Impact and Significance - The deployment of Galbot S1 in leading manufacturing companies like CATL signifies a milestone in integrating embodied intelligence into core production processes, moving beyond auxiliary roles [8][26][27] - This development reflects a shift from merely demonstrating technological capabilities to establishing a replicable and scalable productivity solution within the manufacturing sector [46][51] - The successful operation of Galbot S1 in real production lines validates its reliability and long-term value, indicating that embodied intelligence is now a critical component of industrial upgrades [28][29] Group 3: Market and Future Prospects - The recent completion of a 2.1 billion yuan financing round and a valuation exceeding 20 billion yuan positions the company as a leader in the embodied intelligence sector in China, showcasing confidence in the long-term prospects of intelligent manufacturing [53] - The ongoing collaboration with major manufacturers and the accumulation of operational data and engineering experience are paving the way for large-scale applications of embodied intelligence [34][51] - The emergence of Galbot S1 represents a crucial step in the evolution of embodied intelligence, as it begins to play a key role in enhancing productivity within the industrial core [56][57]
马斯克最大算力中心建成了:全球首个GW级超算集群,再创世界纪录
量子位· 2026-01-18 05:29
Core Viewpoint - The launch of Colossus 2, the world's first 1GW supercomputing cluster, marks a significant advancement in AI infrastructure, with plans to upgrade to 1.5GW by April and potentially reach 2GW, which could match the power consumption of major U.S. cities [2][12]. Group 1: Colossus 2 Overview - Colossus 2 is equipped with approximately 200,000 NVIDIA H100/H200 GPUs and around 30,000 NVIDIA GB200 NVL72 GPUs, significantly enhancing its computational power compared to its predecessor, Colossus 1, which was built in just 122 days [9][10]. - The cluster's 1GW capacity can power about 750,000 households, equivalent to the peak power demand of San Francisco [11]. - Once fully operational, Colossus 2 will house 555,000 GPUs, surpassing the GPU counts of Meta, Microsoft, and Google [13][14]. Group 2: Implications for AI Development - The advancements in Colossus 2 are expected to facilitate the development of Grok 5, which is projected to have parameters around 6 trillion, more than double that of Grok 4 [15][18]. - With the recent $20 billion funding round for xAI, the scaling capabilities for Grok 5 are increasing, leading to larger model parameters and faster training and deployment speeds [18][19]. - The rapid development of AI models is seen as a competitive advantage in the industry, emphasizing that speed is a crucial factor in the AI era [20]. Group 3: Energy Supply Concerns - The construction of large data centers like Colossus 2 is contributing to a projected annual electricity demand growth of 4.8% over the next decade, which is unprecedented for the U.S. energy system [27]. - The imbalance between rapidly increasing demand and slow supply growth is causing concerns about the stability of the power grid, leading to potential rolling blackouts for 67 million residents in 13 states during extreme weather [5][22][23]. - PJM, the regional transmission organization, is struggling to maintain supply-demand balance and has proposed measures to reduce peak demand from data centers, which have faced opposition from major tech companies [32][34].