量子位 (QbitAI)
ByteDance Seed: Large Concept Models Have Arrived, So Why Must Reasoning Run on the Next Token?
量子位· 2026-01-04 11:00
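henry, reporting from Aofeisi (凹非寺)
量子位 | WeChat official account QbitAI
Why must the next unit of reasoning for LLMs be the token?
ByteDance's Seed team has just released its latest research: DLCM (Dynamic Large Concept Models), which moves the unit of LLM reasoning from the token (word) level up to the concept level, dynamically and adaptively.
DLCM learns semantic boundaries end to end, dynamically segments the token sequence into concepts, performs deep reasoning in the compressed concept space, and uses causal cross-attention to reconstruct the concept-level reasoning results into token-level predictions.
As a result, the compute allocation of conventional LLMs, which rests on uniform and redundant token-level information density, is replaced by concept-oriented dynamic reasoning and adaptive compute allocation.
On reasoning-focused benchmarks, DLCM cuts inference-stage FLOPs by 34% while also raising average accuracy by 2.69%.
This also means that the reasoning efficiency of large models does not have to depend on denser token-level computation; it can come from organizing semantics at a higher level.
Let's take a closer look.
A hierarchical next-token prediction framework
As noted above, the core of DLCM is learning a dynamic token-to-concept mapping, which enables adaptive allocation of compute.
Next, in the dynamic segmentation stage, the model works from token-level representations and computes, between adjacent tokens, ...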
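The compression step described above, in which a token sequence is grouped into fewer concept vectors, can be sketched as follows. This is only an illustration under stated assumptions, not DLCM's implementation: the boundary flags are written by hand (the excerpt is cut off before it says how boundaries are scored), and mean pooling is an assumed stand-in for however the model actually forms concept representations. The function name `pool_into_concepts` is hypothetical.

```python
from typing import List


def pool_into_concepts(
    token_vecs: List[List[float]], starts_concept: List[bool]
) -> List[List[float]]:
    """Group consecutive token vectors into concepts and mean-pool each group.

    starts_concept[i] is True when token i opens a new concept. Both the
    hand-written flags and the mean pooling are assumptions for illustration.
    """
    if len(token_vecs) != len(starts_concept):
        raise ValueError("need exactly one boundary flag per token")

    # Split the sequence into runs of tokens that belong to the same concept.
    groups: List[List[List[float]]] = []
    for vec, starts_new in zip(token_vecs, starts_concept):
        if starts_new or not groups:
            groups.append([vec])
        else:
            groups[-1].append(vec)

    # Represent each concept by the average of its token vectors.
    concepts = []
    for group in groups:
        dim = len(group[0])
        concepts.append([sum(v[d] for v in group) / len(group) for d in range(dim)])
    return concepts


# Six toy token vectors, hand-segmented into three concepts.
tokens = [[1.0, 0.0], [3.0, 0.0], [0.0, 2.0], [0.0, 4.0], [0.0, 6.0], [5.0, 5.0]]
flags = [True, False, True, False, False, True]
concepts = pool_into_concepts(tokens, flags)
print(len(tokens), "tokens ->", len(concepts), "concepts")
```

Any deeper reasoning layers would then run over the shorter concept sequence rather than over every token, which is where the compute saving described in the article would come from.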
New MIT Paper: Reasoning Models Are Outdated in 2026, and the "Nesting-Doll Model" Takes Over
量子位· 2026-01-04 09:06
Core Viewpoint
- The article discusses the emergence of a new paradigm in language models called the "Recursive Language Model" (RLM), which significantly improves the handling of long texts and reduces costs compared to traditional models like GPT-5 [3][5][23].
Group 1: RLM Overview
- The RLM introduces a novel approach by storing text in a code environment and allowing the model to write programs that recursively call itself to process the text [5][9].
- This method decouples the length of input data from the model's context window size, enabling the processing of text limited only by physical memory rather than the constraints of the Transformer architecture [10][12].
Group 2: Performance Metrics
- RLM has demonstrated the ability to effectively handle up to 10 million tokens, surpassing the context window of leading models like GPT-5 by two orders of magnitude [23].
- In various benchmark tests, RLM outperformed traditional models in complex tasks, achieving F1 scores of 58.00% and 23.11% in OOLONG and OOLONG-Pairs tests, respectively, while traditional models scored below 0.1% [27].
Group 3: Cost Efficiency
- RLM's approach allows for selective reading of relevant text segments, leading to a significant reduction in operational costs. For instance, the average cost for RLM in the BrowseComp-Plus benchmark was only $0.99, compared to $1.50 to $2.75 for GPT-5 [29][31].
- This cost efficiency indicates that RLM can maintain performance while controlling inference costs, making it a viable option for large-scale applications involving long texts [32].
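The idea of keeping the text in a code environment and recursing over it can be sketched roughly as below. Everything here is a hypothetical stand-in: `stub_model` replaces a real LLM call with simple keyword matching, `window` counts lines rather than tokens, and the recursion is hand-written, whereas the summary says the model writes such programs itself. The point is only that no single call ever sees more than `window` lines, however long the document is.

```python
from typing import Callable, List

# A "model" here is any function that reads a small chunk and answers a question.
Model = Callable[[List[str], str], List[str]]


def recursive_query(lines: List[str], question: str, model: Model, window: int) -> List[str]:
    """Answer `question` over `lines` without passing more than `window` lines per call."""
    if window < 1:
        raise ValueError("window must be at least 1")
    if len(lines) <= window:
        return model(lines, question)
    # The chunk is too long for one call: split it and recurse on each half.
    mid = len(lines) // 2
    return (recursive_query(lines[:mid], question, model, window)
            + recursive_query(lines[mid:], question, model, window))


chunk_sizes: List[int] = []  # records how many lines each model call received


def stub_model(lines: List[str], question: str) -> List[str]:
    """Hypothetical stand-in for an LLM call: return the lines mentioning the keyword."""
    chunk_sizes.append(len(lines))
    return [ln for ln in lines if question in ln]


# The long document lives in an ordinary variable, outside any context window.
document = [f"line {i}: filler" for i in range(1000)]
document[617] = "line 617: the passcode is 42"

answer = recursive_query(document, "passcode", stub_model, window=50)
print(answer)
print("largest chunk seen by the model:", max(chunk_sizes))
```

Because the document length only affects how deep the recursion goes, the input size is decoupled from the per-call window, which is the property the summary attributes to RLM.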
OpenAI's First Hardware Product Is Settled as a Pen! Netizens: Just Call It oPen
量子位· 2026-01-04 07:25
Core Viewpoint
- OpenAI's first AI hardware product is an "AI pen," which exceeds expectations and aims to enhance user interaction with AI technology [1][6][12].
Group 1: Product Features
- The AI pen is designed to facilitate two-way communication with ChatGPT through paired devices like smartphones [4][10].
- It is described as being similar in size to an iPod Shuffle, weighing approximately 10-15 grams [7].
- The pen is expected to run OpenAI's custom models locally, converting handwritten content into text and allowing users to sync this information with ChatGPT for further inquiries [10][11].
Group 2: Design and Development
- The pen's design involves collaboration with Jony Ive, former Chief Design Officer at Apple, indicating a focus on professional design expertise [3][13].
- OpenAI's acquisition of Jony Ive's hardware company for approximately $6.5 billion last year marks a significant step in its hardware development strategy [14].
Group 3: Strategic Implications
- The choice of a pen as the first hardware product reflects OpenAI's long-term vision to create a seamless AI experience that minimizes distractions and enhances user interaction [19][22].
- This product aims to fill a gap in the current ecosystem, reducing reliance on major platforms like Apple and Google, and potentially opening new revenue streams through hardware and services [25][27].
QbitAI Is Hiring Editors and Writers
量子位· 2026-01-04 05:21
Core Viewpoint
- The article emphasizes the ongoing AI boom and invites individuals to join QbitAI (量子位), which focuses on tracking AI advancements and has established itself as a leading content platform in the industry [1].
Job Opportunities
- The company is hiring for three main directions: AI Industry, AI Finance, and AI Product, with positions available for both experienced professionals and fresh graduates [2][4].
AI Industry Direction
- Responsibilities include monitoring innovations in infrastructure, such as chips, AI infrastructure, and cloud computing, as well as producing accessible interpretations of cutting-edge research and technical reports from major conferences [6][7].
- The company offers a dynamic work environment, opportunities for personal influence, and professional mentorship for newcomers [6].
AI Finance Direction
- This role focuses on venture capital and financial reporting within the AI sector, tracking capital movements in the industry and producing analyses of investment trends and company strategies [9].
AI Product Direction
- Responsibilities involve assessing AI applications and hardware, writing in-depth evaluations of new products, and engaging with entrepreneurs and experts in the field [10].
Company Growth and Impact
- By 2025, QbitAI aims to have over 2.4 million subscribers on WeChat and more than 7 million users across platforms, with a daily readership exceeding 2 million [12].
The "AI 100" List Opens for Applications: The AI Product "Annual Gala" Must Go On | QbitAI Think Tank
量子位· 2026-01-04 05:21
Core Insights
- The article discusses the emergence of numerous keywords in the AI product sector by 2025, highlighting transformative AI products that are reshaping the industry [4].
- The "AI 100" list by QbitAI Think Tank aims to evaluate and recognize the top AI products in China, reflecting the current landscape and future trends in AI [4][12].
Group 1: AI 100 List Overview
- The "AI 100" list is divided into three main categories: "Flagship AI 100," "Innovative AI 100," and the top three products in ten popular sub-sectors [6].
- The "Flagship AI 100" will focus on the strongest AI products of 2025, showcasing those that have achieved significant technological breakthroughs and practical application value [7].
- The "Innovative AI 100" aims to identify products that are expected to emerge in 2026, representing cutting-edge AI technology and potential industry disruptors [8].
Group 2: Sub-sector Focus
- The ten sub-sectors for the top three products include AI Browser, AI Agent, AI Smart Assistant, AI Workbench, AI Creation, AI Education, AI Healthcare, AI Entertainment, Vibe Coding, and AI Consumer Hardware [9].
- This categorization is designed to provide a more precise reflection of development trends within each specific field [9].
Group 3: Application and Evaluation
- The evaluation of the "AI 100" list employs a dual assessment system combining quantitative and qualitative measures, focusing on user data and expert evaluations [13].
- Quantitative metrics include user scale, growth, activity, and retention, while qualitative assessments consider long-term potential, technology, market space, and user experience [13].
LeCun Reveals That Meta Cheated on Benchmarks; Tian Yuandong: I Didn't Expect This Ending
量子位· 2026-01-04 05:21
Core Viewpoint
- The article discusses the fallout from the release of Meta's Llama 4, highlighting internal conflicts and the departure of key figures like LeCun and Tian Yuandong, who are now pursuing entrepreneurial ventures due to dissatisfaction with Meta's direction in AI development [1][3][22].
Group 1: Llama 4 and Internal Conflicts
- Llama 4 faced significant criticism and allegations of cheating in benchmark tests, leading to a loss of confidence from Meta's leadership [1][10].
- The release of DeepSeek, a competing AI model, pressured Meta to accelerate its AI investments, resulting in internal turmoil and a shift in team dynamics [4][6].
- The communication breakdown within the team was exacerbated by differing priorities, with LeCun's team wanting to innovate while leadership preferred proven technologies [7][8].
Group 2: Departures and New Ventures
- LeCun and Tian Yuandong both announced their intentions to start new companies after leaving Meta, with LeCun focusing on world models and Tian Yuandong on new AI initiatives [27][33].
- LeCun's new venture, Advanced Machine Intelligence (AMI), aims to explore advanced machine intelligence through open-source projects, while he will serve as the executive chairman [27][30].
- Tian Yuandong expressed a desire to co-found a startup, indicating a trend among former Meta employees to seek new opportunities outside the company [33].
Group 3: Future Directions in AI
- LeCun's focus on the V-JEPA architecture aims to enhance AI's understanding of the physical world through video and spatial data, with expectations for significant progress within 12 months [32].
- The article emphasizes the need for AI to move beyond language limitations, as highlighted by LeCun's critique of the current focus on large language models [25][26].
Here Are 8 More "Manus"-Style Companies: $100 Million ARR, All ToC
量子位· 2026-01-03 10:00
Core Insights
- The article discusses the emergence of the "$100 million ARR club" in the AI sector, highlighting companies that have achieved significant annual recurring revenue (ARR) and their implications for the industry [1][3][4].
Group 1: Definition and Importance of ARR
- ARR stands for Annual Recurring Revenue, representing stable, repeatable income generated by a product within a year [5].
- It reflects a critical question for AI companies: whether users are willing to pay for AI services long-term [6].
Group 2: Notable Companies in the $100 Million ARR Club
- Companies in the club, with the dollar figure listed for each (these figures far exceed the ARR threshold and appear to be valuations rather than ARR) [7][8]:
  - Perplexity: $20 billion
  - ElevenLabs: $6.6 billion
  - Lovable: $6.6 billion
  - Replit: over $3 billion
  - Suno: $2.5 billion
  - Gamma: $2.1 billion
  - Character: over $1 billion
  - Manus: $500 million
  - HeyGen: over $500 million
Group 3: Categories of Business Models
- The companies can be categorized into five main business paths:
  1. AI Search/Information Services (e.g., Perplexity) [12][13].
  2. Audio/Voice Infrastructure Products (e.g., ElevenLabs) [15][16].
  3. Vibe Coding/Development Tools (e.g., Replit and Lovable) [17][18].
  4. Content/Office Efficiency Tools (e.g., Gamma) [20][21].
  5. Generative Entertainment Content (e.g., Suno and HeyGen) [23][24].
Group 4: Trends and Market Dynamics
- The shift from foundational models to consumer products is a significant trend, with the consumer (ToC) sector emerging as a new goldmine [9][30].
- The AI 2.0 era is characterized by high user tolerance for product iterations, allowing companies to receive rapid feedback and adjust quickly [32][37].
Group 5: Challenges and Considerations
- Despite the growth, user stickiness is low, leading to potential churn as users switch to better products [34].
- AI-Native applications face unique cost structures, where each interaction incurs computational costs, necessitating a focus on sustainable revenue models [40][46].
- Companies must balance user growth with the costs of AI processing to ensure long-term viability [47][49].
Group 6: Strategic Acquisitions
- Meta's acquisition of Manus illustrates the value of established AI products with proven user bases, as it allows Meta to leverage existing capabilities rather than developing new products from scratch [58][62].
- The acquisition not only brings a product but also a talented team capable of enhancing Meta's AI offerings across its platforms [66].
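To make the ARR definition and the per-interaction cost concern concrete, here is a small worked sketch. All inputs are hypothetical and chosen only for round numbers; none come from the article. It uses the common convention that ARR is twelve times monthly recurring subscription revenue, and it treats compute as the only cost, which is a simplification.

```python
def annual_recurring_revenue(paying_users: int, monthly_price: float) -> float:
    """ARR under the common convention: monthly recurring revenue times 12."""
    return paying_users * monthly_price * 12


def compute_adjusted_margin(
    arr: float,
    paying_users: int,
    interactions_per_user_per_month: int,
    cost_per_interaction: float,
) -> float:
    """Share of ARR left after paying for compute on every interaction."""
    annual_compute_cost = (
        paying_users * interactions_per_user_per_month * 12 * cost_per_interaction
    )
    return (arr - annual_compute_cost) / arr


# Hypothetical product: 500,000 subscribers at $20 per month,
# each making 300 requests a month at $0.02 of compute per request.
arr = annual_recurring_revenue(paying_users=500_000, monthly_price=20.0)
margin = compute_adjusted_margin(
    arr,
    paying_users=500_000,
    interactions_per_user_per_month=300,
    cost_per_interaction=0.02,
)
print(f"ARR: ${arr:,.0f}")
print(f"margin after compute: {margin:.0%}")
```

Under these made-up numbers the product clears $100 million ARR, yet heavier usage or a higher per-request cost shrinks the margin directly, which is the balance between growth and processing cost that the summary describes.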
The "AI 100" List Opens for Applications: The AI Product "Annual Gala" Must Go On | QbitAI Think Tank
量子位· 2026-01-03 07:16
Core Insights
- The article discusses the emergence of numerous keywords in the AI product sector by 2025, highlighting transformative AI products that are leading the market [4].
- The "AI 100" list by QbitAI Think Tank aims to evaluate and recognize the top AI products in China, reflecting the industry's evolution and future trends [4][12].
Group 1: AI 100 List Overview
- The "AI 100" list is divided into three main categories: "Flagship AI 100," "Innovative AI 100," and the top three products in ten popular sub-sectors [6].
- The "Flagship AI 100" will focus on the strongest AI products of 2025, showcasing those that have achieved significant technological breakthroughs and practical application value [7].
- The "Innovative AI 100" aims to identify products that are expected to emerge in 2026, representing cutting-edge AI technology and potential industry disruptors [8].
Group 2: Sub-sector Focus
- The ten sub-sectors for the top three products include AI Browser, AI Agent, AI Smart Assistant, AI Workbench, AI Creation, AI Education, AI Healthcare, AI Entertainment, Vibe Coding, and AI Consumer Hardware [9].
Group 3: Application and Evaluation Process
- The application period for the "AI 100" list is from now until January 15, 2026, with the results to be published in mid to late January 2026 [10].
- The evaluation system combines quantitative and qualitative assessments, focusing on user data and expert evaluations to ensure objectivity and accuracy [13].
Robots Fear Pain Too! CityU's Breakthrough Electronic Skin: Active Pain Sensing Plus Damage Self-Checking, Both Maxed Out
量子位· 2026-01-03 07:16
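henry, reporting from Aofeisi (凹非寺)
量子位 | WeChat official account QbitAI
Now, when you hit a humanoid robot, it really will feel "pain".
A research team from City University of Hong Kong has proposed a new neuromorphic robotic electronic skin (neuromorphic RE-skin, NRE-skin).
By mimicking the human nervous system with a hierarchical neuromorphic architecture, NRE-skin no longer needs to send tactile signals to a central processor; preliminary processing and pulse encoding are completed inside the skin itself.
Building on this bio-inspired design, NRE-skin achieves three key capabilities at once:
- High-resolution tactile sensing: efficiently captures and encodes precise pressure and position information.
- Active protection: a local reflex mechanism enables active pain perception and damage detection.
- Efficient maintenance: a modular quick-release structure supports fast replacement.
Netizens say this kind of complex, fine-grained tactile perception will bring a huge leap for the field of robotics.
The research will also undoubtedly offer new ideas for later tactile-feedback algorithms and hardware design.
Let's take a closer look.
Following this approach, NRE-skin implements a "sensor as neuron" design at the hardware level: each pressure sensor is integrated directly with a miniature oscillator circuit.
When the skin senses pressure, the change in the sensor's resistance immediately modulates the oscillator circuit, changing the frequency of the pulse signal it outputs.
Specifically, the greater the pressure ...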
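The "sensor as neuron" encoding can be illustrated with a toy model in which pressure sets a pulse frequency and a local threshold triggers a reflex without consulting a central processor. This is a hedged sketch only: the excerpt is cut off before it states how frequency depends on pressure, so the increasing linear map, the units, the constants, and the threshold-based reflex are all hypothetical choices made for illustration, not values or mechanisms confirmed by the article.

```python
def pulse_frequency_hz(
    pressure_kpa: float, base_hz: float = 10.0, gain_hz_per_kpa: float = 2.0
) -> float:
    """Map pressure to an oscillator pulse frequency.

    The linear, increasing relation and both constants are assumptions;
    the source only says that pressure changes the output frequency.
    """
    return base_hz + gain_hz_per_kpa * max(pressure_kpa, 0.0)


def local_reflex(pressure_kpa: float, pain_threshold_hz: float = 150.0) -> bool:
    """Decide locally, from the pulse frequency alone, whether to trigger a reflex.

    The threshold rule is a hypothetical stand-in for the skin's local reflex.
    """
    return pulse_frequency_hz(pressure_kpa) >= pain_threshold_hz


# A light touch versus a hard impact (made-up pressures).
for pressure in (5.0, 80.0):
    print(pressure, "kPa ->", pulse_frequency_hz(pressure), "Hz, reflex:", local_reflex(pressure))
```

Encoding the stimulus as a frequency means the decision can be made by comparing pulse rates at the sensor site, which matches the article's description of preliminary processing happening inside the skin.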
QbitAI Is Hiring Editors and Writers
量子位· 2026-01-03 07:16
Core Viewpoint
- The article emphasizes the ongoing AI boom and invites individuals to join QbitAI (量子位), which focuses on tracking AI advancements and has established itself as a leading content platform in the industry [1].
Group 1: Job Opportunities
- The company is hiring for three main directions: AI Industry, AI Finance, and AI Product, with positions available for both experienced professionals and fresh graduates [2][4].
- Positions are open for various levels, including editors, lead writers, and chief editors, with a focus on matching roles to individual capabilities [6].
Group 2: Job Responsibilities
- **AI Industry Direction**: Responsibilities include tracking innovations in infrastructure, such as chips, AI infrastructure, and cloud computing, as well as producing accessible reports on technical conferences and papers [6][7].
- **AI Finance Direction**: Focuses on venture capital, financial reports, and analyzing capital movements within the AI industry, including interviews with investors and entrepreneurs [11].
- **AI Product Direction**: Involves monitoring AI applications and hardware developments, writing in-depth product evaluations, and engaging with product experts [11].
Group 3: Benefits and Work Environment
- Employees can expect a vibrant team atmosphere, opportunities for personal influence through original content creation, and professional mentorship from senior editors [6][11].
- The company offers competitive salaries and comprehensive benefits, including social insurance, meal allowances, and performance bonuses [6].
Group 4: Company Growth and Reach
- By 2025, QbitAI aims to have over 2.4 million subscribers on WeChat and more than 7 million users across platforms, with a daily reading volume exceeding 2 million [12].
- The company is recognized as the top new media outlet in the AI and frontier technology sectors according to third-party data platforms [12].