Workflow
量子位
icon
Search documents
量子位编辑作者招聘
量子位· 2026-01-01 02:13
Core Viewpoint - The article emphasizes the ongoing AI boom and invites individuals to join the company "Quantum Bit," which focuses on tracking AI advancements and has established itself as a leading content platform in the industry [1]. Group 1: Job Opportunities - The company is hiring for three main directions: AI Industry, AI Finance, and AI Product, with positions available for both experienced professionals and fresh graduates [2][4]. - Positions are open for various levels, including editors, lead writers, and chief editors, with a focus on matching roles to individual capabilities [6]. Group 2: Job Responsibilities - **AI Industry Direction**: Responsibilities include tracking innovations in infrastructure, such as chips, AI infrastructure, and cloud computing, as well as interpreting technical reports from conferences [6][7]. - **AI Finance Direction**: Focuses on venture capital, financial reports, and capital movements within the AI industry, requiring strong analytical skills and a passion for interviews [11]. - **AI Product Direction**: Involves monitoring AI applications and hardware developments, producing in-depth evaluations of AI products, and engaging with industry experts [11]. Group 3: Benefits and Work Environment - Employees will have the opportunity to engage with cutting-edge AI technologies, enhance their work efficiency, and build personal influence through original content creation [6]. - The company offers competitive salaries, comprehensive benefits including social insurance, meal allowances, and performance bonuses, along with a dynamic and open team culture [6][11]. Group 4: Company Growth and Reach - By 2025, Quantum Bit aims to have over 2.4 million subscribers on WeChat and more than 7 million users across platforms, with a daily reading volume exceeding 2 million [12].
豆包一声声“OK”把罗永浩搞破防,不就是大型现场直播版图灵测试
量子位· 2026-01-01 02:13
Core Viewpoint - The annual technology innovation sharing conference hosted by Luo Yonghao has become a viral sensation, primarily due to two key events: the announcement of ticket refunds for all attendees and a lively debate between Luo and the AI assistant Doubao, which showcased the capabilities of real-time interactive AI [1][2][3]. Group 1 - Luo Yonghao announced that all ticket holders would receive refunds, which sparked significant discussion [2]. - The debate between Luo and Doubao became the highlight of the event, drawing attention for its engaging and humorous exchanges [3][8]. - The debate served as a public test of Doubao's real-time interactive AI capabilities, demonstrating its ability to engage in complex discussions [11][34]. Group 2 - Doubao's performance was characterized by rapid responses and the ability to maintain a coherent argument, showcasing its advanced understanding of context and logic [13][25]. - The debate highlighted Doubao's improvements in emotional intelligence and its ability to adjust responses based on the conversation's tone [32][36]. - The event marked a significant milestone in AI development, indicating that real-time interactive AI has reached a stage suitable for practical applications [34][38]. Group 3 - Doubao's capabilities were enhanced through multiple iterations of its underlying model, focusing on real-time interaction, human-like responses, and adherence to user instructions [30][32]. - The debate illustrated a shift in AI from being a passive tool to an interactive partner capable of complex dialogue [35][36]. - The implications of this technology extend to various fields, including customer service, education, and personal assistance, where AI can handle more nuanced interactions [38].
LeCun预言成真?这有一份通往AGI的硬核路线图:从BERT到Genie,在掩码范式的视角下一步步构建真正的世界模型
量子位· 2026-01-01 02:13
Core Viewpoint - The article discusses the emergence of World Models in AI, emphasizing the importance of Masking as a foundational principle for building these models, which are seen as essential for achieving Artificial General Intelligence (AGI) [1][3][5]. Group 1: Definition and Components of World Models - The true World Model is defined as an organic system composed of three core subsystems: a Generative Heart, an Interactive Loop, and a Memory System [6][8]. - The Generative Heart ($G$) predicts future states and simulates world dynamics, while the Interactive Loop ($F,C$) allows for real-time interaction and decision-making [8]. - The Memory System ($M$) ensures continuity over time, preventing the world from becoming a series of fragmented experiences [8][9]. Group 2: Evolution of World Models - The evolution of World Models is categorized into five stages, with Masking being the central theme throughout these stages [10][12]. - Stage I focuses on Mask-based Models, highlighting Masking as a universal generative principle rather than just a pre-training technique [13][24]. - Stage II aims for Unified Models that process and generate all modalities under a single architecture, with a debate between Language-Prior and Visual-Prior modeling approaches [25][26]. Group 3: Interactive Generative Models - Stage III introduces Interactive Generative Models, where models respond to user actions, transforming from mere simulators to interactive environments [36][40]. - The Genie series, particularly Genie-3, represents the state-of-the-art in real-time interactive models, achieving 720p resolution and 24fps frame rates [41][42]. Group 4: Memory and Consistency - Stage IV addresses Memory & Consistency, focusing on the need for persistent memory to prevent catastrophic forgetting and state drift in generated worlds [46][48]. - Solutions proposed include Externalized Memory, architecture-level persistence, and consistency governance to maintain coherence in generated environments [49][50]. Group 5: Ultimate Form of World Models - Stage V envisions True World Models that exhibit persistence, agency, and emergence, allowing for complex interactions and societal dynamics within the simulated world [51][52]. - The article concludes with the challenges of coherence, compression, and alignment that must be addressed to realize these advanced models [58].
Hinton加入Scaling Law论战,他不站学生Ilya
量子位· 2026-01-01 02:13
Core Viewpoint - The article discusses the ongoing debate surrounding the "Scaling Law" in AI, highlighting contrasting perspectives from key figures in the field, particularly Ilya Sutskever and Geoffrey Hinton, regarding the future and limitations of scaling AI models [1][8][21]. Group 1: Perspectives on Scaling Law - Ilya Sutskever expresses skepticism about the continued effectiveness of Scaling Law, suggesting that merely increasing model size may not yield significant improvements in AI performance [23][40]. - Geoffrey Hinton, on the other hand, maintains that Scaling Laws are still valid but face challenges, particularly due to data scarcity, which he believes can be addressed by AI generating its own training data [10][21]. - Demis Hassabis, CEO of DeepMind, supports Hinton's view, emphasizing the importance of scaling for achieving advanced AI systems and the potential for self-evolving AI through data generation [15][19]. Group 2: The Debate on Data and Model Scaling - The article outlines the historical context of Scaling Law, which posits that increasing model parameters, training data, and computational resources leads to predictable improvements in AI performance [26][27]. - Recent discussions have shifted towards concerns about data limitations, with Ilya arguing that the era of pre-training is coming to an end due to diminishing returns from scaling [32][41]. - Yann LeCun also shares skepticism about the assumption that more data and computational power will automatically lead to smarter AI, indicating a broader questioning of the Scaling Law's applicability [46][48]. Group 3: Future Directions and Research Focus - The article suggests that while current paradigms may still yield significant economic and social impacts, achieving Artificial General Intelligence (AGI) or Artificial Superintelligence (ASI) will likely require further research breakthroughs [53]. - There is a consensus among leading researchers that while AGI is not a distant fantasy, the nature and speed of necessary breakthroughs remain uncertain [53].
端侧翻译新标杆:腾讯混元1.5开源,1.8B模型离线运行,效果超主流商用API
量子位· 2025-12-31 11:11
混元团队 投稿 量子位 | 公众号 QbitAI 在语言模型的比拼中,机器翻译一直被视为检验机器理解复杂语义和跨文化对齐能力的"试金石"。 面向端侧场景,12月30日,腾讯混元宣布推出并开源翻译模型1.5,经过量化,可支持端侧直接部署和离线实时翻译,仅需1GB内存即可流 畅运行,并且在参数量极小的前提下,效果超过了大部分商用翻译API。 在常用的中外互译和英外互译测试集Flores200、WMT25以及民汉语言的测试集中,Tencent-HY-MT1.5-1.8B全面超越中等尺寸开源模型 和主流商用翻译API,达到Gemini-3.0-Pro这种超大尺寸闭源模型的90分位水平。在WMT25和民汉翻译测试集上,其效果仅略微差于 Gemini-3.0-Pro,远超其他模型。 模型在效率和性价比也表现突出,与主流商用翻译模型API对比,HY-MT1.5-1.8B推理速度更快,处理50个tokens的平均耗时只有0.18 秒,其他模型的时间在0.4秒左右,显示出明显的速度优势,凭借优化的模型设计和推理逻辑,其领先的效率使其高度适用于即时通讯、智 能客服、移动翻译应用等高吞吐、实时翻译场景。 在大模型厂商纷纷角逐手机等 ...
董事长稚晖君发布上纬新材首款机器人!能塞书包还能骑机器狗
量子位· 2025-12-31 11:11
henry 发自 凹非寺 量子位 | 公众号 QbitAI 2025年的最后一天,上市公司上纬新材董事长 彭志辉 (稚晖君)发布了一款能装进书包的机器人产品—— 上纬启元Q1 。 这是全球首款最小尺寸(0.8m)、实现全身力控的人形机器人,也是智元机器人联合创始人 稚晖君 担任上纬新材董事长以来,发布的首款具 身智能机器人产品。 虽然体型迷你,但大机器人能做的,启元Q1也能做。 大机器人做不了的,启元Q1还能做。 (我骑过狗你骑过吗?) 而前段时间让网友猜疯了的 "大有可为" 神秘海报,也终于在这次的发布视频中正式揭晓答案。 其中醒目的1.88,既不是身高,也不是售价,而是启元Q1的体积(立方米)——一个被压缩到背包级的人形机器人尺寸。 启元Q1是一款怎样的机器人? 从产品定位上看,稚晖君这次的新作 启元Q1 ,是一款面向个人用户、开发者,科研、陪伴、创作场景的小尺寸人形机器人。 值得一提的是,这种小型化设计,并不只是为了方便携带。更轻的重量,让机器人本身更耐造,也把使用和试错成本一起打了下来,更适合个 人和小团队反复折腾。 在产品能力上,启元Q1反复强调了一个关键词—— 全身力控 。 简单来说,全身力控并不 ...
马斯克买了新厂房上GPU,2GW供电规模,“巨硬”更更硬了
量子位· 2025-12-31 05:28
Core Viewpoint - Elon Musk's xAI is expanding its computing power through the "Macrohard" project, with the acquisition of a third facility named MACROHARD RR, which will have a power supply capacity of 2GW [1][2]. Group 1: Facility Expansion - The new facility MACROHARD RR is located near the existing Colossus II site, which is part of the Macrohard project [15][16]. - Colossus I, the first facility, was built in just 122 days and is currently the largest and most stable computing cluster globally, equipped with approximately 200,000 NVIDIA H100/H200 and 30,000 NVIDIA GB200 NVL72 GPUs [6][7]. - Colossus II is set to deploy 110,000 NVIDIA GB200 GPUs in its first phase, with a final goal of over 550,000 GPUs and a peak power demand exceeding 1.1GW [11]. Group 2: Power Supply and Infrastructure - The 2GW power capacity of the new facility can support around 1.1 million NVIDIA GB200 NVL72 GPUs, based on previous power density and efficiency metrics [2][4]. - xAI has partnered with Solaris Energy Infrastructure to build a permanent gas turbine power plant in Mississippi, which is expected to provide over 1GW of power by early 2027 [18][20]. - To mitigate noise complaints from nearby residents, xAI has constructed a wall between the power plant site and residential areas and deployed 168 Tesla Megapack battery storage systems to support local power needs during peak usage [20]. Group 3: Financial Aspects - xAI is reportedly planning to raise $15 billion at a valuation of $230 billion to support its expansion efforts [22]. - Musk has denied reports regarding the fundraising but has not provided further clarification [23].
黄仁勋「收购式」抢人继续:20多亿美金“买走”Mobileye创始人AI新团队
量子位· 2025-12-31 05:28
Group 1 - Nvidia is reportedly planning to acquire Israeli AI startup AI21 Labs for $2-3 billion to recruit over 200 top AI talents [1][2][40] - AI21 Labs, founded in 2017, specializes in developing large language models and has a strong founding team with notable backgrounds in AI and technology [3][7][27] - The company's valuation in 2023 is approximately $1.4 billion, with a recent funding round led by Nvidia and Google raising $300 million [4][6] Group 2 - The acquisition reflects Nvidia's strategy of "talent acquisition" rather than traditional business mergers, allowing it to bypass strict regulations on business monopolies [41][42] - AI21 Labs has developed its own models, including the Jurassic series and the recently launched Jamba, which is an open-source large model [28][29] - The partnership between Nvidia and AI21 Labs aims to combine Nvidia's computational infrastructure with AI21's application solutions, enhancing enterprise-level generative AI deployment [36][50] Group 3 - Nvidia's recent acquisitions, including Groq, demonstrate a pattern of acquiring companies primarily for their talent rather than their technology [43][45] - The acquisition of AI21 Labs is expected to further solidify Nvidia's strategic position in Israel, integrating hardware, software, and AI applications [50][51] - Nvidia's ambition extends beyond being a chip company, as it seeks to control the entire AI industry chain through strategic acquisitions [52][54]
MiniMax作价461亿港元募资46亿,1月9日敲钟代码00100
量子位· 2025-12-31 05:28
Core Viewpoint - MiniMax, a Chinese AI company, is set to go public with an IPO aiming to raise over $600 million, valuing the company at over HKD 46.1 billion, and is expected to list on January 9, 2026 [2][7]. Group 1: Company Overview - MiniMax is positioned as a global artificial general intelligence (AGI) technology company, with services covering over 200 countries and regions, and 70% of its revenue coming from international operations [12]. - The company has a strong backing from 14 cornerstone investors, including Alibaba and the Abu Dhabi Investment Authority, with total subscriptions amounting to approximately HKD 27.23 billion [7][8]. Group 2: Market Context - December 2025 marks a significant period for IPOs in Hong Kong, with 25 companies having completed listings, making it the busiest month since 2019 [9]. - MiniMax and another company, Zhiyuan, are both entering the market around the same time, creating a competitive atmosphere that splits investor attention [10]. Group 3: Financial Performance - MiniMax's revenue has shown remarkable growth, reaching $3.46 million in 2023 and projected to soar to $30.52 million in 2024, representing a year-on-year increase of 782.2% [35]. - For the first nine months of 2025, revenue surged by 175% to $53.44 million, significantly surpassing the previous year's total [36]. - The company has improved its gross margin from -24.7% in 2023 to 23.3% in the first nine months of 2025, indicating a positive trend in profitability [38]. Group 4: Product Development - MiniMax has released several models, including the M1 and M2 text models, with M2 achieving top rankings in performance metrics [20][21]. - The company has also developed a voice model, Speech 01, and its upgraded version, Speech 02, which supports over 40 languages and has generated over 2.2 million hours of speech [24]. - MiniMax's video model, Hailuo, has been recognized for its capabilities in generating videos and has helped create over 590 million videos globally [28]. Group 5: Investment and Support - MiniMax has raised over $1.5 billion in funding from various strategic investors, including major tech companies and venture capital firms, positioning it as a leading player in the AGI space [50]. - The company has a cash reserve of $1.102 billion as of September 30, 2025, which is sufficient to sustain operations for over 53 months without additional funding [46].
AI终于学会在家“伺候人”!Hey Tuya,我躺了
量子位· 2025-12-31 03:37
Core Viewpoint - The article discusses the emergence of "Hey Tuya," an AI life assistant developed by Tuya Smart, which integrates software and hardware to create a seamless smart home experience, moving beyond simple command responses to proactive engagement in daily life [6][46]. Group 1: AI Life Assistant Features - "Hey Tuya" allows users to control smart home devices through a unified interface, enabling actions like adjusting lighting and temperature with simple voice commands [8][16]. - The assistant can manage home security by providing real-time updates and alerts, such as recognizing delivery personnel through camera feeds [23][24]. - It offers energy management solutions, allowing users to set automated energy-saving strategies for their devices [30][31]. Group 2: User Interaction and Personalization - The AI can create to-do lists and reminders based on natural language conversations, adapting to users' habits and schedules [32][35]. - "Hey Tuya" can simulate a user's voice in meetings, recording discussions and generating structured meeting notes [37][38]. - It provides nutritional analysis of meals through image recognition, helping users track their dietary habits [40][42]. Group 3: Technological Infrastructure - The underlying architecture of "Hey Tuya" is based on Tuya's Physical AI Engine (PAE), which enables real-time collaboration between AI and physical devices [46][47]. - PAE includes three core engines: Conversational AI Engine, Vision AI Engine, and IoT Intelligence Engine, facilitating complex interactions across various environments [50][51]. - The system employs a long-term memory mechanism to learn user preferences and behaviors over time, enhancing its responsiveness [53][54]. Group 4: Company Background and Market Position - Tuya Smart, founded in June 2014, has evolved from a device connectivity platform to a leading AI cloud service provider, focusing on AIoT integration [56][59]. - As of September 30, 2025, Tuya has over 1.622 million registered developers across more than 200 countries, showcasing its extensive global reach [60]. - The company's AIoT ecosystem encompasses over 3,000 product series across eight categories, positioning it as a key player in the smart home and IoT market [61][62].