Workflow
量子位
icon
Search documents
离开马斯克后,他把人形机器人做成了这样
量子位· 2026-01-10 06:36
允中 发自 凹非寺 量子位 | 公众号 QbitAI 如果你对人形机器人的印象,还停留在——走两步就摔、抓东西像戴着拳击手套、干活前得先写一堆脚本…… 那么 MATRIX-3 的出现,可能要强行带你"翻篇"了。 显然想通过从底层算法到顶层应用的系统性重构,让机器人走得更远: 进工厂,飞入寻常百姓家。 作为一款主打 安全、自主、可泛化 的物理智能机器人,它更敢跟人待在同一个空间,更能自己做判断,也更不怕换任务、换环境。 做出这台机器人的,是一家去年才正式走到台前的公司—— 矩阵超智 。 能干的活更像人,目标也不止于专业场景"打工",而是开始往日常生活里迈。 但底子不轻、来头不算低调:公司团队背景横跨 特斯拉、英伟达、OpenAI 等顶级技术体系,目标也非常直给: AGI路线上的通用人形机器人。 可以说,一年前,MATRIX-1亮相时,外界更关注两点:全身复合材料带来的"观感完成度",以及实时语音对话的交互感。 但这次,创始人 张海星 ——这位有着30年消费电子实战经验的"老极客",2021年加入特斯拉, 参与Optimus人形机器人开发,并主导特斯 拉中国设计中心相关项目 —— △ 矩阵超智创始人兼CEO张海星 ...
量子位编辑作者招聘
量子位· 2026-01-10 03:07
Core Viewpoint - The article emphasizes the ongoing AI boom and invites individuals to join the company "Quantum Bit," which focuses on tracking AI advancements and has established itself as a leading content platform in the industry [1]. Group 1: Job Opportunities - The company is hiring for three main directions: AI Industry, AI Finance, and AI Product, with positions available for both experienced professionals and fresh graduates [2][4]. - Positions are open for various levels, including editors, lead writers, and chief editors, with a focus on matching roles to individual capabilities [6]. Group 2: Job Responsibilities - **AI Industry Direction**: Responsibilities include tracking innovations in infrastructure, such as chips, AI infrastructure, and cloud computing, as well as producing accessible reports on technical conferences and papers [6][7]. - **AI Finance Direction**: Focuses on venture capital, financial reports, and analyzing capital movements within the AI industry, including interviews with investors and entrepreneurs [11]. - **AI Product Direction**: Involves monitoring AI applications and hardware developments, writing in-depth product evaluations, and engaging with product experts [11]. Group 3: Benefits and Work Environment - Employees will have the opportunity to engage with cutting-edge AI technologies, enhance their work efficiency through new tools, and build personal influence in the AI field [6]. - The company offers competitive salaries, comprehensive benefits including social insurance, meal allowances, and performance bonuses, and promotes a dynamic and open work culture [6]. Group 4: Company Growth and Reach - By 2025, Quantum Bit aims to have over 2.4 million subscribers on WeChat and more than 7 million users across platforms, with a daily reading volume exceeding 2 million [12]. - The company is recognized as the top new media outlet in the AI and frontier technology sectors according to third-party data platforms [12].
「AI 100」榜单启动招募,AI产品“年会”不能停丨量子位智库
量子位· 2026-01-10 03:07
Core Insights - The article discusses the emergence of numerous keywords in the AI product sector by 2025, highlighting transformative AI products that are leading the market [4] - The "AI 100" list by Quantum Bit Think Tank aims to evaluate and recognize the top AI products in China, reflecting the industry's evolution and future trends [4][12] Group 1: AI 100 List Overview - The "AI 100" list is divided into three main categories: "Flagship AI 100," "Innovative AI 100," and the top three products in ten popular sub-sectors [6] - The "Flagship AI 100" will focus on the strongest AI products of 2025, showcasing those that have achieved significant technological breakthroughs and practical application value [7] - The "Innovative AI 100" aims to identify products that are expected to emerge in 2026, representing cutting-edge AI technology and potential industry disruptors [8] Group 2: Sub-sector Focus - The ten hottest sub-sectors for the top three products include AI browsers, AI agents, AI smart assistants, AI workstations, AI creation, AI education, AI healthcare, AI entertainment, Vibe Coding, and AI consumer hardware [9] Group 3: Application and Evaluation Criteria - The evaluation of the "AI 100" list employs a dual assessment system combining quantitative and qualitative measures, focusing on user data and expert evaluations [13] - Quantitative metrics include user scale, growth, activity, and retention, while qualitative assessments consider long-term potential, technology, market space, and user experience [13]
吴恩达:图灵测试不够用了,我会设计一个AGI专用版
量子位· 2026-01-10 03:07
Core Viewpoint - The article discusses Andrew Ng's announcement of a new Turing test, termed the Turing-AGI test, aimed at evaluating Artificial General Intelligence (AGI) capabilities in a more practical and economically relevant manner [1][8][30]. Group 1: Turing-AGI Test Concept - The Turing-AGI test is designed specifically for AGI, addressing the inadequacies of the traditional Turing test which primarily focused on human-machine dialogue [2][10]. - The new test aims to measure AI's ability to perform knowledge-based work tasks, reflecting a more comprehensive definition of intelligence [14][19]. - Participants in the test will include AI systems or professionals, who will be tasked with real-world scenarios, such as customer service, requiring them to provide ongoing feedback [15][17]. Group 2: Industry Context and Trends - 2025 is anticipated to mark the beginning of the AI industrial era, with significant advancements in model performance and AI-driven applications becoming essential [4][5]. - The competition for top talent in the AI sector is intensifying, driven by the rapid development of AGI concepts in both academia and industry [6][5]. - Current benchmark tests often mislead the public by overestimating AI capabilities, as they are based on predetermined test sets that do not reflect real-world performance [7][20][21]. Group 3: Implications of the Turing-AGI Test - The Turing-AGI test will allow judges to create arbitrary tasks, enhancing the assessment of AI's general capabilities compared to fixed benchmark tests [28]. - Ng suggests that hosting a Turing-AGI test could help calibrate societal expectations of AI, potentially reducing hype around AGI while focusing on practical advancements [29][30]. - The test could set clear goals for AI teams, moving away from vague aspirations of achieving human-level intelligence [31].
Hinton的亿万富豪博士生
量子位· 2026-01-10 03:07
Core Viewpoint - The article discusses the legacy and influence of Geoffrey Hinton in the AI field, highlighting his contributions and the success of his first PhD student, Peter Brown, who became a prominent figure in quantitative finance [1][8][14]. Group 1: Hinton's Influence and Legacy - Hinton is recognized as a pivotal figure in the development of neural networks, which have become foundational in AI, particularly in deep learning [4][8]. - The 1986 photo from the first connectionist summer school at CMU features Hinton alongside other influential figures in AI, showcasing the early community that would shape the future of technology [2][4]. - Hinton's commitment to his research and his reluctance to leverage his connections for personal gain reflect his integrity and dedication to the field [9][10]. Group 2: Peter Brown's Journey - Peter Brown, Hinton's first PhD student, transitioned from AI research to become the CEO of Renaissance Technologies, a leading quantitative hedge fund [5][14]. - Brown's early work in speech recognition laid the groundwork for modern statistical models in the field, influencing decades of research [23][25]. - His decision to join Renaissance Technologies was driven by financial necessity, highlighting the intersection of personal circumstances and career choices [31][33]. Group 3: Renaissance Technologies - Renaissance Technologies is known for its high returns, particularly through its Medallion Fund, which achieved an annualized return of over 66% from 1988 to 2019 [38]. - The firm's success is attributed to its reliance on data-driven, quantitative trading strategies developed by mathematicians and computer scientists [39][40]. - Brown's leadership and work ethic, including a commitment to long hours, have been crucial to the firm's performance and his personal wealth accumulation [42][43].
智能体「卷王」诞生!干活自动配结项报告,1.5张截图就把事说清了
量子位· 2026-01-10 03:07
Core Insights - The article discusses the concept of SmartSnap, which transforms GUI agents from passive executors to proactive self-verifiers, enabling them to collect evidence while completing tasks [7][12]. Group 1: Challenges in Current AI Verification - A significant challenge in LLM/VLM-driven agents is the uncertainty of task completion quality after execution [2]. - Existing verification methods require complex manual checks and robust trajectory-level validation, which can be inefficient and contextually noisy [4][5]. - These methods depend on continuous observable feedback, which can fail due to environmental changes [6]. Group 2: SmartSnap Overview - SmartSnap allows agents to actively collect and submit a "snapshot of evidence" while performing tasks, akin to a project completion report [8][9]. - The approach aims to reduce the verification burden on external validators by enabling agents to self-verify their actions [6][19]. Group 3: Key Innovations - SmartSnap introduces a dual mission for agents: executing tasks and self-verifying their completion [11][12]. - The 3C principle (Completeness, Conciseness, Creativity) is established to ensure evidence quality without overwhelming validators [15]. - The training utilizes the GRPO algorithm with intrinsic reward shaping to enhance evidence quality while minimizing reward hacking [14]. Group 4: Performance Improvements - SmartSnap has shown significant performance improvements across various models, with the highest increase reaching 26.08% [17]. - The average task now requires only 1.5 evidence snapshots, greatly reducing validation costs [18]. - Agents trained with SmartSnap demonstrate improved interaction efficiency, leading to fewer interaction rounds [18]. Group 5: Future Implications - The emergence of SmartSnap signifies a shift from brute-force execution to cognitive collaboration in GUI agents, enhancing AI reliability and paving the way for large-scale, low-cost AI deployment [21]. - Future AI systems must not only be capable but also trustworthy, emphasizing the importance of self-verification capabilities [22].
蚂蚁再把医疗AI卷出新高度!蚂蚁·安诊儿医疗大模型开源即SOTA
量子位· 2026-01-09 06:05
Core Viewpoint - AntAngelMed, a medical AI model developed by Ant Group in collaboration with Zhejiang Provincial Health Information Center and Zhejiang Anzhener Medical AI Technology Co., has emerged as a significant player in the healthcare AI sector, achieving top rankings in multiple medical benchmark tests [2][3][12]. Group 1: Model Performance and Rankings - AntAngelMed has achieved the highest score in the HealthBench evaluation, surpassing models like Baichuan-M2 and gpt-oss-120B with a score of 62.5 [4][15]. - It also topped the HealthBench-Hard subset, breaking the 32-point barrier that many models struggled with, demonstrating its robustness in complex medical scenarios [16][17]. - In the MedAIBench evaluation, AntAngelMed excelled in medical knowledge Q&A and ethical safety dimensions, indicating a well-rounded capability across various medical fields [19]. - The model ranked first in the MedBench assessment, which focuses on Chinese medical scenarios, showcasing its adaptability to local healthcare needs [21]. Group 2: Model Architecture and Training - AntAngelMed is the largest open-source medical model to date, with 100 billion parameters, designed for real-world medical applications [6][12]. - The model employs a three-stage training process, including continuous pre-training with clinical guidelines, supervised fine-tuning for real-world applications, and GRPO reinforcement learning for enhanced task handling [43][45][48]. - The architecture is based on the efficient mixture of experts (MoE) framework, allowing for significant improvements in efficiency and performance compared to traditional dense architectures [51][52]. Group 3: User Interaction and Application - AntAngelMed demonstrates a high level of user interaction, providing quick and empathetic responses to medical inquiries, akin to a personal physician [23][41]. - The model effectively explains complex medical terms and offers tailored advice based on user symptoms, enhancing the patient experience [31][36]. - It is designed to integrate seamlessly into clinical workflows, making it suitable for deployment in small to medium-sized healthcare institutions [7][21]. Group 4: Strategic Positioning and Future Outlook - Ant Group's investment in AntAngelMed reflects its commitment to the healthcare AI sector, positioning it as a core business alongside its other financial services [66][68]. - The company aims to bridge the gap between general AI models and specialized medical applications, leveraging its extensive experience in payment and insurance data to enhance AI capabilities in healthcare [75][76]. - AntAngelMed is seen as a foundational model that will support the scalable implementation of AI in professional medical settings, addressing industry pain points effectively [56][59].
让世界模型推理效率提升70倍:上海AI Lab用“恒算力”破解长时记忆与交互瓶颈
量子位· 2026-01-09 04:09
Core Insights - The article discusses the transition of generative AI from static images to dynamic videos, emphasizing the importance of building a "world model" that understands physical laws, possesses long-term memory, and supports real-time interaction as a pathway to achieving Artificial General Intelligence (AGI) [3]. Group 1: Yume Project Overview - The Yume project, developed by Shanghai AI Lab in collaboration with several top institutions, has released Yume1.0 and Yume1.5, which are the first fully open-source world models aimed at real-world applications [3][4]. - Yume1.5 introduces a core architectural innovation called Time-Space Channel Modeling (TSCM), which addresses the memory bottleneck in long video generation [4][11]. Group 2: Technical Innovations - TSCM employs a unified context compression and linear attention mechanism to solve the memory challenges associated with long video generation [5]. - The framework integrates long-term memory, real-time reasoning, and "text + keyboard" interaction control into a single system, demonstrating a feasible path for engineering world models [2]. Group 3: Data Utilization - Yume utilizes the Sekai dataset, which includes high-quality first-person (POV) video data covering 750 cities and totaling 5000 hours [8]. - Yume1.5 also incorporates a high-quality T2V synthesis dataset and a specialized event dataset for generating events like "sudden ghost appearances" [10]. Group 4: TSCM Mechanism - TSCM's compression mechanism includes two parallel streams: time-space compression and channel compression, effectively reducing the number of tokens processed [16]. - Time-space compression retains visual details by downsampling historical frames, while channel compression reduces the channel dimension to enhance processing efficiency [19][23]. Group 5: Performance Evaluation - Yume1.5 achieved an instruction-following (IF) score of 0.836, demonstrating the effectiveness of its control methods, and reduced generation time from 572 seconds in Yume1.0 to just 8 seconds [29]. - An ablation study showed that removing TSCM and using simple spatial compression led to a decrease in instruction-following ability from 0.836 to 0.767, highlighting TSCM's significance [30][32]. Group 6: Future Prospects - The open-sourcing of Yume and its datasets is expected to accelerate research in world models, with the potential for the distinction between "real" and "generated" content to become increasingly blurred in the near future [38].
一口气集齐老黄苏妈英特尔,还得是AI,还得是联想
量子位· 2026-01-09 04:09
Core Viewpoint - The article discusses the emerging trend of AI hardware and the concept of "super entrance" in the tech industry, emphasizing that all devices will evolve into AI devices, marking a significant shift in technology at CES 2026 [1][6]. Group 1: AI Hardware Evolution - The CES 2026 showcased a consensus among manufacturers that all devices, including traditional smartphones and PCs, will adopt more intelligent forms [1]. - The emergence of new intelligent hardware species is increasingly diverse, indicating a shift in how AI is integrated into everyday devices [3]. Group 2: Super Entrance Concept - The "super entrance" concept refers to platforms that aggregate user traffic and connect various digital scenarios, similar to the role of super apps in the mobile internet era [7]. - The competition for "super entrance" is shifting from foundational technology to application layers and broader ecosystems, as seen in the AI landscape [9]. Group 3: Hybrid AI as the Ultimate Path - Lenovo's CEO proposed that integrating personal, enterprise, and public intelligence into a hybrid AI model is essential for creating personalized and diverse AI solutions [14][17]. - The hybrid AI model emphasizes the deep integration of cloud-based large models with localized customized small models to better meet user needs [18]. Group 4: Lenovo's Innovations - Lenovo introduced the world's first personal AI super intelligent agent, Lenovo Qira, which connects various devices and enhances task execution through cross-platform capabilities [20]. - The Qira agent can remember user preferences and interact in a personalized manner while ensuring privacy protection [22]. Group 5: Enterprise AI Solutions - Lenovo launched a series of AI inference servers aimed at improving efficiency and reducing operational costs for enterprises, adapting to diverse AI deployment needs [24]. - The collaboration with NVIDIA to establish an AI cloud super factory aims to expedite AI deployment for cloud service providers [25]. Group 6: Market Position and Future Outlook - Lenovo's AI-related business accounted for 30% of its total revenue, showing a 13% year-on-year growth, indicating a strong market position in both consumer and enterprise segments [34][35]. - The company aims to quadruple its business cooperation scale with NVIDIA over the next 3-4 years, highlighting its commitment to expanding its AI ecosystem [38].
「AI 100」榜单启动招募,AI产品“年会”不能停丨量子位智库
量子位· 2026-01-09 04:09
Core Insights - The article discusses the emergence of numerous keywords in the AI product sector in China by 2025, highlighting the rapid evolution and innovation in AI technologies [4] - The "AI 100" list by Quantum Bit Think Tank aims to evaluate and recognize the top AI products that represent China's AI capabilities [4][12] Group 1: AI 100 List Overview - The "AI 100" list is divided into three main categories: "Flagship AI 100," "Innovative AI 100," and the top three products in ten popular sub-sectors [6] - The "Flagship AI 100" will focus on the strongest AI products of 2025, showcasing those that have achieved significant technological breakthroughs and practical application value [7] - The "Innovative AI 100" aims to identify products that are expected to emerge in 2025 and have the potential to lead industry changes in 2026 [8] Group 2: Sub-sector Focus - The ten hottest sub-sectors for the top three products include AI browsers, AI agents, AI smart assistants, AI workstations, AI creation, AI education, AI healthcare, AI entertainment, Vibe Coding, and AI consumer hardware [9] Group 3: Application and Evaluation - The evaluation of the "AI 100" list employs a dual assessment system combining quantitative and qualitative measures, focusing on user data and expert evaluations [13] - Quantitative metrics include user scale, growth, activity, and retention, while qualitative assessments consider long-term potential, technology, market space, and user experience [13]