量子位
Search documents
中国团队首次在Nature子刊发布医疗AI标准,未来医生MedGPT摘得全球桂冠
量子位· 2026-01-21 04:09
衡宇 发自 凹非寺 量子位 | 公众号 QbitAI 中国团队首次在全球顶尖期刊发表"大模型+医疗"领域的相关标准研究! 作为Nature体系中专注于数字医疗的旗舰期刊, 《npj Digital Medicine》 (JCR影响因子15.1,中科院医学大类1区Top期刊) 此次收录 的CSEDB研究,首次提出了一套用于评估医疗大模型真实临床能力的系统性框架。 它由中国AI医疗公司"未来医生"协同32位来自北京协和医院、中国医学科学院肿瘤医院、北京大学口腔医院、中国医学科学院阜外医院、中国 人民解放军总医院、复旦大学附属华山医院、上海市同济医院等顶尖医疗机构的23个核心专科的一线临床专家共同制定。 从行业角度看,这项研究释放出了一个清晰的信号: 医疗AI的竞争,正在从能力展示阶段,正式进入责任定义阶段。 CSEDB全称为Clinical Safety-Effectiveness Dual-Track Benchmark (临床安全性与有效性双轨基准) , 它首次为评估医疗AI真实临床 能力建立了一个基于临床专家共识、覆盖全面风险维度,并将安全性与有效性分开考量的标准化基准。 通过公开实验,CSEDB直接给出了 ...
量子位编辑作者招聘
量子位· 2026-01-21 04:09
Core Viewpoint - The article emphasizes the ongoing AI boom and invites individuals to join the company "Quantum Bit," which focuses on tracking AI advancements and has established itself as a leading content platform in the industry [1]. Group 1: Job Opportunities - The company is hiring for three main directions: AI Industry, AI Finance, and AI Product, with positions available for both experienced professionals and fresh graduates [2][4]. - Positions are full-time and based in Beijing, with various levels of roles open for application [2][4]. Group 2: Job Responsibilities - **AI Industry Direction**: Focuses on innovations in infrastructure, including chips, AI infrastructure, and cloud computing [6]. - **AI Finance Direction**: Involves tracking venture capital and financial reports in the AI sector, monitoring capital movements within the industry [6]. - **AI Product Direction**: Concentrates on the application and hardware advancements in AI [6]. Group 3: Benefits and Growth Opportunities - Employees will have the chance to engage with the latest AI technologies, enhance their work efficiency through new AI tools, and build personal influence by creating original content [6]. - The company offers competitive salaries, comprehensive benefits including social insurance, meal allowances, project performance bonuses, and overtime compensation [6]. Group 4: Company Growth and Reach - By 2025, Quantum Bit aims to have over 2.4 million subscribers on WeChat and more than 7 million users across all platforms, with a daily reading volume exceeding 2 million [12]. - The company is recognized as the top new media outlet in the AI and frontier technology sector according to third-party data platforms [12].
马斯克罕见低头:开源𝕏推荐算法,自嘲“很烂”不过未来月更
量子位· 2026-01-21 04:09
我们移除了所有人工设计特征和绝大多数启发式规则。 消息一出,整个社区立刻沸腾了,最高赞上去就是一顿猛夸: incredible!没有其他平台能做到如此透明。 马斯克本人也火速转发了工程团队原帖,不过一向言辞高调的老马,此番却低调表示: 一水 发自 凹非寺 量子位 | 公众号 QbitAI 就现在,GitHub已经能完整看到马斯克开源的 推荐算法系统 了。 开源文件里明确表示,这是一个几乎完全由AI模型驱动的算法系统。 我们知道这个算法很蠢(dumb),需要大幅改进,但至少您可以实时、透明地看到我们为改进它而努力。 其他社交媒体公司都没有这样做。 早在2022年收购 (原Twitter) 之前,马斯克就多次批评该平台过于封闭。 自收购之后,他也兑现承诺多次公开Twitter核心推荐算法,这一次也算是不忘初心了。 原来纯AI驱动的推荐系统,是这样运作的! 话不多说,咱这就扒一扒整套系统的运作机制。 一句话概括这个系统即为: 基于Grok-1同款Transformer架构打造,能通过学习你的历史互动行为 (点赞/回复/转发过什么) ,来决定给你推荐什么内容。 从用户打开"For You"开始,客户端会向服务器发送一 ...
世界模型+强化学习=具身智能性能翻倍!清华&加州伯克利最新开源
量子位· 2026-01-21 04:09
BOOM团队 投稿 量子位 | 公众号 QbitAI 在具身智能 (Embodied AI) 的快速发展中, 样本效率 已成为制约智能体从实验室环境走向复杂开放世界的瓶颈问题。 不同于纯数字域的对话任务, 具身任务 通常涉及极度复杂的物理环境感知以及高维度的连续控制输出,这意味着智能体面临着巨大的状态- 动作搜索空间,导致学习效率低下且难以收敛。 传统的无模型强化学习由于缺乏对底层物理逻辑的理解,完全依赖于海量的盲目试错来获取学习信号。 然而,在现实物理世界中,每一次交互都伴随着不可忽视的时间损耗、高昂的硬件维护成本以及潜在的安全风险,这使得动辄数亿次的交互 需求变得极不现实。 在线规划能够让智能体在环境交互前通过模拟未来轨迹来优化动作,显著提升强化学习的样本效率。 为了应对这一挑战, 世界模型强化学习 (World Model RL) 研究应运而生。 其核心范式在于通过额外学习一个能够表征环境内在转移规律的预测模型,使智能体具备在想象空间中进行自我进化的能力。 这种机制允许智能体在潜空间内进行大规模、低成本的轨迹预演与策略优化,从而显著降低对环境交互的依赖,加速具身智能机器人的落地 应用。 在世界模型强化学 ...
2026年OpenAI最看好的3个方向
量子位· 2026-01-21 04:09
鹭羽 发自 凹非寺 量子位 | 公众号 QbitAI OpenAI最新播客释出—— 首席财务官Sarah Friar&著名投资人Vinod Khosla聚在一起,聊了聊 2026年的AI趋势 。 信息量很大,比如说明年将会是多智能体正式登场的一年、AI行业如何用算力换收入、大模型能力的上限突破,以及对医疗健康和具身智能行 业的变革影响…..应有尽有。 不过u1s1,此时公开这样一份访谈,其中意味不言而喻: 不仅是回应OpenAI近期的舆论,也是为投资者们打下定心剂, "AI不是泡沫,OpenAI值得投资" 。 潜台词就是,在为OpenAI即将到来的 IPO 铺路。而这也将是OpenAI2026年的重中之重。 如果说2025年AI发展围绕着Agent和Vibe Coding,那么2026年将会是多智能体系统走向成熟并产生实际影响的关键节点。 在企业层面,多智能体系统将能够处理一系列完整的复杂任务,比如运行企业资源规划系统 (ERP) 、日常对账和实时跟踪合同执行情况 等。 除去OpenAI的自述,其中对于行业的宏观视角也相当有趣,一些核心观点包括: 2026年将会是真正的智能体之年。 算力与收入之间存在明显的正 ...
MiniMax把自家“实习生”放出来了!
量子位· 2026-01-20 13:04
Core Insights - The article discusses the evolution of AI agents, emphasizing the need for them to deeply integrate into work environments and understand professional contexts to become effective long-term partners [3][29]. Group 1: AI Agent Evolution - Traditional workflows that separate demand, design, and code are rapidly dissolving [1]. - The new MiniMax AI-native workspace, Agent 2.0, is designed to act as a reliable partner by directly accessing local resources and adhering to established workflows [4][8]. - The update focuses on two core components: Desktop App for execution and Expert Agents for understanding business contexts [5][24]. Group 2: Desktop App Functionality - The Desktop App connects cloud capabilities directly to local computers, enabling it to read files and perform various tasks seamlessly [6][7]. - It can autonomously retrieve local resources, eliminating the need for users to manually input information [8]. - A complex task was designed to test the Desktop App's capabilities, requiring it to gather detailed information on 20 products and generate a comprehensive report and presentation [12][22]. Group 3: Expert Agents - Expert Agents allow for the injection of private knowledge and experience into the AI system, enabling it to understand specific business logic [26]. - This approach addresses the limitations of general models in handling highly specialized tasks [25]. Group 4: Long-term Partnership with Agents - The ultimate goal is for agents to evolve into long-term partners capable of delivering results by fully embedding themselves in the work environment [29]. - Key capabilities include continuous memory, the ability to internalize implicit experiences, and a keen awareness of the business environment [31][33][35]. Group 5: Real-world Applications - The article illustrates practical applications of Agent 2.0 in various departments, showcasing its ability to generate customized emails, modify website code, and analyze system alerts [36][37][39]. - The release of Agent 2.0 standardizes a high-efficiency production model that has already been successfully implemented within MiniMax [40][41].
豆包的新身份曝光:在国际艺术展当起了“AI讲解员”
量子位· 2026-01-20 10:04
Core Viewpoint - The article discusses the innovative use of AI, specifically the Doubao model, as an art exhibition guide, showcasing its advanced capabilities in real-time visual understanding and interaction with users [1][38]. Group 1: AI Capabilities - Doubao, the AI guide, demonstrated the ability to identify and recommend key artworks in a high-density exhibition environment, effectively filtering important pieces for the user [10][11]. - The AI's real-time visual perception allows it to continuously understand the presented images during video calls, providing seamless explanations of artworks without requiring additional user input [14][15]. - Doubao can autonomously search for additional information during the interaction, enriching the conversation with deeper insights about the artworks being discussed [20][22]. Group 2: Model Performance - The Doubao model 1.8 exhibits superior multi-modal processing capabilities, significantly improving its performance in visual understanding tasks compared to previous versions [24][25]. - In various benchmark tests, Doubao 1.8 outperformed other leading models in areas such as reasoning, visual comprehension, and real-time interaction, establishing itself in the top tier of AI models [26][34]. - The model's ability to handle complex instructions and maintain logical coherence during dynamic interactions highlights its advanced capabilities in practical applications [36][37]. Group 3: User Experience - The interaction with Doubao feels natural and human-like, enhancing the overall user experience during art exhibitions by providing a continuous flow of information and engagement [36][40]. - The AI's role in real-life scenarios, such as guiding users through exhibitions, signifies a shift towards more integrated and useful AI applications in everyday life [39][41].
量子位编辑作者招聘
量子位· 2026-01-20 04:17
Core Viewpoint - The article emphasizes the ongoing AI boom and invites individuals to join the company "Quantum Bit," which focuses on tracking AI advancements and has established itself as a leading content platform in the industry [1]. Group 1: Job Opportunities - The company is hiring for three main directions: AI Industry, AI Finance, and AI Product, with positions available for both experienced professionals and fresh graduates [2][4]. - Positions are open for various levels, including editors, lead writers, and chief editors, with a focus on matching roles to individual capabilities [6]. Group 2: Job Responsibilities - **AI Industry Direction**: Responsibilities include tracking innovations in infrastructure, such as chips, AI infrastructure, and cloud computing, as well as producing accessible reports on technical conferences and papers [6][7]. - **AI Finance Direction**: Focuses on venture capital, financial reports, and analyzing capital movements within the AI industry, including interviews with investors and entrepreneurs [11]. - **AI Product Direction**: Involves monitoring AI applications and hardware developments, writing in-depth product evaluations, and engaging with product experts [11]. Group 3: Benefits and Work Environment - Employees can expect a vibrant team atmosphere, opportunities for personal influence through original content creation, and professional mentorship from senior editors [6][11]. - The company offers competitive salaries and comprehensive benefits, including social insurance, meal allowances, and performance bonuses [6]. Group 4: Company Growth and Reach - By 2025, Quantum Bit aims to have over 2.4 million subscribers on WeChat and more than 7 million users across platforms, with a daily reading volume exceeding 2 million [12]. - The company is recognized as the top new media outlet in the AI and frontier technology sectors according to third-party data platforms [12].
从「能用」到「好用」:数据可视化的三个维度,你还在第一层吗?——人大提出图表创作新方式
量子位· 2026-01-20 04:17
Core Insights - The article discusses the evolution of data visualization from merely creating charts to addressing deeper challenges such as enhancing visual appeal and storytelling through dynamic data representation [2][9] - It highlights the need for tools that can streamline the process of creating visually engaging and interactive data presentations, moving beyond traditional methods that are often labor-intensive and not easily reusable [10][12] Group 1: Challenges in Data Visualization - The first challenge is creating visually appealing data representations without excessive manual effort, which often leads to time-consuming processes in design software [2][3][4] - The second challenge involves animating data visualizations, where the complexity of coding and limited flexibility in templates can deter users from implementing dynamic features [5][6] - The third challenge is the repetitive nature of implementing interactive features across different visualization types, which often requires starting from scratch with each new project [7][8] Group 2: Proposed Solutions - The IDEAS Lab team has developed three key projects: PiCCL for enhancing static chart creation, CAST for simplifying animation processes, and Libra for improving interactive capabilities [11][12][13] - PiCCL redefines the creation of static charts by focusing on graphic operations and constraints, allowing for more efficient and reusable designs [20][21][23] - CAST introduces a declarative model for animation that emphasizes data-driven timing structures, making it easier to create complex animations without extensive coding [28][35][36] Group 3: Enhancements in Interactivity - Libra aims to treat interactivity as a first-class citizen by breaking it down into reusable components, enhancing the ability to create complex interactions without starting from scratch [39][45] - The system supports features like undo/redo and provides a structured approach to managing interactions, making it easier to implement and maintain [42][43] - By leveraging the capabilities of PiCCL, CAST, and Libra, the future of data visualization is expected to incorporate more efficient and user-friendly tools, potentially utilizing large models for enhanced visualization generation [44]
首个真正“能用”的LLM游戏Agent诞生!可实时高频决策,思维链还全程可见
量子位· 2026-01-20 04:17
Core Viewpoint - The article discusses the emergence of AI in the gaming industry, highlighting the capabilities of a new AI agent called COTA developed by Chao Can Shu Technology, which demonstrates advanced decision-making and operational skills in gaming environments [1][6][55]. Group 1: AI in Gaming - A mysterious gaming account named "快递员" has gained significant attention for its impressive performance in League of Legends, raising questions about the role of AI in gaming [2][4]. - The gaming industry is increasingly focusing on AI, with various companies exploring this technology to enhance gaming experiences [6][7]. - Chao Can Shu Technology has successfully commercialized AI agents across multiple game types, showcasing their expertise in this field [8][9]. Group 2: COTA's Features and Performance - COTA is described as a versatile gaming agent capable of cognitive reasoning, operational execution, tactical planning, and assistance, all powered by a large model [9][10]. - The agent has demonstrated professional-level performance in a first-person shooter (FPS) game demo, where it must make rapid decisions in high-stakes environments [12][13]. - COTA's design allows it to perform complex actions fluidly, simulating human-like gameplay while maintaining high levels of strategy and decision-making [28][34]. Group 3: Technical Innovations - COTA employs a dual-system architecture that separates fast action execution from deep analysis, mimicking human cognitive processes [40][41]. - The agent utilizes a base model called Qwen3-VL-8B-Thinking, balancing performance and efficiency to meet the demands of real-time gaming [39]. - COTA's training pipeline includes stages for supervised fine-tuning, self-play for strategy optimization, and alignment with human preferences, enhancing its gameplay realism [50][51][52]. Group 4: Industry Implications - COTA represents a significant advancement in AI gaming technology, indicating a shift from experimental models to practical applications in the gaming industry [55][56]. - The success of COTA suggests a broader trend where AI agents are becoming integral to enhancing player experiences and game design [57][59]. - The potential applications of COTA extend beyond gaming, offering insights into solving complex real-world problems through its innovative architecture [72][76].