量子位
Search documents
马斯克SpaceX背后的她:现实版钢铁侠小辣椒
量子位· 2026-01-24 05:19
Core Viewpoint - The article discusses the significance of Gwynne Shotwell, the President of SpaceX, as a key figure behind the company's success, especially in light of the upcoming IPO that aims for a valuation of $1.5 trillion and over $30 billion in financing [2][55]. Group 1: Gwynne Shotwell's Role and Characteristics - Gwynne Shotwell is recognized as the "Adult in the Room" at SpaceX, effectively managing the company alongside Elon Musk [6][5]. - She is described as bold, passionate, and direct, which allows her to maintain a productive relationship with Musk despite his challenging personality [10][11]. - Shotwell has a strong technical background, holding degrees in mechanical engineering and applied mathematics, which contributes to her credibility in the aerospace industry [15]. Group 2: Key Contributions to SpaceX - Shotwell played a crucial role in saving SpaceX during its early struggles in 2008, particularly after the failure of the Falcon 1 rocket launches [28][29]. - She successfully negotiated a $1.6 billion contract with NASA for cargo transport to the International Space Station, which was pivotal for SpaceX's survival [32][34]. - Her ability to understand Musk's vision and provide constructive feedback has been essential in aligning the company's goals and strategies [40][41]. Group 3: Leadership and Team Dynamics - Shotwell's leadership style is characterized by her loyalty to Musk while also advocating for the team, ensuring a balance between company demands and employee welfare [49][50]. - She emphasizes teamwork and collective achievement over individual recognition, often attributing SpaceX's success to the efforts of all employees [52]. - The dynamic between Musk and Shotwell is likened to a partnership where Musk's visionary ideas are complemented by Shotwell's operational stability, akin to a ship's sail and rudder [54]. Group 4: Market Impact and Future Outlook - As SpaceX approaches a critical juncture in the capital markets, Shotwell's presence is seen as a stabilizing factor that reassures investors and stakeholders [55].
微软发布医疗时序基座模型:基于4540亿数据预训练,解决不规则采样难题
量子位· 2026-01-24 05:19
Core Viewpoint - The article discusses the introduction of MIRA, a universal base model designed for medical time series data, which addresses the challenges of irregular and heterogeneous medical data, aiming to enhance predictive capabilities in healthcare AI [5][25]. Group 1: Medical AI Landscape - Large Language Models (LLMs) and Computer Vision (CV) are transforming the healthcare industry, enabling AI to interpret CT images and write medical summaries [1]. - A critical missing piece in medical AI is the ability to understand the "dynamic evolution of life," which is essential for capturing the continuous trajectory of vital signs [2][4]. Group 2: Challenges in Medical Time Series Data - Traditional deep learning models rely on idealized assumptions of uniform data sampling, which do not hold true in real-world medical scenarios, particularly in Intensive Care Units (ICUs) where vital signs are recorded at irregular intervals [9][10]. - The characteristics of medical time series data include irregular time intervals, heterogeneous sampling rates, and data missing due to non-standard clinical workflows [12]. Group 3: MIRA Model Introduction - MIRA is built on 454 billion medical data points and aims to overcome the limitations of traditional models by learning physiological dynamic patterns across various scenarios and modalities [5][25]. - MIRA employs two core technologies: Continuous Time Rotational Position Encoding (CT-RoPE) for understanding historical data and Neural ODE for predicting future states [13][18]. Group 4: Experimental Validation - MIRA demonstrates zero-shot transfer capabilities, outperforming some supervised models in out-of-distribution tests, indicating its ability to learn general physiological signal changes [21]. - MIRA shows high robustness in handling sparse data, maintaining performance even with only 30% of observation points, unlike traditional models that rely on interpolation [23][24]. Group 5: Future Implications - The introduction of MIRA marks a significant step towards a "universal base" era in medical AI, allowing hospitals to quickly develop high-precision customized models with minimal local data [25].
将登央视春晚,今年冲击IPO!苏州具身新贵魔法原子联创披露一堆新信息
量子位· 2026-01-24 01:40
衡宇 发自 太湖边 量子位 | 公众号 QbitAI 就在昨晚,具身智能创业公司 魔法原子,宣布将登上今年央视春晚舞台 。 几乎在官宣新闻发布的同一时间,魔法原子联合创始人顾诗韬在与我们的交流中透露, 魔法原子将在今年冲击IPO 。 魔法原子相对还比较年轻,我们是在2024年的时候刚成立。 现在正在加速上市的进程,正严格按照最快的时间表在排期。比较期待2026年能在二级市场看到我们的消息。 魔法原子 成立于2024年1月 ,旗下产品包括Magic Bot机器人系列及Magic Dog四足机器人系列。 据介绍,Magic Bot可完成复杂动作并具拟人社交能力;Magic Dog能耐受-20℃至55℃极端环境,凭精准定位穿梭复杂场景。 沟通过程中,顾诗韬同步透露了一些数据和此前未曾公开的信息。 她告诉我们, 魔法原子即将发布新产品 ,除此之外,还传递了关于魔法原子"现在"和"未来"的更多消息。 顾诗韬 :长期具身智能之战当前还在商业化落地的非常早期的阶段,未来的具身智能之争, 最终会变成一个资源之争,资金之争和人才之争 竞争越来越激烈的前夜,我们要做两件事。 第一件事情是 快速去跑通自己商业化的闭环 ,需要从经营 ...
以最低图像分辨率斩获SOTA!全栈开源具身模型发布:3.5万小时炼出通用大脑
量子位· 2026-01-23 12:09
Core Insights - The article discusses the breakthrough of the Being-H0.5 model in the field of embodied intelligence, addressing the challenges posed by data isolation and the "Matthew Effect" in the industry [1][3][39] - Being-H0.5 is the largest VLA model with 35,000 hours of training data, enabling cross-robot zero-shot skill transfer and showcasing remarkable generalization capabilities [2][3][30] Data and Model Development - The Being-H0.5 model integrates 35,000 hours of data, including 14,000 hours of robot data and 16,000 hours of human data, across 30 robot types, allowing for rapid adaptation and stable execution regardless of hardware configuration [2][8] - The UniHand-2.0 dataset, an iteration of UniHand-1.0, features over 35,000 hours of high-quality data, marking a significant advancement in cross-domain data integration [8][9] Training Paradigms - The model employs a human-centric learning paradigm, aligning human intent with robotic actions through a unified token sequence that captures physical interaction signals [20][39] - A unified action space framework is established to overcome the dimensional gap between heterogeneous hardware, facilitating joint training and knowledge sharing [16][17] Architectural Innovations - The Mixture-of-Flow (MoF) architecture allows for the decoupling of action experts, focusing on learning universal motion primitives while ensuring precise execution for specific robot types [22][23] - The model incorporates mechanisms like manifold-preserving gating and universal async chunking to enhance robustness and adaptability across different hardware [23][24] Performance and Validation - Extensive real-world testing on various robot types demonstrated that Being-H0.5 can perform complex tasks, achieving competitive success rates compared to specialized models [28][30][35] - The model's performance in quantitative evaluations shows it surpasses existing VLA models, achieving an average success rate of 98.9% in specific tasks [35][36] Open Source and Future Directions - The BeingBeyond team commits to a full-stack open-source approach, providing not only pre-trained models but also complete training frameworks and evaluation tools to foster community innovation [37][38] - The vision is to establish Being-H0.5 as a foundational infrastructure in the embodied intelligence sector, enabling rapid development without the need for extensive data collection [39]
把医疗AI禁锢在严肃区间:百川M3 Plus首创“证据锚定”,幻觉率2.6%刷新全球纪录
量子位· 2026-01-23 12:09
Core Viewpoint - The article discusses the increasing integration of AI in the medical field, particularly focusing on the advancements made by Baichuan Intelligent in developing a reliable AI model for clinical use, addressing the challenges of trust and cost in medical AI applications [5][6][20]. Group 1: AI Adoption in Healthcare - Many doctors, especially younger ones, are beginning to embrace AI technologies in their practice, with Baichuan's professional model having around 100,000 doctor users [2]. - The medical community generally agrees on the potential of AI, but there are significant barriers to its clinical implementation, primarily trust and cost [4][5]. Group 2: Baichuan's M3 Plus Model - Baichuan's latest model, M3 Plus, has achieved a hallucination rate of 2.6%, the lowest in global evaluations, thanks to its unique six-source evidence technology [6][19]. - The model's success is attributed to Fact-Aware Reinforcement Learning, which incorporates medical facts into the training process to reduce hallucinations [12][46]. Group 3: Cost Reduction and Accessibility - M3 Plus has undergone extensive optimization, resulting in a 70% reduction in API call costs compared to its predecessor, making it more accessible for hospitals and doctors [21][47]. - The Gated Eagle-3 architecture enhances inference throughput by approximately 15%, further lowering the cost per request [22]. Group 4: Evidence Anchoring Technology - Baichuan has introduced "Evidence Anchoring" in M3 Plus, ensuring that every medical conclusion made by the AI is directly supported by original evidence from literature [32][46]. - This approach addresses common issues in AI-generated medical responses, such as incorrect citations and conflicting information, which have historically plagued the industry [25][30]. Group 5: Free Access Initiative - Baichuan has launched the "Haina Baichuan" free plan, allowing unlimited access to M3 Plus for institutions serving medical professionals, provided they display "Powered by Baichuan" [47][48]. - This initiative aims to prevent redundant technological development in the industry and facilitate real-world testing and iteration of AI applications in healthcare [54][56]. Group 6: Impact on Medical Professionals - The advancements in AI, particularly the reduction in hallucination rates, provide medical professionals with greater confidence in their decision-making processes [57]. - The article emphasizes the importance of translating these technological improvements into practical applications that benefit patients directly [61].
2.4万亿参数“最强文科生”,文心5.0正式版,你挺懂山东人啊?
量子位· 2026-01-23 12:09
星星 发自 凹非寺 量子位 | 公众号 QbitAI 文心大 模型5.0正式 版 ,来了。 这意味着,自2025年11月Preview版初露锋芒后,这个 参数量高达2.4万亿、主打原生全模态 的"巨无霸"模型的"完全体"来了。 先来看一组"入场成绩单": 近三个月,在全球大模型竞技场LMArena上,文心5.0 Preview版多次在文本榜 (Text Arena) 和视觉理解榜 ( Visi on Arena) 上拿下 国产第一。 1月8日,ERNIE-5.0-Preview-1220以1226分登上 视觉理解榜国产第一、全球Top8 ;1月15日最新榜单,ERNIE-5.0-0110以1460分登上 文本榜国产第一、全球Top8 , 文心5. 0已稳 稳站进了全球第一梯队。 尤其引人注 目的是 ,在LMArena的用户反馈 和评测维度中,在 创意写作、复杂指令遵循、高难度理解 等任务中,文心5.0优势明显。 根据官方晒出的40余项权威基准的综合评测结果,文心5.0在语言、音频、视觉理解、视觉生成的多个维度 超越Gemini-2.5-Pro、GPT-5- High 等模 型,整体处于领先水平。 文心Mome ...
量子位编辑作者招聘
量子位· 2026-01-23 12:09
Core Viewpoint - The article emphasizes the ongoing AI boom and invites individuals to join the company "Quantum Bit," which focuses on tracking AI advancements and has established itself as a leading content platform in the industry [1]. Group 1: Job Opportunities - The company is hiring for three main directions: AI Industry, AI Finance, and AI Product, with positions available for both experienced professionals and fresh graduates [2][4]. - Positions are open for various levels, including editors, lead writers, and chief editors, with a focus on matching roles to individual capabilities [6]. Group 2: Job Responsibilities - **AI Industry Direction**: Responsibilities include tracking innovations in infrastructure, such as chips, AI infrastructure, and cloud computing, as well as interpreting technical reports from conferences [6][7]. - **AI Finance Direction**: Focuses on venture capital, financial reports, and capital movements within the AI industry, requiring strong analytical skills and a passion for interviews [11]. - **AI Product Direction**: Involves monitoring AI applications and hardware developments, producing in-depth evaluations of AI products, and engaging with industry experts [11]. Group 3: Benefits and Work Environment - Employees will have the opportunity to engage with cutting-edge AI technologies, enhance their work efficiency through new tools, and build personal influence in the AI field [6]. - The company offers competitive salaries, comprehensive benefits including social insurance, meal allowances, and performance bonuses, and promotes a dynamic and open team culture [6]. Group 4: Company Growth and Reach - By 2025, Quantum Bit aims to have over 2.4 million subscribers on WeChat and more than 7 million users across platforms, with a daily reading volume exceeding 2 million [12]. - The company is recognized as the top new media outlet in the AI and frontier technology sectors according to third-party data platforms [12].
内存条涨速超金条!100根可换上海一套房,你的手机电脑汽车都逃不过涨价
量子位· 2026-01-23 10:25
Core Viewpoint - The memory chip market is experiencing a significant price surge, driven primarily by the demand from AI servers, which require substantially more memory than traditional servers. This has led to a supply shortage that is expected to last until at least 2026, with prices projected to continue rising in the near term [6][19][20]. Group 1: Price Surge and Market Dynamics - The price of DDR5 server memory has skyrocketed, with a single 256G DDR5 module costing over 40,000 yuan, leading to a total price of 4-5 million yuan for 100 modules, which is comparable to the price of a residential property in Shanghai [1]. - From the second half of 2025, DDR5 memory prices have increased by over 300%, while DDR4 prices have risen by more than 150% [2]. - The market is characterized by extreme volatility, with prices changing daily, marking one of the most intense periods in the storage industry [3]. Group 2: Supply Shortage and Industry Response - Investment banks like UBS have indicated that the storage industry is entering a severe supply shortage phase, surpassing the historical highs seen in 2018 [4]. - Major manufacturers such as Samsung, SK Hynix, and Micron are reallocating production resources towards higher-margin High Bandwidth Memory (HBM), which is consuming a significant portion of general DRAM capacity [6][7]. - AI servers currently account for 53% of global memory production capacity, leading to a drastic reduction in the supply of general memory types like DDR5 and LPDDR5 [9]. Group 3: Manufacturer Strategies and Challenges - Manufacturers are cautious about expanding production due to previous losses during the industry downturn from 2023 to early 2024, with some companies like Micron exiting consumer markets to focus on data centers [10][11]. - Despite DDR5 becoming mainstream, there is still a high demand for DDR4, but major manufacturers have cut back on DDR4 production, leading to price anomalies where DDR4 prices exceed those of DDR5 [11]. Group 4: Impact on Various Industries - The price increases are affecting downstream industries, with PC brands like Lenovo and Dell beginning to raise prices, forcing consumers to either accept higher costs or opt for devices with reduced storage capacity [15][16]. - The automotive industry is particularly impacted, as the demand for memory has surged from a few GB to 256GB or even TB levels due to increased vehicle intelligence [18]. - Companies with strong supply chain management, such as Apple and Huawei, are less affected, while smaller firms with thin profit margins are facing significant challenges [18]. Group 5: Future Outlook - The peak of the supply shortage is expected in the first and second quarters of 2026, with prices likely to maintain a growth rate of over 20% quarter-on-quarter during that period [19]. - The price surge cycle is anticipated to last at least until the end of 2026, with a projected 26% increase in DRAM demand against a 20% increase in supply [19]. - Historical patterns suggest that the price surge will eventually correct once AI infrastructure stabilizes and new production capacity comes online, but this is not expected before 2027 [20].
VS Code现在能像Figma一样搞设计了
量子位· 2026-01-23 10:25
Core Viewpoint - The article discusses the emergence of a new tool called Pencil, which integrates design and coding by allowing users to convert Figma designs directly into code using AI, thereby redefining UI design processes [2][4][25]. Group 1: Introduction of Pencil - Pencil is described as an agent-driven MCP canvas tool that allows for real-time updates of code logic as design elements are manipulated [6][10]. - The tool enables users to drag and drop elements on a design canvas, with the underlying code being updated instantly [9][10]. Group 2: Functionality and Applications - Users can either download Pencil and connect it to Claude Code for Vibe design or install a Pencil plugin in IDEs like VS Code, integrating design and coding environments [11]. - The process involves inputting ideas into an AI prompt window to generate a temporary design, which can then be adjusted and converted into code for browser preview [13][14]. Group 3: Design and Code Integration - Pencil operates on the principle of "design as code," directly modifying UI definitions in the codebase rather than generating visual files [30][31]. - This integration allows for real-time changes in the codebase as design elements are adjusted, ensuring pixel-perfect alignment [32]. Group 4: Compatibility and Version Control - Pencil is compatible with Figma, allowing users to copy and paste designs while retaining vector, text, and style integrity [33][34]. - The design files can be managed like code, enabling version control, branching, and merging within the code repository [35]. Group 5: Impact on UI Design - The introduction of Pencil signifies a shift in UI design, moving from static visual files to dynamic, code-based design processes [24][25]. - This AI-driven collaboration redefines how designers and developers interact, breaking down traditional barriers between design and development [10][25].
猜AI视频,你猜你也错!只有10%的人过关了
量子位· 2026-01-23 07:44
梦瑶 发自 凹非寺 量子位 | 公众号 QbitAI 坏事儿了,这回我是真有点分不清AI和真人视频了…… 我已经说不清楚了,大家伙干脆直接一起猜猜看吧。 Round1 :先来个热身题,左右这两段「眼睛」视频,哪一个是AI生成的? Round2 :难度稍微加一点,左右这两只狮子特写镜头,哪一段更像是AI做的? 好了,答案揭晓时刻来了!每一道题里,由AI生成的视频依次是: Round3 :换个生活点的,左右这两块拿披萨的视频,哪一块是AI生成的? Round4 :动态场景来了,左右这两段骑摩托车的画面,哪一段更像AI视频? Round5 :最后一题收官——左右这两个日历画面,哪一个是AI生成的? 右、右、左、左、右 。 怎么样,猜对了几个? (欢迎友友们在评论区报战绩~) 这组视频,其实来自 Runway 做的一项AI实验:结果发现,在1043名参与者里,只有 10% 的人能成功分辨出哪些是AI视频。 以前刷视频:这不一眼AI吗? 现在刷视频:到底哪个才是真的啊?? 反正我是真分辨不出来了。 只有10%的人,真的分得清AI视频和真实视频 其实Runway搞这么个实验,也真不是一时兴起。 起因是他们偶然发现,连《自 ...