量子位
Search documents
马斯克xAI联创11跑10!仅剩特斯拉嫡系独苗留守
量子位· 2026-03-26 04:12
Core Viewpoint - The article discusses the significant turnover within xAI's founding team, highlighting that out of the original 11 co-founders, only one remains, Ross Nordeen, following the departure of key personnel, including Manuel Kroiss, who was responsible for pre-training and reported directly to Elon Musk [3][21][35]. Group 1: Founding Team Departures - Manuel Kroiss, known as Makro, has announced his departure, marking the exit of 10 out of 11 co-founders [2][3]. - The departures include notable figures such as Zihang Dai and Guodong Zhang, who left in early March, with Dai having a PhD from Carnegie Mellon and previously working at Google [5][6]. - Other key departures include Toby Pohlen, Jimmy Ba, Tony Wu, Greg Yang, Christian Szegedy, Igor Babuschkin, and Kyle Kosic, with various roles in AI development and infrastructure [8][9][10][11][12][13][14]. Group 2: Current Status and Future Plans - Ross Nordeen remains the last co-founder, having a strong background from Tesla, where he managed technical projects and AI-related hardware [21][24][26]. - Nordeen's current focus at xAI involves building a large-scale infrastructure for AI, including data centers and high-performance computing resources [30][34]. - Elon Musk has acknowledged the need to rebuild xAI from the ground up due to the high turnover, indicating a restructuring to improve efficiency and a reevaluation of past hiring decisions [35][36]. Group 3: Recruitment and New Talent - Musk is actively seeking to bring in new talent, including re-engaging previously rejected candidates and hiring from within Tesla and SpaceX [36][37]. - Recently, Musk recruited two talented individuals from Cursor, Andrew Milich and Jason Ginsberg, to strengthen the team [38][41]. - The restructuring and new hires aim to create a more robust xAI team capable of advancing its AI initiatives [42].
全新OCR将图片变代码无损重绘!华中科大&小红书发布3B模型,图形重建超越Gemini 3 Pro
量子位· 2026-03-26 04:12
MOCR团队 投稿 量子位 | 公众号 QbitAI 一个3B参数的小模型,在文档解析上打败了一众开源大模型,在图形重建上甚至反超了Gemini 3 Pro——而且不只是某一项指标,是在六个 图形重建基准上全面超越。 这就是华中科技大学与小红书hi lab联合推出的 MOCR (Multimodal OCR) 。 它提出了一个大胆的新范式: 不只识别文字,而是「解析一切」 ——文字、表格、图表、公式、流程图、化学结构式、UI组件……通通变成 可编辑、可渲染的结构化代码。用论文的话说,这是把文档图形从「二等公民」升级为「一等解析目标」。 更关键的是,这不只是一个技术上的改进,而是一次范式级别的重新定义:文档解析的终点不应该是「把字认出来」,而应该是「把页面上的 一切信息都变成机器可理解、可复用的结构化表示」 △ MOCR整体流程:给定文档图片,将页面上所有元素统一解析为结构化输出,忠实重建原始文档 但传统OCR怎么处理? 框出来,裁成图片,丢掉。 这意味着文档里大量的结构化语义信息在解析环节就被永久性地丢弃了。论文中用一张对比图把这个问题说得很清楚: △ 传统OCR vs MOCR:传统方案把图形当像素丢弃,M ...
谷歌新论文把内存股价干崩了!KV cache压缩6倍,“谷歌的DeepSeek时刻”
量子位· 2026-03-26 01:38
Core Viewpoint - The significant drop in stock prices of Micron and Western Digital is linked to Google's presentation of a new compression algorithm, TurboQuant, which could reduce memory requirements for AI inference by at least six times, negatively impacting the memory chip market [1][5][36]. Group 1: TurboQuant Algorithm - Google Research introduced TurboQuant, a compression algorithm that compresses the memory-intensive KV cache used in AI inference by at least six times without loss of precision [4][5]. - TurboQuant employs two key innovations: PolarQuant, which uses polar coordinates to eliminate the need for additional storage of normalization constants, and QJL, which compresses high-dimensional data into binary symbols without extra memory [16][21]. - The combination of these methods allows for 3-bit quantization, achieving zero loss in precision and significantly reducing memory usage during AI inference [23][30]. Group 2: Performance Improvements - TurboQuant demonstrated an 8x speed increase in calculating attention scores on NVIDIA H100 GPUs compared to the unquantized 32-bit version [29]. - In benchmark tests across various tasks, TurboQuant achieved perfect scores while reducing KV cache memory usage by at least six times [25][24]. - The algorithm not only conserves memory but also enhances speed, outperforming existing quantization methods in vector search without requiring dataset-specific tuning [30]. Group 3: Industry Implications - The introduction of TurboQuant is seen as a pivotal moment for AI memory efficiency, akin to Cloudflare CEO's reference to Google's "DeepSeek moment," suggesting that high-quality models can be trained with fewer resources [32][33]. - TurboQuant is expected to improve the efficiency of semantic search and large-scale vector indexing, making queries faster and more cost-effective for Google [36]. - However, it is important to note that TurboQuant is still a laboratory result and has not yet been deployed on a large scale, and it only addresses memory issues during the inference phase, leaving the training phase unaffected [37][38].
Meta华人实习生搞出超级智能体!自己写代码实现自我进化
量子位· 2026-03-26 01:38
Core Viewpoint - The article discusses the emergence of "Hyperagents," a new generation of AI that can continuously self-improve and optimize its underlying logic through meta-learning, combining concepts from Gödel Machines and Darwinian algorithms [5][7][8]. Group 1: Understanding Hyperagents - Hyperagents are defined as AI systems capable of not only improving their task execution but also modifying the process of generating future improvement suggestions [29]. - The foundation of Hyperagents lies in the Gödel Machine, a hypothetical self-improving AI that can rewrite its own code to enhance its performance [12][13]. - The Darwin Gödel Machine (DGM) extends the Gödel Machine concept by utilizing open-ended algorithms to propose and search for code improvements, leading to a diverse and high-quality AI agent library [18][20]. Group 2: Performance Improvements - Experimental results show that DGM can significantly enhance its performance, with improvements from 20.0% to 50.0% on the SWE-bench [36][37]. - DGM's performance also increased from 14.2% to 30.7% on the Polyglot benchmark, outperforming traditional AI agents [42]. - The self-improvement capabilities of DGM are enhanced by its ability to explore multiple evolutionary paths through an open-ended evolution search strategy [41]. Group 3: Limitations and Future Directions - DGM primarily excels in programming tasks due to the alignment between task evaluation and self-modification, which may not hold in non-programming domains [22][28]. - The article emphasizes the need for AI safety as these systems can potentially exceed human-defined algorithmic boundaries [8]. - Future iterations of Hyperagents, such as DGM-H, aim to incorporate cross-domain transfer and cumulative improvements, enhancing their adaptability and effectiveness [35].
它石智航用“吉尼斯纪录”交卷真干活的具身大脑,丁文超:从来没有Plan B
量子位· 2026-03-25 23:00
Jay 李根 发自 凹非寺 量子位 | 公众号 QbitAI 天使轮拿下 2.42亿美元 后, 它 石智航 到底干啥去了? 它石A1机器人,1小时内完成亚毫米级柔性线束完整装配任务百余次,创下全新吉尼斯世界纪录。 一系列成果让它石首次在线下亮相,便吸引了 央视、新华网 等头部官媒的组团报道。 线束装配 ,是地狱级的工业场景。 长程操作、柔性操作、亚毫米精度, 不可能三角全部集齐 ,被喻为工业自动化界的「哥德巴赫猜想」。 然而接下来的一年里,它石智航选择了一条截然不同的路:没有参加各种行业大会,没有频繁对外发声, 没有出现在春晚或各类展示活动中 ,一直踏实干活。 一年后,这家以技术工程和产业落地集结的明星团队,交出了答卷—— 利刃出鞘 ,一 鸣惊人。 2025年年初,这家公司是资本市场毫无疑问的宠儿。天使轮融资额刷新纪录,明星创始团队,让它石自创立之初,便一跃成为具身智能赛道 炙手可热的名字。 时隔一年,高调亮相的它石,直接打爆了这个场景,如期兑现了对投资人的承诺。 而同样是时隔一年, 量子位也再次来到了上海—— 从它石智航联合创始人、首席科学家 丁文超 这里,获得了关于这家公司更多的技术进展和背后的具身洞察。 ...
CCF强硬反制NeurIPS,不纠正就除名!号召全国科研人员集体抵制
量子位· 2026-03-25 23:00
闻乐 发自 凹非寺 量子位 | 公众号 QbitAI 强硬回应,坚决反对! 最新消息,中国计算机学会(CCF)发声硬刚NeurIPS不公平制裁,直接提出最严厉的反制: 如果NeurIPS不及时纠正错误,CCF将正式将其移出推荐国际学术会议目录 。 与此同时,CCF号召全国计算机领域科研人员全面抵制,拒绝投稿、拒绝提供任何学术服务。 CCF这波是要动真格了,支持!! NeurIPS不公平制裁 事情的起因,要从NeurIPS最新发布的2026年征稿规则说起。 作为全球人工智能与机器学习领域最具影响力的顶会之一,NeurIPS本应坚守学术中立、开放无国界的原则。 但在今年的征稿通知中,会议方明确以"遵守美国制裁与贸易管制规定"为由,直接禁止被列入美国清单的机构投稿、参与评审、担任编辑等一 切学术活动。 这份名单涉及大量中国科研机构与科技企业,其中 包括华为、多家高校院所、重点实验室,甚至三大运营商等共计数百家机构 。 这意味着,这些单位的科研人员哪怕做出再顶尖的成果,也被剥夺了在国际顶会展示、交流、发表的权利。 消息一出,国内AI圈瞬间炸锅。 第二,给出最后通牒,如果NeurIPS不及时纠正错误,CCF将 直接把N ...
第一批“首席龙虾官”,月薪6万
量子位· 2026-03-25 10:45
Core Viewpoint - The emergence of new job titles such as "Chief Claw Officer" and "OpenClaw Engineer" indicates a growing trend in the job market, particularly in AI and technology sectors, reflecting the integration of AI technologies into various industries [1][7][24]. Group 1: Job Market Trends - Companies are actively recruiting for positions related to "OpenClaw," with job postings appearing not only in major cities like Beijing and Shanghai but also in places like Xiamen and Chengdu [4][6]. - The salary range for these positions is generally between 30K to 60K, with some companies offering over a million for top roles, and even internships with daily pay exceeding a thousand [8][19]. - The job titles are not limited to tech companies; they span across various sectors including healthcare, real estate, and tourism, indicating a broad application of AI technologies [7][24]. Group 2: Job Responsibilities and Requirements - The role of Chief Claw Officer (CCO) involves reporting directly to the CEO and is focused on driving the company's AI Native transformation, primarily through the development of Agent systems [12][19]. - Requirements for the CCO position include a strong background in AI technologies, particularly in Agent systems, and a preference for candidates from the post-2000 generation [14][19]. - Other roles, such as OpenClaw Development Engineer, require experience in system design and deployment, familiarity with AI frameworks, and the ability to build complex systems from scratch [16][19]. Group 3: Industry Impact - The rise of new job titles like "Prompt Engineer" and "Vibe Coding Engineer" alongside "Dragon Officer" reflects the ongoing evolution of job roles due to advancements in AI and intelligent agent technologies [24][25]. - The integration of AI is not only transforming tech and internet sectors but is also influencing traditional industries, leading to a gradual change in work dynamics [25][26]. - The trend suggests a shift towards a new workplace ecosystem where AI technologies are seamlessly integrated into various job functions, indicating an irreversible change in the job market [26][27].
龙虾爆火之后,AI初创的全球化机会来了吗?|明天见
量子位· 2026-03-25 10:45
Core Viewpoint - The article discusses the potential of AI startups in the context of globalization, emphasizing the need for these companies to identify the right applications, scenarios, and channels from the outset to succeed in the global market [2][30]. Group 1: Event Overview - A salon event is organized featuring leading global practitioners from companies like Xiaoying Technology, FluxA, Google, JD, Agora, and Meshy, who will share reusable experiences in going global from perspectives of product, technology, and growth [4]. - The event aims to facilitate discussions on the real logic of global AI entrepreneurship, welcoming participants at various stages of their international journey [6]. Group 2: Startup Presentations - Leewow allows users to design products freely, covering various creative themes and offering items like T-shirts and bags [12]. - Brain Recording focuses on consumer-grade non-invasive brain-machine interfaces for sports and cognitive health, developing products like the Nuromova smart sports headband [14]. - MeetaVista aims to create an entry point for AI in the real world, integrating technologies like naked-eye 3D and spatial AI terminals for applications in retail and education [16]. - DataElem specializes in the application of large models, with products like BISHENG and Clawith aimed at enterprise-level services [17]. Group 3: Key Participants - Xiaoying Technology's VP of C-end business, Lin Xiaodong, has developed a video template creation tool for the Indian market, achieving significant success in app rankings [20]. - FluxA's CTO, Qiu Honglin, focuses on reshaping e-commerce productivity through AI and is actively recruiting AI service providers [23]. - Agora's AI product growth leader, Yang Fan, has extensive experience in the audio-video and mobile internet sectors, focusing on the commercialization of dialogue-based AI engines [26].
PyPI遭投毒!LiteLLM用户Python启动就中招,个人凭证秒泄露
量子位· 2026-03-25 06:31
Core Viewpoint - The article highlights a significant security breach involving the LiteLLM Python package, which was found to contain malicious code that could steal sensitive user information upon installation [1][2][3]. Group 1: Incident Overview - The malicious versions LiteLLM 1.82.7 and 1.82.8 were uploaded to PyPI, leading to the immediate leakage of SSH keys, AWS credentials, and API keys upon installation [2][4]. - The package was downloaded approximately 3.4 million times daily, raising concerns about the widespread impact on developers who automatically install new versions [4][5]. - The malicious code was discovered by Callum McMahon while testing a plugin, which led to the identification of the harmful file litellm_init.pth [11][12]. Group 2: Malicious Code Functionality - The malicious file collects sensitive user data, including SSH private keys, cloud credentials, and database passwords, and executes commands to export environment variables [13]. - The stolen data is encrypted using a hardcoded RSA public key and sent to an attacker-controlled cloud server [15]. - If Kubernetes service account tokens are detected, the malware attempts to create privileged pods on each node, potentially leading to further exploitation [17]. Group 3: Source of the Vulnerability - The vulnerability originated from a compromised security tool, Trivy, which was manipulated to inject malicious code into the LiteLLM package during its CI/CD pipeline [18][19]. - Attackers exploited the compromised Trivy to gain access to the PyPI credentials of LiteLLM's maintainer, allowing them to publish the malicious versions [19]. Group 4: Response and Mitigation Steps - Users who installed the malicious versions are advised to check their installed version and remove the package immediately, along with clearing the package manager cache [21][22][23]. - It is recommended to audit Kubernetes environments for unauthorized access and to rotate personal credentials to prevent further breaches [24][25]. - Developers who have not yet installed the malicious versions should temporarily lock to version 1.82.6 until a secure update is released [26]. Group 5: Broader Implications - The article emphasizes that supply chain attacks are becoming more common, particularly targeting high-permission tools like Trivy and LiteLLM [27][28]. - The impact of such attacks can be extensive, affecting users who may not have directly installed the malicious software but rely on affected dependencies [29]. - Developers are urged to reassess their dependency management practices and the security of underlying tools to mitigate future risks [30][31].
中国AI音乐,悄悄把全球第一拿走了
量子位· 2026-03-25 06:31
Core Viewpoint - The article highlights that China's AI music model, Mureka V8 by Kunlun Wanwei, has achieved the top position in the Artificial Analysis music model leaderboard, surpassing international models like Suno V4.5 and Udio v1.5 Allegro, marking a significant milestone in the AI music industry [1][25]. Group 1: Mureka V8's Achievements - Mureka V8 has claimed the number one spot in both vocal and instrumental categories, showcasing its dual capabilities [2][25]. - The model's performance is characterized by its ability to generate music that feels natural and emotionally resonant, with clear articulation and nuanced breathing effects in vocal performances [5][20]. - Mureka V8's instrumental capabilities have also impressed, producing engaging and recognizable riffs, demonstrating a significant understanding of musical elements [18][19]. Group 2: Development Timeline - The development of Mureka has been rapid, with eight major versions released in less than two years, averaging an update every three months [27][28]. - The evolution of Mureka reflects a systematic approach to making AI-generated music not only usable but also enjoyable, transitioning from basic functionality to high-quality production [30][31]. - By the time of the V8 release, Mureka had achieved a level of sophistication that allowed it to produce complete songs with coherent structure and emotional depth [38][40]. Group 3: Industry Context - The article discusses a shift in the AI music landscape, where domestic models like Mureka V8 are reclaiming leadership from previously dominant international models [46][47]. - This trend is part of a broader pattern in the AI sector, where Chinese companies are increasingly catching up and even surpassing their Western counterparts in various AI applications [49][51]. - The success of Kunlun Wanwei in AI music is attributed to its long-term investment in making "good music" a reproducible system capability, supported by a robust domestic user base and diverse application scenarios [57][58].