量子位
Search documents
谷歌Gemini变身免费家教:接入全真模考,错题还能掰碎了讲
量子位· 2026-01-22 05:39
克雷西 发自 凹非寺 量子位 | 公众号 QbitAI 谷歌来给考生送福利了! 从现在起,备考SAT的学生可以免费通过Gemini进行模拟考试,分数立等可取,还能帮你讲解错题。 谷歌CEO劈柴哥表示,这对学生党来说是一个大好的消息。 除了劈柴哥的个人号的75万阅读,谷歌官号还有两百多万网友在线围观。 而且不少人都对这个功能赞不绝口,还有网友在许愿能不能支持其他考试。 这次,我们也简单体验了一下这套模拟考试流程,接下来就一起看看。 实测用Gemini备考SAT 谷歌这次确实没少下功夫,直接拉来了老牌机构The Princeton Review,把那一整套经过验证的SAT模拟题全都塞进了Gemini里。 有网友表示,刚花了1500美金买princeton review的冤种真的是亏大了。 打开实际体验一下,它还真不是随便编几道题糊弄。 在对话框里敲一句「I want to take a practice SAT test.」,就能召唤出这套全真模拟考试系统。 整个测试结构基本复刻了真实的SAT流程,分成了Reading and Writing与Math两大块,每一块又细分成两个章节,难度也是循序渐进的。 而且它给 ...
57.1%的人分不清真假!Runway新视频模型太爆炸
量子位· 2026-01-22 05:39
再看看这个,湿漉逼真的头发、肉眼可见的面部雀斑、超自然的景深,有点好莱坞大片内味儿了嗷: 还没完,咱再来看这个,机械义体与人脸的融合,以及构图处理都非常到位,妥妥滴赛博大片即视感! 梦瑶 发自 凹非寺 量子位 | 公众号 QbitAI 不er,这个世界还有什么是真的?反正我是已经分不清了... 短短3秒,连续切了3个镜头,从人物脸上的皮肤纹理,到满天纷飞的大雪,细节真实到有点离谱! 你就说逼真不逼真吧… 不卖关子,就是Runway刚刚发布的——全新 「 Gen 4.5」 模型。 这次更新主打的是 图生视频 ,在 镜头控制 和 故事叙事 上,明显往next level推了一步~ 这波效果一出来,网友当场坐不住了,直呼:感觉都能吊打好莱坞制作团队了好吧?太逼真!(doge) 甚至逼真到,在Runway做的一项1000人参与的调查中,结果只有约 一半 的人能分辨出该模型和真实视频的区别…… 问题来了,那这超超超逼真的——Gen 4.5模型效果到底咋样?咱一起来看! 长故事表达能力 :能承载更长时序的内容结构,视频的完整度和长度大大提升。 精准的镜头控制能力 :镜头的景别、角度、运动轨迹以及切换节奏都更可控,输出效果 ...
Video版的Deep Research来了?先浏览再定位后精读:精度提升token消耗反降58.3%
量子位· 2026-01-22 05:39
Core Insights - The article discusses the evolution of AI Research, particularly focusing on Autonomous Agents and their ability to actively retrieve information rather than passively receive it [1] - It highlights a significant gap in current AI capabilities, specifically in video processing, where existing agents struggle to effectively analyze video content [2][4] Video Processing Challenges - Current AI agents either excel in text comprehension or can only perform limited question-answering on short video clips, failing to handle the dense information in videos [4] - The article identifies two main approaches to video processing: Direct Visual Inference, which is computationally expensive and suffers from context explosion, and Text Summarization, which loses critical visual details [8] Proposed Solution: Video-Browser - The research team introduces the Video-Browser, which aims to enhance video browsing capabilities by mimicking human-like search behaviors [5][6] - The Video-Browser employs a Pyramidal Perception architecture, processing video data in a tiered manner to balance efficiency and accuracy [10][11] Core Components of Video-Browser - The Video-Browser consists of three main components: Planner, Watcher, and Analyst [13] - The Watcher utilizes a three-stage pyramid mechanism: - Stage I: Semantic Filter, which quickly eliminates irrelevant videos using metadata analysis [14] - Stage II: Sparse Localization, which identifies potential answer time windows using subtitles and sparse frame sampling [15] - Stage III: Zoom-in, where high-frame-rate decoding and detailed visual reasoning occur within the identified time windows [16] Benchmark Testing: Video-BrowseComp - The research team created the Video-BrowseComp benchmark to evaluate the true capabilities of agents in video searching, emphasizing the need for agents to actively seek information [17] - The benchmark includes three difficulty levels, ranging from explicit retrieval to multi-source reasoning [18][20] Experimental Results - The Video-Browser achieved a 26.19% accuracy rate, outperforming existing models by 37.5% in accuracy [21] - The architecture led to a 58.3% reduction in token consumption, demonstrating significant efficiency improvements [22] Case Study - A case study illustrates the effectiveness of the Video-Browser in identifying specific details, such as the color of a pen in a film, which traditional methods failed to capture [24][26] Conclusion and Future Directions - The Video-Browser represents a significant advancement towards effective open-web video browsing, addressing the trade-off between accuracy and cost in video search [27] - The research team has made all code, data, and benchmarks open-source to encourage further research in the community [28][29]
马斯克下场抢人!xAI组建「人才狙击队」,极客版HR年薪168万
量子位· 2026-01-22 02:12
Jay 发自 凹非寺 量子位 | 公众号 QbitAI 马斯克要亲自下场抢人了。 最新消息,xAI正组建一支「AI人才狙击队」, 直接向马斯克汇报 。 这支特种队伍将与xAI的工程团队和招聘团队紧密合作,探索快速、大规模招聘优秀人才的新方法。 值得注意的是,xAI把这一岗位称为「人才工程师」,而 非传统意义上的HR 。 比起人力资源管理背景的应聘者,公司希望该岗位能由「极客」担任,以工程思维做招聘。 马斯克BOSS直聘 马斯克在搞一种很新的招聘方式。 组建一支工程思维的「AI人才狙击队」,搭建工程化的招聘体系,快速识别、触达并吸引各个领域的顶尖人才,主打一个 用工程师招工程师 体系搭好之后,团队成员还要 亲力亲为 ,从头到尾参与招聘流程,始终站在一线。 。 xAI认为,想从常规人才市场招真正的顶尖人才,没戏。还得靠 熟人推荐、线下活动、竞赛选拔、特定线上社区 ,以及各种更具创造性的渠 道。 因此,应聘者的工作重点不是在LinkedIn上发私信,而是能在各种场合如鱼得水,通过极强的判断力,一眼找到最强的那个。 此外,应聘者还需要有在人才密度极高的机构工作的经历,既推荐过优秀人才,也真正参与过招聘。 所以必须要具 ...
让机器人拥有本能反应!清华开源:一套代码实现跑酷、野外徒步两大能力
量子位· 2026-01-22 02:12
清华MARSLab团队 投稿 量子位 | 公众号 QbitAI 实现人形机器人高速跑步(2.5m/s)跨越障碍物/翻越较高障碍 核心定位:为"本能级"运动智能研究而生 人形机器人的"本能级"智能,指的是像人类一样无需预设轨迹,能通过实时感知自主应对复杂环境的能力——比如看到障碍自动调整跳跃姿 势,踩在楼梯边缘下意识保持平衡。 但长期以来,这类研究面临两大痛点:一是 "感知与运动割裂" ,要么能感知地形却只会简单行走,要么能做高难度动作却"眼盲";二是 "工具链不通用" ,高动态动作与野外locomotion研究需单独搭建环境,适配成本极高。 如何让机器人同时具备"本能反应"与复杂运动能力? 清华大学交叉信息研究院与上海期智研究院联合推出的Project-Instinct框架,给出了一个新答案。 ——专为"本能级"人形机器人运动智能研究设计,以模块化、可灵活配置的全链路工具包,让科研人员无需重复造轮子,专注突破核心技 术。 Project-Instinct旨在以"统一框架+灵活配置"打破僵局: 整套工具包从算法设计、环境搭建到真机部署,全链路围绕"本能级"智能核心,既支持高动态多接触动作的精准训练,也能适配野外 ...
高通砸钱、雷军入股!刚刚,上海诞生一个183亿手机代工巨头
量子位· 2026-01-22 02:12
Core Viewpoint - Longqi Technology, a leading global smartphone ODM, has successfully listed on the Hong Kong Stock Exchange, marking its position as the "first stock of consumer electronics ODM" in Hong Kong, with an opening price of HKD 35 per share, approximately 12.9% higher than the issue price [1][4][7]. Group 1: Company Overview - Longqi Technology holds a one-third share of the global smartphone ODM market, serving major brands such as Xiaomi, Samsung, Lenovo, Honor, OPPO, and vivo [3][22]. - The company has established a comprehensive solution matrix covering product design, hardware innovation, software platform development, lean manufacturing, supply chain integration, and quality control [11]. - Longqi's product offerings include smartphones, AI PCs, automotive electronics, tablets, smartwatches, and smart glasses, structured under a "1+2+X" framework aimed at expanding production capacity and enhancing R&D [11][12]. Group 2: Financial Performance - Longqi's revenue from 2022 to 2024 was CNY 293.4 billion, CNY 271.9 billion, and CNY 463.8 billion, with a decline of 10.3% in the first nine months of 2025 [27][28]. - The company's main revenue source is smartphones, contributing 82.7%, 80.3%, 77.9%, and 69.3% of total revenue from 2022 to 2025 [32]. - The gross profit margins from 2022 to 2024 were 8.1%, 9.5%, and 5.8%, with a recovery to 8.3% in the first nine months of 2025 due to strategic adjustments and improved project quality [36][38]. Group 3: Market Position and Client Base - Longqi is the largest smartphone ODM globally, with a market share of 32.6%, and ranks second in the consumer electronics ODM sector with a 22.4% market share [24][26]. - The company has established long-term partnerships with eight of the top ten smartphone brands, with an average collaboration duration of over five years [15][16]. - Xiaomi is Longqi's largest client, contributing significant revenue across multiple years, accounting for 45.5%, 42.4%, 37.2%, and 28.6% of total revenue from 2022 to 2025 [34][35]. Group 4: R&D and Future Prospects - Longqi has a dedicated R&D team of approximately 5,200 professionals, with R&D expenditures of CNY 15 billion, CNY 16.9 billion, CNY 20.8 billion, and CNY 19.5 billion from 2022 to 2025 [41]. - The company is actively expanding into AI and smart manufacturing, with significant progress in AIoT and new product launches, including smart glasses and AI PCs [21][19]. - Longqi's cash and cash equivalents reached CNY 6.85 billion by the end of the third quarter of 2025, indicating a strong liquidity position [42].
xAI工程师播客聊太嗨,马斯克解雇了他
量子位· 2026-01-21 10:00
料太真,真的马斯克立马跳起来开人了。 (d oge) 当事人是一名xAI工程师,名叫Sulaiman Ghori (下面就叫他阿苏吧) ,刚刚把xAI再次拱上舆论风口: 在一档播客上激情聊公司,状态相当好,口若悬河地唠了一个多小时。 并且相当实诚, 讲 的全部 是干货 ,很少有xAI员工会对外公开这么多 内部细节 。 Jay 发自 凹非寺 量子位 | 公众号 QbitAI 问题是……好像有点「实诚」过头了啊。 几乎全部都是机密等级的,直接给马斯克的「巨硬」 (MacroHard) 老底掀完了: 1、xAI内部已经把MacroHard包装成「同事」,有人去工位找「同事」, 结果发现是空桌 。 2、技术路线押注 小模型 ,不搞Scaling那套,靠「迭代速度」取胜。 3、为了部署MacroHard,xAI正考虑 租用北美约400万辆特斯拉的闲置算力 。 慷慨,太慷慨了,任何一条单拎出来都够写篇文章的程度。 有这么多炸裂的消息,自然是一经发布便火爆全网,获得无数网友点赞。 阿苏也表示唠爽了: 非常有意思! 结果没多久,悲报传来…… 咋工作没了啊!! 我已经离开了xAI。 对我的前团队和同事们只有满满的爱! 再次引爆 ...
Node.js之父:手写代码已死
量子位· 2026-01-21 10:00
Core Viewpoint - The era of human-written code is coming to an end, as AI programming tools are increasingly taking over coding tasks, fundamentally changing the programming landscape [1][28]. Group 1: Influential Figures and Their Statements - Ryan Dahl, the creator of Node.js, stated that the era of human coding is over, which garnered significant attention with over four million views [2][4]. - Salvatore Sanfilippo, the creator of Redis, echoed this sentiment by asserting that programming has been permanently altered by AI [7][8]. - Linus Torvalds, initially critical of AI-generated code, has shifted his stance, acknowledging the effectiveness of AI in coding while emphasizing that programmers will still be needed for maintenance and oversight [30][34]. Group 2: AI Programming Tools and Their Impact - AI programming tools like OpenAI Codex's Copilot have accelerated development speed by over 50% [15]. - Companies are increasingly adopting AI tools for development, with ByteDance's TRAE generating 100 billion lines of code in 2025, equivalent to the output of 3 million programmers working continuously for a year [22][23]. - A Stack Overflow report indicated that 84% of developers use AI tools, with 69% believing these tools enhance productivity [24]. Group 3: Future Trends and Predictions - Gartner predicts that by 2030, over 80% of enterprises will deeply integrate AI for coding tasks [26]. - The demand for programmers is evolving, with companies now seeking candidates proficient in AI programming tools [28]. - The shift in programming focus is moving from syntax to intent, indicating a transformation in how coding is approached in the AI era [12].
突发!xAI联创杨格过劳病离职,给马斯克干活压力山大
量子位· 2026-01-21 07:47
henry 发自 凹非寺 量子位 | 公众号 QbitAI 给马斯克干活,压力真的好大! 刚刚,xAI十二位联合创始人之一的 杨格 在推特上宣布离职。 这是继 Igor Babuschkin (25年8月)、 Christian Szegedy (25年2月)、 Kyle Kosic (24年4月)等大佬离职后,又一位离开的xAI 联创。 联创离职,在今天的硅谷并不算稀奇。 但杨格的情况,还真有点不一样—— 他在离职声明中直言,自己因 长期高强度工作 ,导致免疫系统出现问题,不得不退居幕后,专注恢复健康,并转为公司的非正式顾问。 多轮检查后,医生确认问题并非心理因素,而是莱姆病所致。 换句话说,一位年纪轻轻,30多岁的联创,"肝"出了免疫疾病。 难道说,现在的硅谷也不work-life balance了? 积劳成疾 在宣布离开xAI的推文中,杨格透露自己已确诊 莱姆病(Lyme disease) ,将离职xAI,专注于恢复健康。 他表示,症状最早出现在2025年初的一次感冒之后,此后长期受到精力下降、疲惫和身体虚弱的困扰。 (注:莱姆病是一种由蜱虫传播的细菌感染,免疫力低下者更易发展为长期症状,常表现为持续性 ...
微软打包收购OpenAI?就差一点!
量子位· 2026-01-21 07:47
但比起其他雇佣式收购,微软的这波操作更像是 项庄舞剑 ,当中的沛公,则是OpenAI员工可能流向的竞争对手。 当时微软给全员开出了250亿美元的保底承诺,在董事会面前为奥特曼和OpenAI员工站台。 这给了OpenAI员工集体辞职逼宫董事会的底气,并 最终促成了奥特曼的回归 。 克雷西 鹭羽 发自 凹非寺 量子位 | 公众号 QbitAI 太抓马了!奥特曼的宫斗大戏,竟然还有内幕? 就在奥特曼被董事会罢免的那个周末, 微软差点来了一波雇佣式收购。 短短一天,资金、法律文书就全部到位,连名字都想好了…… 但其实在两家合作的初期,微软高层也曾对这个烧钱的实验室充满疑虑,甚至在内部邮件中质疑其商业化能力不过是一场缺乏回报的幻梦。 另一边,被救起的奥特曼也不想被微软一家完全掌控,在去年完成了重组并立刻签下微软竞争对手亚马逊的算力大单。 当然,对OpenAI虎视眈眈的,还远不止是微软和亚马逊…… 奥特曼被逼宫,微软紧急救场 回到OpenAI这场轰轰烈烈的 宫斗大戏 ,想必大家已经不算陌生。 从无预警开除奥特曼→新CEO走马上任→员工集体联名施压董事会,再到奥特曼回宫,五天时间极限拉扯,进度条快到飞起~ 但你以为事情已经 ...