QbitAI (量子位)
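Gemini proves a new mathematical theorem, entirely offline
QbitAI · 2026-01-16 12:20
Wen Le (闻乐), reporting from Aofeisi | QbitAI (WeChat official account: QbitAI)
Gemini has let its abilities slip out once again. FullProof, an internal math-specialist model, worked without any internet access and helped mathematicians prove a new theorem in algebraic geometry: "Gemini's proof is rigorous, correct, and elegant... it is an insight I would have been proud to come up with myself." So let's see just how rigorous and how elegant it is.
Gemini plants the seed of the key idea
The result is an equivalence statement for the motivic class of the space of genus-0 maps to flag varieties. As an intuitive picture: take a bunch of unbroken rubber bands and place them, following certain rules, into a system of nested boxes; the set of all possible placements forms a space. The new result shows that this space can be expressed as a combination of a general linear group and an affine space, so later work on related questions can analyze this simpler model directly. In this research, Gemini laid the groundwork for the key idea and could even produce counterexamples on its own, a performance that drew praise from the president of the American Mathematical Society.
The core problem of the paper is to determine an equivalent form for the motivic class of the space of genus-0 maps to flag varieties. Flag variety: a geometric structure built from nested subspaces of different dimensions, like a storage system in which a large box holds a medium box, which holds a small box. Genus-0 maps: all the ways of placing a smooth curve with no holes (like a rubber band) into this nested space. Grothendieck group: a mathematical tool in algebraic geometry used to classify and catalogue geometric spaces, ...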
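Peking University's School of Mathematical Sciences has a new dean: post-80s academician Liu Ruochuan
QbitAI · 2026-01-16 07:21
Yu Yang (鱼羊), reporting from Aofeisi | QbitAI (WeChat official account: QbitAI)
Liu Ruochuan, the first academician born in the 1980s, is now dean of Peking University's School of Mathematical Sciences. The school's official website now shows that Liu has taken over the post. Liu was born in May 1980 in Shenyang, Liaoning. He won a gold medal at the 40th International Mathematical Olympiad (IMO) in 1999 and was admitted by recommendation to the Peking University School of Mathematical Sciences that same year. At Peking University he studied under Professor Tian Gang and finished his undergraduate and master's coursework in five years, receiving a bachelor of science degree in 2002 and a master of science degree in 2004. After earning his Ph.D. from MIT in 2008, he went to Paris 7 University in France for postdoctoral research. After returning to Peking University in 2012, he taught first at the Beijing International Center for Mathematical Research and then at the School of Mathematical Sciences, and became the school's vice dean at the end of 2021. The previous dean was Professor Chen Dayue, born in 1963.
The new dean, Liu Ruochuan
In November 2025, Liu was elected an academician of the Chinese Academy of Sciences at age 44, the youngest among the newly elected members of the two academies and the first academician born in the 1980s. His main research areas are arithmetic geometry and algebraic number theory, with work focused on p-adic Hodge theory, p-adic automorphic forms, and algebraic K-theory, all major frontiers of contemporary mathematics. According to the school's website, his work has made foundational contributions to p-adic Hodge theory, establishing the foundations of relative p-adic Hodge theory ...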
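Dimensity 9500s officially launches: on-device image expansion and object removal, Genshin Impact at full frame rate on the highest graphics settings
QbitAI · 2026-01-16 07:21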
Core Viewpoint - The article discusses the increasing trend of advanced AI functionalities being integrated into mid-range chips, exemplified by MediaTek's newly launched Dimensity 9500s, which offers flagship-level features at a lower price point [1][2][5]. Group 1: AI Features of Dimensity 9500s - The Dimensity 9500s integrates MediaTek's latest flagship NPU, enabling smooth operation of complex AI models for tasks such as voice summarization and content analysis [3][7]. - The chip's AI capabilities extend to photo and video processing, allowing for dynamic video creation from still images and real-time focus tracking during fast-moving scenes [9][10]. - AI features also include automatic background enhancement and object removal, improving the overall quality of photos taken [12][13]. Group 2: Performance Specifications - The Dimensity 9500s is built on TSMC's advanced 3nm process technology, housing nearly 30 billion transistors [16]. - It employs a full big-core architecture with a Cortex-X925 core clocked at 3.73GHz, supported by a large 29MB cache for efficient data handling [18][20]. - The chip's second-generation scheduling engine and memory compression technology enhance app launch speeds by 44%, ensuring smooth multitasking even with multiple background applications [22][24]. Group 3: Gaming and Graphics Capabilities - The Immortalis-G925 GPU provides top-tier graphics quality while maintaining approximately 10% lower power consumption compared to other flagship products [26]. - The chip supports hardware-level ray tracing, delivering realistic lighting effects in mobile games, and can achieve 90 frames per second in demanding open-world games [30][31]. - The Dimensity 9500s is set to debut in the Redmi Turbo 5 Max, highlighting its gaming capabilities [39].
NVIDIA DLSS 4.5 arrives: an upgraded Transformer removes ghosting, and "stitched-together frames" go up to 6x with dynamic adjustment
QbitAI · 2026-01-16 07:21
Core Viewpoint - NVIDIA has introduced DLSS 4.5 at CES 2026, enhancing gaming experiences by addressing key player concerns regarding image quality and frame rates through a "dual-core" strategy [1][3]. Group 1: Image Quality Enhancement - The first core focuses on image quality, utilizing an upgraded super-resolution technology based on the second-generation Transformer model [4][11]. - This new model boasts five times the computational power of the first generation and is trained on a significantly expanded high-fidelity dataset [12]. - The upgraded model directly processes in the game's native linear space, improving clarity and reducing artifacts like ghosting and flickering, especially in high-contrast scenes [17][19]. - Users of all GeForce RTX graphics cards can access the super-resolution feature through an NVIDIA App update, ensuring enhanced stability and clarity [21]. Group 2: Performance Improvement - The second core is dedicated to performance, specifically designed for the RTX 50 series, featuring dynamic multi-frame generation [6][23]. - DLSS 4.5 introduces a new six-fold multi-frame generation mode, allowing for the generation of up to five additional frames for each traditional rendered frame, significantly enhancing game smoothness [25]. - For instance, the game "Black Myth: Wukong" can now run at 240 fps, compared to its previous frame rate of under 190 fps [27]. - The dynamic multi-frame generation adapts to GPU performance and monitor refresh rates, optimizing frame rates while maintaining quality and responsiveness [30][33]. Group 3: Display Technology Advancement - NVIDIA has also unveiled G-SYNC Pulsar, a significant evolution of G-SYNC technology aimed at reducing motion blur in high-speed visuals [34]. - Demonstrations show that this technology can enhance the visual clarity of a 360Hz monitor to the equivalent of 1000Hz [35]. - Initial support for G-SYNC Pulsar has been rolled out by manufacturers such as ASUS, AOC, and MSI [36].
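The frame-generation arithmetic in Group 2 can be made concrete with a small sketch. It assumes only what the summary states: in an N-fold mode, each traditionally rendered frame is accompanied by N - 1 generated frames. The function names are hypothetical, and pick_multiplier is an illustrative heuristic for "adapting to the monitor refresh rate", not NVIDIA's actual algorithm.

```python
def displayed_fps(rendered_fps: float, multiplier: int) -> float:
    """Output frame rate when every rendered frame comes with
    (multiplier - 1) generated frames, as in an N-fold mode."""
    if multiplier < 1:
        raise ValueError("multiplier must be at least 1")
    generated_per_rendered = multiplier - 1
    return rendered_fps * (1 + generated_per_rendered)


def rendered_fps_needed(target_fps: float, multiplier: int) -> float:
    """Rendered frame rate required to reach target_fps in an N-fold mode."""
    if multiplier < 1:
        raise ValueError("multiplier must be at least 1")
    return target_fps / multiplier


def pick_multiplier(rendered_fps: float, refresh_hz: float,
                    max_multiplier: int = 6) -> int:
    """Hypothetical heuristic: choose the largest multiplier whose output
    does not exceed the monitor refresh rate (assumption, not NVIDIA's logic)."""
    for m in range(max_multiplier, 0, -1):
        if rendered_fps * m <= refresh_hz:
            return m
    return 1


if __name__ == "__main__":
    print(displayed_fps(40, 6))        # 6x mode: 1 rendered + 5 generated frames
    print(rendered_fps_needed(240, 6))
    print(pick_multiplier(70, 240))
```

Under these assumptions, an output of 240 fps in the six-fold mode would correspond to 40 traditionally rendered frames per second; the summary does not state the rendered frame rate of any specific game, so this is an illustration of the ratio only.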
QbitAI is hiring editors and writers
QbitAI · 2026-01-16 03:43
Core Viewpoint - The article emphasizes the ongoing AI boom and invites individuals to join the company "Quantum Bit," which focuses on tracking AI advancements and has established itself as a leading content platform in the industry [1]. Group 1: Job Opportunities - The company is hiring for three main directions: AI Industry, AI Finance, and AI Product, with positions available for both experienced professionals and fresh graduates [2][4]. - Positions are open for various levels, including editors, lead writers, and chief editors, with a focus on matching roles to individual capabilities [6]. Group 2: Job Responsibilities - **AI Industry Direction**: Responsibilities include tracking innovations in infrastructure, such as chips, AI infrastructure, and cloud computing, as well as interpreting technical reports from conferences [6][7]. - **AI Finance Direction**: Focuses on venture capital, financial reports, and capital movements within the AI industry, requiring strong analytical skills and a passion for interviews [11]. - **AI Product Direction**: Involves monitoring AI applications and hardware developments, producing in-depth evaluations of AI products, and engaging with industry experts [11]. Group 3: Benefits and Work Environment - Employees will have the opportunity to engage with cutting-edge AI technologies, enhance their work efficiency through new tools, and build personal influence in the AI field [6]. - The company offers competitive salaries, comprehensive benefits including social insurance, meal allowances, and performance bonuses, along with a dynamic and open work culture [6]. Group 4: Company Growth - By 2025, Quantum Bit aims to have over 2.4 million subscribers on WeChat and more than 7 million users across platforms, with a daily reading volume exceeding 2 million [12].
Andrew Ng launches a new course on OCR: document extraction with agents
QbitAI · 2026-01-16 03:43
Core Insights - The article discusses the resurgence of Optical Character Recognition (OCR) technology driven by advancements in AI models, particularly in the context of a new course by Andrew Ng that focuses on "Agentic Document Extraction" (ADE) [2][3][4]. Group 1: OCR Technology Developments - Major companies like DeepSeek, Zhipu, Alibaba, and Tencent are intensively updating their OCR technologies, indicating a competitive landscape [7][14]. - DeepSeek's OCR technology utilizes a specialized visual encoder to compress lengthy documents into visual tokens, achieving a 97% accuracy rate while processing over 200,000 pages daily with a single A100-40G GPU [9]. - Zhipu's Glyph framework converts long texts into compact images, overcoming context window limitations, and their GLM-4.6V series supports complex document types with high performance [12][13]. Group 2: Agentic Document Extraction (ADE) - The ADE approach enhances traditional OCR by integrating a "visual-first" strategy to understand document layouts and relationships, ensuring data accuracy and intelligent processing [24][25]. - The DPT (Document Pre-trained Transformer) model used in ADE achieved an accuracy of 99.15% on the DocVQA benchmark, surpassing human performance [28][29]. - ADE's robustness allows it to accurately parse complex documents, including large tables and handwritten formulas, while assigning unique IDs and pixel coordinates to data blocks for precise extraction [31][32]. Group 3: Practical Applications and Deployment - The course provides practical guidance on deploying ADE technology on cloud platforms like AWS, enabling automated document processing pipelines [34]. - The integration of visual grounding technology allows for direct referencing of original documents when AI provides answers, enhancing transparency and reliability [33].
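Open-source framework lets code AI learn from GitHub: bug-fix rate jumps to 69.8%, a record
QbitAI · 2026-01-16 03:43
Contributed by the MemGovern team | QbitAI (WeChat official account: QbitAI)
When human programmers hit a tricky bug, they usually search online for how others have solved it. Today's AI systems are starting to gain web search capabilities, but they still struggle to turn that online experience into bug-fixing ability. Having AI learn the workflow of human programmers may help, and a project team called MemGovern recently tried this approach with good results.
In automated software engineering (SWE), code agents driven by large language models have changed the programming paradigm, but they generally suffer from a "closed-world" cognitive limitation: existing agents tend to fix bugs from scratch or rely only on local context within the repository, ignoring the vast body of historical human experience accumulated on platforms such as GitHub. Human engineers, by contrast, often search open-source communities and borrow from past solutions to similar problems. Letting agents use this "open-world" experience directly is very hard, however, because real Issue and Pull Request (PR) data are full of unstructured social noise, ambiguous descriptions, and fragmented information. To break through this barrier, the open-source academic community QuantaAlpha, together with the University of Chinese Academy of Sciences (UCAS), the National University of Singapore (NUS), Beijing ...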
Ads that don't need to be filmed? A deep dive into Meituan Flash Purchase's new AIGC marketing case
QbitAI · 2026-01-16 03:43
Core Insights - The article discusses how Meituan's flash purchase service effectively utilizes AIGC (AI-Generated Content) technology to enhance brand value rather than merely as a gimmick [2][3][45] - The shift in marketing focus is highlighted, moving from generating eye-catching content to clearly conveying brand core values [4][6][45] Group 1: AIGC in Marketing - AIGC should be viewed as a "brand value amplifier" rather than just a tool for flashy content [3][45] - The marketing landscape is evolving, with a greater emphasis on whether AI-generated content communicates the brand's core message effectively [6][45] - Meituan's flash purchase service created two AIGC marketing videos that serve as a case study for how technology can articulate brand messages [7][45] Group 2: Video Analysis - The first video, dubbed "Journey to the West," emphasizes the speed of Meituan's service, showcasing the concept of "instant retail" [18][30] - The second video focuses on the diversity of products available through Meituan, illustrating the idea that "everything is reachable" [33][42] - Both videos successfully integrate AIGC to convey the core values of speed and variety, enhancing viewer perception of the brand [43][45] Group 3: AI's Role in Marketing - AI is transitioning from a mere efficiency tool to a foundational element in narrative construction for marketing [48][50] - The use of AI allows for the realization of creative ideas that were previously constrained by budget and technical limitations [52][54] - The successful implementation of AIGC in Meituan's marketing demonstrates a shift in how brands can leverage technology to express their core values [56][75] Group 4: Meituan's Unique Position - Meituan's flash purchase service is uniquely positioned to utilize AIGC due to its business model focused on instant delivery and diverse product offerings [59][66] - The alignment between the immediacy of AI-generated content and Meituan's service promise enhances the effectiveness of their marketing strategy [63][66] - The case study illustrates that effective AIGC marketing requires a clear understanding of brand identity and the appropriate application of AI capabilities [69][70]
Former OpenAI core members hit infighting again at their new startup
QbitAI · 2026-01-15 23:57
Core Viewpoint - The article discusses the unexpected departure of Barret Zoph from Thinking Machines Lab due to alleged unethical behavior and his swift return to OpenAI, raising questions about the circumstances surrounding his exit and the implications for both companies [4][12][41]. Group 1: Departure and Return - Barret Zoph was reportedly terminated from Thinking Machines Lab due to "unethical behavior" and was quickly replaced by Soumith Chintala as the new CTO [4][5][8]. - Following his termination, Zoph announced his return to OpenAI, expressing excitement about rejoining the team, which had been in preparation for several weeks [12][13][41]. - The rapid transition from Thinking Machines Lab to OpenAI has sparked speculation about the nature of Zoph's departure and the internal dynamics at both companies [16][23][41]. Group 2: Company Dynamics - Thinking Machines Lab, co-founded by Zoph and others, is currently valued at $50 billion, making it one of the hottest startups in Silicon Valley [32]. - The article highlights a trend of co-founders leaving top AI labs, with OpenAI losing 8 out of 11 co-founders and Thinking Machines Lab losing 3 out of 6 [44]. - The internal conflicts at Thinking Machines Lab, particularly regarding Zoph's departure, suggest deeper issues within the company, as it lost a key co-founder [43][44]. Group 3: Background on Barret Zoph - Barret Zoph was a significant contributor to OpenAI, particularly in the development of GPT-4, and had previously worked at Google Brain [26][30]. - His expertise in optimizing foundational models has been crucial for the practical applications of AI technologies like ChatGPT [28][30]. - The return of Zoph, along with Luke Metz and Sam Schoenholz, is seen as a substantial gain for OpenAI, especially after the recent loss of another research vice president [41][42].
Microsoft and Google are hiring "electricians" in force
QbitAI · 2026-01-15 23:57
Core Insights - The competition for AI talent among tech giants has expanded beyond the computer field to include energy experts [1][3] - Major companies are significantly increasing their hiring in the energy sector to address power supply issues critical for AI development [8][20] Group 1: Hiring Trends - Since 2022, Microsoft has hired over 570 employees in the energy sector [4][11] - Amazon leads with 605 new hires in energy, including AWS [10] - Google has added over 340 energy-related positions [11] - Other companies like Apple and NVIDIA have also increased their energy-related roles by nearly 200 [12] Group 2: Talent Acquisition - Microsoft has poached Betsy Beck from Google, who has over 15 years of experience in the energy field [14] - Google recently hired Eric Schubert from BP and Tyler Norris, a recognized climate figure, to strengthen its energy strategy [16][17] - The competition for skilled candidates in energy infrastructure is intensifying due to limited talent pools [18][19] Group 3: Energy Supply Challenges - Microsoft CEO Satya Nadella stated that the lack of electricity is a more critical issue than the shortage of GPUs for AI development [8][20] - The primary challenge is not chip supply but rather the availability of power and the infrastructure to support data centers [21][22] - Elon Musk emphasized that energy will become the essence of currency, highlighting the shift in limitations for AI development [22] Group 4: Long-term Investments - Tech giants are investing in nuclear energy to secure future power supplies, with Meta partnering with several nuclear companies for operational support [29] - Companies are also exploring nuclear fusion projects, with significant investments from major players like Microsoft and NVIDIA [33][34] - Improving energy efficiency in data centers is another avenue being pursued, which ties back to the need for skilled talent [35][36]