Workflow
Gauss
icon
Search documents
AI Agent搞定世纪首次菲尔兹奖成果形式化!一周时间独立完成,20万行代码已公开
量子位· 2026-03-03 10:11
Core Insights - The article discusses a significant achievement in the field of mathematics, where an AI named Gauss, developed by Math Inc., completed the formal verification of a theorem related to sphere packing in 5 days, a task that would typically take 6 months for human experts [1][17]. Group 1: AI Achievement - Gauss is recognized for formalizing the work of Maryna Viazovska, who won the Fields Medal in 2022 for her contributions to the optimal sphere packing problem in 8 and 24 dimensions [4][8]. - This is the first time in this century that a Fields Medal achievement has been fully formalized [5]. - The project involved a total of 200,000 lines of Lean code, making it the largest single-purpose Lean formalization project in history [6]. Group 2: Methodology and Impact - Gauss autonomously detected and corrected errors in the original paper during the verification process, showcasing its advanced capabilities [7][18]. - The AI's work included generating 450,000 lines of code to complete the formalization of the 24-dimensional case after initially verifying the 8-dimensional case [17]. - The successful formalization of such high-level mathematical results indicates that AI agents like Gauss can significantly accelerate research in mathematical frontiers [18]. Group 3: Future Implications - The expansion of automated formalization is expected to transform the mathematical knowledge system and the way mathematical discoveries are made [19]. - Researchers optimized the code generated by Gauss, reducing it from a peak of 500,000 lines to approximately 200,000 lines for better maintainability [20]. - The code has been made publicly available on GitHub, promoting transparency and collaboration in mathematical research [21].
菲尔兹奖成果首次被AI完整形式化,Gauss20万行代码改写数学史?
机器之心· 2026-03-03 08:14
Core Insights - Math, Inc. has achieved a significant milestone in the field of mathematics by formalizing the proof of the optimal sphere packing theorem in 8-dimensional and 24-dimensional spaces using their AI, Gauss [1][3][10] - The formalization of this theorem marks a revolutionary moment in the collaboration between AI and human mathematicians, showcasing the rapid advancements in AI-assisted mathematical research [6][21][22] Company Overview - Math, Inc. was founded by Christian Szegedy, a former co-founder of xAI and chief scientist at Morph Labs, focusing on developing AI technologies for formal verification in mathematics [3][15] - Gauss, the AI developed by Math, Inc., is designed to assist mathematicians in formal verification tasks, combining natural language reasoning with formal reasoning capabilities [15][16] Achievements - The formal proof of the optimal sphere packing theorem in 8 dimensions was completed in just two weeks, while the 24-dimensional proof involved over 200,000 lines of code [18][23] - Gauss previously completed the formalization of the Prime Number Theorem in just three weeks, a task that took human mathematicians 18 months [3][16] Collaboration and Impact - The collaboration between Math, Inc. and human mathematicians has led to significant advancements, including the identification and correction of errors in existing proofs [16][21] - The formalization of these mathematical results is expected to accelerate research processes and deepen the understanding of the interconnectedness of mathematical knowledge [23][24] Future Directions - Math, Inc. plans to continue collaborating with the sphere packing project and other formal mathematical libraries to ensure the long-term usability and maintainability of the code generated by Gauss [24]
腾讯研究院AI速递 20250915
腾讯研究院· 2025-09-14 16:01
Group 1 - OpenAI and Microsoft have released a non-binding cooperation memorandum addressing key issues such as cloud service hosting, intellectual property ownership, and AGI control, but the final cooperation agreement is still pending [1] - OpenAI plans to establish a public benefit corporation (PBC) with a valuation exceeding $100 billion, where a non-profit organization will hold equity and maintain control, becoming one of the most resource-rich charitable organizations globally [1] - OpenAI faces significant cost pressures, expecting to burn through $115 billion before 2029, with $100 billion needed for server leasing in 2030, leaving little room for error in the coming years [1] Group 2 - Utopai, the world's first AI-native film studio founded by a former Google X team, has generated $110 million in revenue from two film projects and secured a spot at the Cannes Film Festival [2] - Utopai has overcome three major challenges in AI video generation: consistency, controllability, and narrative continuity, achieving millisecond-level lip-sync precision with 3D data training [2] - The company positions itself as a content + AI provider rather than a pure tool supplier, receiving support from top Hollywood resources, including an Oscar-nominated screenwriter for the film "Cortes" [2] Group 3 - MiniMax has launched its new music generation model, Music 1.5, capable of creating complete songs up to 4 minutes long, featuring strong control, natural-sounding vocals, rich arrangements, and clear song structure [3] - The model supports customizable music features across "16 styles × 11 emotions × 10 scenes," enabling the generation of different vocal tones and the inclusion of Chinese traditional instruments [3] - MiniMax's multi-modal self-developed capabilities are now available to global developers via API, applicable in various scenarios such as professional music creation, film and game scoring, and brand-specific audio content [3] Group 4 - Meituan's first AI Agent product, "Xiao Mei," has entered public testing, allowing users to order coffee, find restaurants, and plan breakfast menus through natural language commands, significantly simplifying the ordering process [4] - "Xiao Mei" is based on Meituan's self-developed Longcat model (with 560 billion total parameters), capable of fully automating the selection to payment process based on user preferences and location [4] - Despite the advancements, the AI Agent currently has limitations, such as handling complex ambiguous requests and lacking voice response capabilities, with plans for future optimization in personalization and proactive service [4] Group 5 - Xiaohongshu's audio technology team has released the next-generation dialogue synthesis model, FireRedTTS-2, addressing issues like poor flexibility, frequent pronunciation errors, unstable speaker switching, and unnatural prosody [5][6] - The model has been trained on millions of hours of voice data, supporting sentence-by-sentence generation and multi-speaker tone switching, capable of mimicking voice tones and speaking habits from a single audio sample [6] - FireRedTTS-2 has achieved industry-leading levels in both subjective and objective evaluations, supporting multiple languages including Chinese, English, and Japanese, and serves as an industrial-grade solution for AI podcasting and dialogue synthesis applications [6] Group 6 - Bilibili has open-sourced its new zero-shot voice synthesis model, IndexTTS2, addressing industry pain points by achieving millisecond-level precise duration control for AI dubbing [7] - The model employs a "universal and compatible autoregressive architecture for voice duration control," achieving a duration error rate of 0.02%, and utilizes a two-stage training strategy to decouple emotion and speaker identity [7] - The system consists of three core modules: T2S (text to semantics), S2M (semantics to mel-spectrogram), and BigVGANv2 vocoder, allowing for emotional control in a straightforward manner, with significant implications for cross-language industry applications [7] Group 7 - Meta AI has released the MobileLLM-R1 series of small parameter-efficient models, including sizes of 140M, 360M, and 950M, optimized for mathematics, programming, and scientific questions [8] - The largest 950M model was pre-trained using approximately 2 trillion high-quality tokens (with a total training volume of less than 5 trillion), achieving performance comparable to or better than the Qwen3 0.6B model trained on 36 trillion tokens [8] - The model outperforms Olmo 1.24B by five times and SmolLM2 1.7B by two times on the MATH benchmark, demonstrating high token efficiency and cost-effectiveness, setting a new benchmark among fully open-source models [8] Group 8 - An AI agent named "Gauss" completed a mathematical challenge that took Terence Tao's team 18 months to solve, formalizing the strong prime number theorem (PNT) in Lean in just three weeks [9] - Developed by a company founded by Christian Szegedy, an author of the ICML'25 time verification award, Gauss generated approximately 25,000 lines of Lean code, including thousands of theorems and definitions [9] - Gauss can assist top mathematicians in formal verification, breaking through core challenges in complex analysis, with plans to increase the total amount of formalized code by 100 to 1,000 times in the next 12 months [9] Group 9 - Sequoia Capital USA has interpreted the new AI landscape following the release of GPT-5 by OpenAI, which allows for a more natural interaction resembling conversations with a PhD-level expert, incorporating "thinking" capabilities and a unified model to reduce hallucinations [10][11] - Other players have also launched strategic new products ahead of the release, including Anthropic's Claude Opus 4.1 targeting high-risk enterprise scenarios and Google's Gemini 2.5 Deep Think and Genie 3 enhancing reasoning and simulation capabilities [10][11] - The new AI landscape has been reshaped, with OpenAI dominating both open and closed AI ecosystems, Anthropic focusing on enterprise-level precision and stability, and Google emphasizing long-term foundational research [11] Group 10 - DeepMind's science lead, Pushmeet Kohli, revealed that the team targets three types of problems: transformative challenges, those recognized as unsolvable in 5-10 years, and those that DeepMind is confident it can quickly tackle [12] - The team has successfully transferred capabilities from specialized models like AlphaProof to the Gemini general model, achieving International Mathematical Olympiad gold medal levels with DeepThink [12] - The future goal is to create a "scientific API" that allows global scientists to share AI capabilities, lowering research barriers and enabling ordinary individuals to contribute to Nobel-level achievements [12]
陶哲轩18个月没搞定的数学挑战,被这个“AI高斯”三周完成了
3 6 Ke· 2025-09-14 05:16
Core Insights - The new AI agent named Gauss has demonstrated remarkable capabilities by solving a mathematical challenge in just three weeks, a task that took renowned mathematicians 18 months to make limited progress on [2][4][6]. Company Overview - Gauss is developed by a company called Math, which specializes in AI applications for formal verification in mathematics [4][6]. - The founder of Math, Christian Szegedy, is a notable figure in the AI community, recognized for his contributions to the field, including the influential paper on Batch Normalization [13][15][17]. Technical Achievements - Gauss generated approximately 25,000 lines of Lean code, encompassing over a thousand theorems and definitions, a scale of formal proof that typically requires years to complete [7]. - The largest previous formalization projects took up to a decade and involved significantly more code, highlighting Gauss's efficiency [7]. - The Math team has partnered with Morph Labs to develop the Trinity infrastructure, enabling Gauss to operate with thousands of concurrent agents, each requiring substantial computational resources [8]. Future Prospects - The Math team anticipates that Gauss will significantly reduce the time required to complete large mathematical projects, with plans to increase the volume of formalized code by 100 to 1,000 times within the next 12 months [9]. - This advancement is seen as a step towards achieving "verifiable superintelligence" and creating a "generalist machine mathematician" [9].
啥?陶哲轩18个月没搞定的数学挑战,被这个“AI高斯”三周完成了
量子位· 2025-09-14 05:05
Core Viewpoint - The new AI agent named Gauss has demonstrated remarkable capabilities by solving a mathematical challenge in just three weeks, a task that took renowned mathematicians 18 months to make progress on [2][4][8]. Group 1: Gauss and Its Capabilities - Gauss is developed by a company called Math and is the first AI agent capable of assisting top mathematicians in formal verification through autoformalization [5]. - The process of formalization involves converting human-written mathematical content into a machine-readable format, allowing for verification of correctness [6]. - Gauss has generated approximately 25,000 lines of Lean code, which includes over a thousand theorems and definitions, a task that typically requires years to complete [10][11]. Group 2: Comparison with Historical Projects - The largest historical formalization projects have taken up to ten years and produced around 500,000 lines of code, while Gauss's output is significantly faster and more efficient [12]. - In comparison, the standard mathematical library Mathlib, which contains about 2 million lines of code and 350,000 theorems, took over 600 contributors eight years to develop [13]. Group 3: Technical Infrastructure and Future Plans - To support Gauss's operations, Math collaborated with Morph Labs to develop the Trinity infrastructure, which involves thousands of concurrent agents, each with its own Lean environment, consuming several terabytes of cluster memory [14]. - The Math team anticipates that Gauss will significantly reduce the time required to complete large mathematical projects and plans to increase the total amount of formalized code by 100 to 1,000 times within the next 12 months [15][16]. Group 4: Insights from Mathematicians - Mathematician Terence Tao highlighted the importance of clearly defining both explicit and implicit goals in formalization projects, especially as powerful AI tools change the dynamics of project execution [18][19]. Group 5: Company Background - The founder of Math, Christian Szegedy, is recognized for his contributions to the field, including co-authoring the influential paper on Batch Normalization, a key technology for scaling deep learning [21][24][26].
陶哲轩团队1年半项目,被他3周搞定,曾与LeCun吵翻天,如今AI大佬创业用智能体震惊整个学界?
3 6 Ke· 2025-09-12 09:01
Core Insights - Christian Szegedy, former co-founder of xAI and chief scientist at Morph Labs, has launched a new company called Math Inc., focused on creating verifiable superintelligence through automated formalization technology [1] - Math Inc. has developed an autonomous formalization agent named Gauss, which has successfully formalized the Prime Number Theorem in just three weeks, a task that previously took 18 months for a team of experts [2][3] Group 1: Gauss and Its Capabilities - Gauss is the first autonomous formalization agent designed to assist mathematicians in formal verification tasks, achieving significant milestones in a short time frame [2] - The agent autonomously operates for over 10 hours, completing approximately 95% of formalization and proof tasks, with human involvement required only for the remaining portion [6] - Gauss generated around 25,000 lines of Lean code, including over 1,000 theorems and definitions, marking a significant achievement in formal proof history [4] Group 2: Historical Context and Challenges - The process of converting human mathematical achievements into verifiable machine code has historically been costly and complex, often requiring specialized talent [3] - Previous efforts in formalizing the Prime Number Theorem took 18 months, highlighting the efficiency of Gauss in comparison [3] - The project faced challenges in scaling the Lean verification environment to support thousands of concurrent agents, which was addressed through collaboration with Morph Labs [6] Group 3: Academic Reception and Future Prospects - Gauss has received positive feedback from the academic community, with experts recognizing its potential to revolutionize formal verification and mathematical collaboration [7] - Math Inc. aims to enhance Gauss's capabilities and autonomy in future versions, with plans to significantly reduce the time required for large formalization projects [7] - The company is currently engaging with mathematicians for beta testing, indicating a proactive approach to integrating Gauss into practical applications [7] Group 4: Background of Christian Szegedy - Christian Szegedy has a notable background, having previously worked at Google and contributed to significant advancements in deep learning, including the development of Batch Normalization [8][10] - His experience in AI and formal verification positions Math Inc. as a promising player in the field of verifiable superintelligence [8][9]
陶哲轩团队1年半项目,被他3周搞定!曾与LeCun吵翻天,如今AI大佬创业用智能体震惊整个学界?
AI前线· 2025-09-12 07:13
Core Viewpoint - Math Inc. has launched a new automated formalization agent named Gauss, which has successfully formalized the Prime Number Theorem in a significantly shorter time compared to traditional methods, showcasing the potential of AI in mathematical verification [2][4][5]. Group 1: Company Overview - Math Inc. was founded by Christian Szegedy, a former co-founder of xAI and chief scientist at Morph Labs, focusing on creating verifiable superintelligence through automated formalization technology [2][12]. - The company has developed Gauss, the first automated formalization agent designed to assist mathematicians in formal verification tasks [4][10]. Group 2: Technological Achievements - Gauss completed the formalization of the Prime Number Theorem in just three weeks, a task that previously took a team 18 months to achieve [5][6]. - The agent generated approximately 25,000 lines of Lean code, including over 1,000 theorems and definitions, marking a significant milestone in formal verification [6][10]. - Gauss can autonomously operate for over 10 hours, completing 95% of the formalization and proof work, with human intervention required only for the remaining tasks [8][10]. Group 3: Future Prospects - Math Inc. aims to enhance Gauss's capabilities and autonomy, with plans to significantly reduce the time required for large formalization projects within the next 12 months [10]. - The company is currently engaging with mathematicians for beta testing and aims to provide practical tools for mathematicians and proof engineers [10][9]. Group 4: Academic Recognition - Gauss has received positive feedback from the academic community, with experts highlighting its potential to revolutionize human-computer collaboration in mathematics [9][10].