AlphaEvolve
Search documents
百度亮出秘密武器:一个自我演化的AI,给出了人类做不到的最优解
机器之心· 2025-11-14 09:30
Core Insights - The article discusses the rapid evolution of AI from being mere executors to becoming inventors, highlighting the introduction of Baidu's FM Agent, a self-evolving intelligent agent capable of solving complex problems autonomously [1][6][30] Group 1: AI Capabilities and Innovations - FM Agent can autonomously generate and optimize algorithms, significantly reducing the time required for tasks that would take human experts days or even weeks [4][8] - The system combines large language models with evolutionary search algorithms to tackle real-world problems, demonstrating a leap from executing commands to discovering solutions independently [6][8] - The agent's performance has been validated in various benchmarks, achieving a medal rate of 43.56% on MLE-Bench, outperforming the human median by 51.56% [13] Group 2: Technical Features - FM Agent employs four core technologies: automated machine learning processes, combination optimization, GPU kernel generation, and mathematical problem-solving capabilities [13][14] - The system operates through a workflow that includes cold start initialization, adaptive diversity sampling, and a distributed asynchronous infrastructure based on the Ray framework [12][14] Group 3: Industry Applications - FM Agent has shown effectiveness in multiple sectors, including finance, urban traffic optimization, and large-scale engineering projects, providing solutions that are faster and more efficient than traditional methods [25][18] - The agent can abstract real-world problems into mathematical algorithms, continuously iterating and optimizing solutions based on clear evaluation metrics [18][20] Group 4: Future Implications - The emergence of FM Agent signifies a shift towards a new paradigm where humans define problems and AI executes solutions, potentially transforming productivity across various industries [22][30] - Baidu's FM Agent has already attracted over 1,000 enterprises for testing, indicating strong interest and potential for widespread application in sectors like transportation, energy, and finance [33][32]
陶哲轩力推AlphaEvolve:解决67个不同数学问题,多个难题中超越人类最优解
3 6 Ke· 2025-11-07 07:40
Core Insights - The article discusses the introduction of AlphaEvolve, a powerful new tool for mathematical discovery, co-authored by Bogdan Georgiev and Terence Tao [1][5]. Group 1: AlphaEvolve's Capabilities - AlphaEvolve was tested on 67 mathematical problems across various fields, including combinatorial mathematics, geometry, mathematical analysis, and number theory [3]. - The system outperformed traditional tools in scalability, robustness, and interpretability, and it can autonomously discover novel mathematical constructs, surpassing existing human optimal results in some cases [5][6]. Group 2: Human-AI Collaboration - In the Nikodym set problem, AlphaEvolve generated initial constructs that, while not optimal, provided valuable insights for human researchers, leading to improved upper bounds in a subsequent independent paper [6][7]. - Similarly, in the arithmetic Kakeya conjecture, AlphaEvolve played a crucial role in advancing understanding [8]. Group 3: Interpretability and Insight Generation - AlphaEvolve's ability to generate clear and interpretable program code allows human experts to analyze and extract general mathematical formulas from its outputs [10]. - For the stacking blocks problem, the system initially created a correct recursive program, which it later simplified into a more efficient explicit program, revealing the mathematical relationship with harmonic numbers [14]. Group 4: Problem-Solving Techniques - The system demonstrated its ability to navigate complex problem spaces by adapting its scoring functions to avoid local traps, ultimately converging on known theoretical optimal solutions [19]. - AlphaEvolve exhibited excellent generalization capabilities, successfully identifying universal constructs for all perfect square inputs [20][21]. Group 5: Efficiency and Expert Guidance - AlphaEvolve operates efficiently with minimal high-quality prompts, and expert guidance significantly enhances the quality of its outputs [23]. - The system supports parallelization, allowing researchers to explore multiple problem instances simultaneously, which is particularly effective for multi-parameter geometric problems [23]. Group 6: Operational Modes - AlphaEvolve functions in two primary modes: "search mode" for efficiently discovering optimal mathematical constructs and "generalizer mode" for creating universal programs applicable to various parameters [24][26]. - In search mode, the system evolves heuristic algorithms to optimize the search process, while in generalizer mode, it aims to identify patterns and develop general formulas based on observed optimal solutions [25][26]. Conclusion - Overall, AlphaEvolve exemplifies how AI-driven evolutionary search can complement human intuition, providing a robust new paradigm for mathematical research [28].
陶哲轩力推AlphaEvolve:解决67个不同数学问题,多个难题中超越人类最优解
量子位· 2025-11-07 05:32
克雷西 发自 凹非寺 量子位 | 公众号 QbitAI 陶哲轩又来安利AlphaEvolve了。 在与DeepMind高级工程师Bogdan Georgiev等人合著的新论文中,陶哲轩称其为 数学发现的有力新工具 。 具体来说,他们用AlphaEvolve研究了67个数学问题,涵盖组合数学、几何、数学分析与数论等多个领域。 更关键的是,AlphaEvolve已经可以 自主发现新颖的数学构造 ,并在部分问题上超越人类已有的最优结果。 AI自主发现新数学构造 AlphaEvolve在67个问题的测试中,不仅复现了众多已知最优解,更在多个方面展现了其独特的发现能力。 一个关键的成就是AlphaEvolve 能够自主发现人类未曾一窥的新数学构造 。 例如在处理Nikodym集问题时,系统生成的初步构造虽然尚未达到最优,但它为人类研究者提供了"一个极好的人类直觉跳板" 。 基于AI提供的结构,研究人员通过人工简化和直觉推演,最终找到了一个更优的构造,改进了已知的上界,这一人机协作的成果将作为一篇独 立的数学论文发表。 结果发现,AlphaEvolve在可扩展性、鲁棒性、可解释性方面均优于传统工具。 同样地,在算术Kak ...
X @Decrypt
Decrypt· 2025-11-06 19:19
AI Development - Google DeepMind's AlphaEvolve AI 发现解决未解数学难题的新方法 [1]
谷歌AlphaEvolve太香了,陶哲轩甚至发了篇论文,启发数学新构造
机器之心· 2025-11-06 08:58
Core Insights - The paper showcases how AlphaEvolve, a tool developed by Google DeepMind, autonomously discovers new mathematical constructs and enhances understanding of long-standing mathematical problems [2][8]. - AlphaEvolve represents a significant advancement in the field of mathematical discovery, combining large language models (LLMs) with evolutionary computation and automated evaluation mechanisms [8][16]. - The research indicates that AlphaEvolve can rediscover known optimal solutions and improve upon them in several cases, demonstrating its potential to match or exceed existing best results [10][11]. Group 1: AlphaEvolve's Capabilities - AlphaEvolve can autonomously explore mathematical spaces and generate new structures, significantly reducing the time required for problem setup compared to traditional methods [11][12]. - The system operates on multiple abstract levels, optimizing both specific mathematical constructs and the algorithms used to discover them, showcasing a new form of recursive evolution [12][13]. - The research team tested AlphaEvolve on 67 problems across various mathematical domains, including analysis, combinatorics, geometry, and number theory [9]. Group 2: Methodology and Design - AlphaEvolve employs a complex search algorithm that optimizes solutions by iteratively refining candidate solutions, akin to a hill-climbing approach [18][19]. - The system's design allows it to evolve entire code files rather than just single functions, enabling it to handle more complex mathematical problems [20]. - The introduction of a search mode allows AlphaEvolve to evolve heuristic algorithms that can explore a vast number of candidate constructs efficiently [28][29]. Group 3: Integration of AI Tools - The research highlights a workflow that integrates multiple AI tools, such as Deep Think and AlphaProof, to achieve a complete cycle from intuitive discovery to formal verification [34]. - This integration demonstrates the potential for specialized AI systems to collaborate in mathematical research, enhancing the overall discovery process [34]. Group 4: Observations and Limitations - The study notes that while AlphaEvolve excels in discovering constructs within the current mathematical capabilities, it may struggle with problems requiring novel insights [43][44]. - The researchers observed that the design of the verification system significantly impacts the quality of results, emphasizing the need for robust evaluation environments [39]. - The findings suggest that AlphaEvolve's performance improves when trained on related problems, indicating the benefits of cross-problem training [42].
前OpenAI灵魂人物Jason Wei最新演讲,三大思路揭示2025年AI终极走向
3 6 Ke· 2025-11-03 03:02
Core Insights - The core viewpoint of the article is that while AI has made significant advancements, it will not instantaneously surpass human intelligence, and its development can be categorized into two phases: breakthrough and commoditization of intelligence [1][5][42]. Group 1: AI Development Phases - AI development can be divided into two stages: the first stage focuses on unlocking new capabilities when AI struggles with certain tasks, while the second stage involves the rapid replication of these capabilities once AI can perform them effectively [5][30]. - The cost of achieving specific performance benchmarks in AI has been decreasing over the years, indicating a trend towards commoditization [5][12]. Group 2: Knowledge Accessibility - AI is facilitating the democratization of knowledge, making previously high-barrier fields like programming and biohacking accessible to the general public [15]. - The time required to access public knowledge has been significantly reduced, moving from hours in the pre-internet era to seconds in the AI era [14][12]. Group 3: Verifiability and AI - The "Verifier's Law" states that any task that can be verified will eventually be solved by AI, leading to the emergence of various benchmarking standards [16][41]. - Tasks that are easy to verify but difficult to generate will be prioritized for AI automation, creating new entrepreneurial opportunities for defining measurable goals for AI [30][41]. Group 4: Asymmetry in Task Difficulty - There exists an asymmetry in task difficulty where some tasks are easy to verify but hard to generate, such as Sudoku puzzles versus website development [17][18]. - The development speed of AI varies significantly across different tasks, influenced by factors such as digitization, data availability, and the nature of the task [35][36]. Group 5: Future Implications - The future of AI will see a jagged edge of intelligence, where different tasks will evolve at varying rates, and there will not be a singular moment of "superintelligence" emergence [31][42]. - The flow of information will become frictionless, and the boundaries of AI will be determined by what can be defined and verified [43].
陶哲轩敲警钟,谷歌DeepMind联手五大神殿,用AI向世纪难题宣战
3 6 Ke· 2025-10-30 04:12
Core Insights - Google DeepMind has launched the "AI Empowered Mathematics Program," collaborating with five top global institutions to leverage AI in solving complex mathematical problems [1][2][6] - The initiative aims to discover new mathematical challenges that can benefit from AI, build necessary infrastructure, and accelerate scientific discoveries [6][8] - Concerns have been raised by mathematician Terence Tao regarding the potential misuse of AI in mathematical research, emphasizing the need for responsible use and transparency [2][20] Group 1 - The five collaborating institutions include Imperial College London, Princeton Institute for Advanced Study, Institut des Hautes Études Scientifiques, Simons Institute for the Theory of Computing, and Tata Institute for Fundamental Research [2][6] - The program will be funded by Google.org and will utilize advanced technologies from Google DeepMind [8] - Recent advancements in AI, such as AlphaEvolve and Gemini models, have shown significant progress in solving mathematical problems, including achieving gold medal-level performance in competitions [11][14] Group 2 - AlphaEvolve has provided optimal solutions for 20% of 50 public mathematical problems, including a new efficient matrix multiplication method that broke a 50-year-old record [14][16] - The initiative aims to ensure the rigor of mathematical research while paving the way for the integration of AI and mathematics [5][6] - Terence Tao has proposed a set of guidelines for the responsible use of AI in research papers, including clear declarations of AI usage and discussions on potential risks [23][26]
史上最惨一代?AI延长人类寿命,下一代活到200岁不是梦
3 6 Ke· 2025-10-29 07:09
Core Insights - The article discusses the tension between the rapid advancement of AI technologies and the potential risks associated with them, highlighting the contrasting approaches of major tech companies like Google, Microsoft, and Meta towards AI development and commercialization [1][10][14]. Group 1: AI Development and Corporate Strategies - Major tech companies are racing to develop AGI (Artificial General Intelligence), with significant investments and talent acquisition, but they differ in their approach to speed and safety [8][10]. - Google tends to be more cautious in its AI rollout, ensuring technologies are ready before launch, while Microsoft is perceived as more aggressive [8][10]. - OpenAI occupies a middle ground, balancing between caution and the urgency to capture market share [8][10]. Group 2: Energy and Resource Constraints - The article emphasizes that energy may become a critical bottleneck for AI development, despite the U.S. having advantages in chip technology and AI training [10][14]. - The competition for AI supremacy is not solely about capital and talent but increasingly about energy resources [10]. Group 3: The Future of AI and Human Longevity - There are indications that AI may soon exhibit recursive self-improvement, leading to rapid advancements that could result in an "intelligence explosion" [14][17]. - Breakthroughs in biomedical AI could significantly extend human lifespans, with predictions that children today may have a 50% chance of living to 200 years old [26][32]. Group 4: Societal Implications of AI and Robotics - The potential for robots to take over household tasks could lead to a society where humans have more leisure time, but it also raises concerns about societal engagement and productivity [33][37]. - The future may see a divergence in societal outcomes, with one scenario leading to creativity and prosperity, while another could result in widespread complacency and entertainment addiction [39][40].
地理学的AlphaEvolve?MIT斯坦福让AI自我生长、懂地理、懂世界
3 6 Ke· 2025-10-28 03:04
Core Insights - The article discusses the development of GeoEvolve, a framework that integrates geographic knowledge into AI to enhance geospatial modeling, allowing AI to autonomously improve algorithms rather than merely assisting researchers [2][4][19]. Research Background - Geospatial modeling is crucial for understanding climate change and promoting sustainable urban development, traditionally relying on expert experience for hypothesis formulation and algorithm design [4]. - Recent advancements in large language models (LLMs) show potential for automated code evolution, but these systems lack an understanding of geography, which can lead to ineffective models [4][11]. GeoEvolve Framework - GeoEvolve is conceptualized as a research team comprising an AI (acting as a PhD student) and a geographic knowledge base (acting as a mentor), ensuring that the evolution of algorithms aligns with spatial theories [5]. - The framework consists of four core modules: 1. Code Evolver (automatically generates and mutates candidate algorithms) 2. Code Analyzer (diagnoses issues and suggests improvements) 3. Geographic Knowledge Retriever (GeoKnowRAG, provides spatial theory and classic methods) 4. Knowledge-Driven Prompt Generator (translates complex geographic knowledge into AI-understandable optimization instructions) [5][8]. Case Study: Automation of Kriging Improvement - Ordinary Kriging, a classic spatial interpolation method, has seen limited structural improvements over time, primarily relying on external combinations with regression models [13]. - GeoEvolve introduces several enhancements to the Kriging model, including: - Adaptive empirical variogram estimation to reduce the impact of outliers [14]. - Multi-start global fitting to avoid local optima [15]. - Adaptive data transformation for better residual distribution [16]. - Experimental results show that GeoEvolve significantly outperforms traditional and other automated models, achieving lower RMSE and MAE across various metal predictions [18]. Conclusion - GeoEvolve demonstrates that AI can autonomously evolve stronger classic models under the guidance of geographic knowledge, suggesting a shift towards fully automated algorithm development in geospatial modeling [19]. - This advancement opens new possibilities for AI applications in geographic science and sustainable development, positioning AI as a collaborative research partner rather than just a tool [20].
X @Demis Hassabis
Demis Hassabis· 2025-10-01 15:59
RT Google Research (@GoogleResearch)Today we describe how we leverage AlphaEvolve, a @GoogleDeepMind system for iteratively evolving code, to morph snippets of code towards better proof elements in complexity theory that can be automatically verified by a computer program. Read more at: https://t.co/tZ2KU9znVu https://t.co/ytEGze2AOv ...