谷歌AI新里程碑：一个能「做研究」的系统诞生了，用LLM+树搜索编写专家级软件

Core Viewpoint - The article discusses a groundbreaking AI system developed by Google that assists researchers in writing expert-level empirical software, integrating large language models with traditional tree search methods to enhance efficiency in scientific research [2][4][36]. Group 1: AI System Overview - The AI system can automatically write and optimize software programs needed for scientific tasks, surpassing human capabilities in various fields such as genomics and public health [2][4]. - It transforms from a one-time code generation tool to an iterative, search-driven software evolution guided by quantifiable goals [4][36]. Group 2: Methodology and Components - The system focuses on "scorable scientific tasks," which can be quantified through accuracy, error rates, or benchmark rankings, covering a wide range of scientific applications [14]. - Three core components work in synergy: 1. LLM-based code mutation, which continuously rewrites and optimizes existing candidate codes [15]. 2. Tree search navigation, systematically exploring the software solution space using a variant of the PUCT algorithm inspired by AlphaZero [16]. 3. Integration of research ideas from various sources, including expert knowledge and academic papers [17]. Group 3: Achievements Across Scientific Fields - In genomics, the system identified 40 new methods for single-cell RNA sequencing, outperforming all published methods on the OpenProblems leaderboard, with the best method improving performance by 14% over the existing best [19]. - For geospatial analysis, the system's top solutions for satellite image segmentation significantly exceeded recent academic results, achieving an average intersection-over-union score greater than 0.80 [22]. - In neuroscience, the system generated a model predicting neural activity in zebrafish brains that outperformed all baseline models and trained significantly faster [26]. - The system also excelled in time series prediction across 28 datasets, creating a unified prediction library adaptable to various datasets [27]. Group 4: Technical Innovations - A key innovation is the systematic integration and intelligent reorganization of research ideas, allowing the system to analyze core principles of different methods and synthesize instructions for creating hybrid methods [31]. Group 5: Conclusion and Implications - The research indicates that AI can not only automate but also systematically exceed human performance in developing scientific software across multiple fields [36]. - This system has the potential to fundamentally change the way scientific software is developed, making advanced analytical tools more accessible to researchers and expanding the boundaries of scientific exploration [37].