Workflow
科学计算软件
icon
Search documents
AI4S又一瓶颈被攻克:两个AI「吵架」,让科研代码部署成功率突破95%
量子位· 2026-01-13 09:50
Core Insights - The article discusses the challenges in deploying scientific software, emphasizing that most tools are published but not executable, which limits reproducibility and integration in scientific research [3][6][11] - The emergence of AI for Science (AI4S) highlights the need for tools that can interact seamlessly with AI systems, making the ability to run these tools a fundamental issue [8][9][10] - Deploy-Master is introduced as a solution to streamline the deployment process, focusing on creating a shared infrastructure that ensures tools are executable [12][35][37] Group 1: Challenges in Scientific Software - Scientific software often requires extensive manual effort to compile and run, leading to inefficiencies and reliance on individual expertise [4][5] - The deployment bottleneck persists despite advancements in containerization and cloud computing, affecting the usability of scientific software [7] - The lack of a systematic approach to convert tools into executable formats is identified as a structural barrier to the scalability of AI4S and Agentic Science [11][35] Group 2: Deploy-Master Overview - Deploy-Master is designed as a one-stop automated workflow centered on execution, addressing the entire deployment chain from discovery to execution [12] - The tool employs a multi-stage funnel process to filter and validate scientific tools, reducing an initial pool of 500,000 repositories to 52,550 candidates for automated deployment [15] - A dual model review mechanism is implemented to enhance the success rate of building specifications, achieving over 95% success in generating executable tools [22] Group 3: Deployment Insights - The deployment process is characterized by a long-tail distribution of build times, with most tools completing in around 7 minutes, but some requiring significantly longer due to complexity [25][26] - A diverse language distribution is observed among successfully deployed tools, with Python being the most prevalent, followed by C/C++, R, and Java [27][28] - Failure rates in builds are concentrated in specific areas, primarily due to inconsistencies in build processes and missing dependencies [31][32] Group 4: Future Implications - Deploy-Master's success in creating a large collection of executable tools provides a foundation for community agents and various master agents to operate effectively [35][36] - The methodology established by Deploy-Master can be applied beyond scientific computing to other software ecosystems, emphasizing the importance of a robust execution infrastructure [37]
让两个大模型「在线吵架」,他们跑通了全网95%科研代码|深势发布Deploy-Master
机器之心· 2026-01-09 06:16
Core Insights - The article discusses the challenges in deploying scientific software, emphasizing that most tools are published but not executable, leading to inefficiencies in research practices [3][5][21] - It introduces Deploy-Master as a solution to create a shared infrastructure that transforms scientific tools into executable entities, addressing the deployment bottleneck in AI for Science (AI4S) and Agentic Science [5][19][20] Group 1: Challenges in Scientific Software Deployment - A significant issue is that scientific software often requires extensive time to resolve compilation failures and dependency conflicts, resulting in a lack of reproducibility and integration [3][4] - The emergence of AI4S has intensified the need for tools that can interact seamlessly with scientific processes, making the ability to execute tools a fundamental concern [3][5] - The deployment process is not isolated but part of a continuous chain that includes discovery, understanding, environment construction, and execution [5][19] Group 2: Deploy-Master Overview - Deploy-Master is designed to automate the deployment workflow, focusing on execution readiness and addressing the challenges of discovering and verifying scientific tools [5][19] - The initial phase involved searching through 91 scientific and engineering domains, resulting in a refined list of 52,550 candidates for automated deployment from an initial pool of 500,000 repositories [8][9] - A dual-model debate mechanism was implemented to enhance the success rate of building specifications, increasing it to over 95% by iteratively refining the proposed build plans [12][13] Group 3: Deployment Insights and Observations - The deployment process exhibits a long-tail distribution in build times, with most tools completing in around 7 minutes, while some require significantly longer due to complex dependencies [15] - A diverse language distribution was observed among the successfully deployed tools, with Python being the most prevalent, followed by C/C++, R, and Java [16] - The primary reasons for build failures were identified as inconsistencies in the build process, missing dependencies, and mismatched compilers or system libraries, highlighting the need for improved deployment strategies [16][17] Group 4: Implications for the Future - Deploy-Master provides a foundational infrastructure for community agents, enabling them to share verified tools and ensuring a stable action space for planning and execution [19][20] - The methodology established through Deploy-Master can be applied to broader software ecosystems, indicating that deployment challenges are not limited to scientific tools but are prevalent across various software types [20] - The article concludes that in the era of Agentic Science, execution is a prerequisite for all capabilities, and establishing a robust execution infrastructure is essential for future advancements [20][21]