让两个大模型「在线吵架」,他们跑通了全网95%科研代码|深势发布Deploy-Master
机器之心·2026-01-09 06:16

Core Insights - The article discusses the challenges in deploying scientific software, emphasizing that most tools are published but not executable, leading to inefficiencies in research practices [3][5][21] - It introduces Deploy-Master as a solution to create a shared infrastructure that transforms scientific tools into executable entities, addressing the deployment bottleneck in AI for Science (AI4S) and Agentic Science [5][19][20] Group 1: Challenges in Scientific Software Deployment - A significant issue is that scientific software often requires extensive time to resolve compilation failures and dependency conflicts, resulting in a lack of reproducibility and integration [3][4] - The emergence of AI4S has intensified the need for tools that can interact seamlessly with scientific processes, making the ability to execute tools a fundamental concern [3][5] - The deployment process is not isolated but part of a continuous chain that includes discovery, understanding, environment construction, and execution [5][19] Group 2: Deploy-Master Overview - Deploy-Master is designed to automate the deployment workflow, focusing on execution readiness and addressing the challenges of discovering and verifying scientific tools [5][19] - The initial phase involved searching through 91 scientific and engineering domains, resulting in a refined list of 52,550 candidates for automated deployment from an initial pool of 500,000 repositories [8][9] - A dual-model debate mechanism was implemented to enhance the success rate of building specifications, increasing it to over 95% by iteratively refining the proposed build plans [12][13] Group 3: Deployment Insights and Observations - The deployment process exhibits a long-tail distribution in build times, with most tools completing in around 7 minutes, while some require significantly longer due to complex dependencies [15] - A diverse language distribution was observed among the successfully deployed tools, with Python being the most prevalent, followed by C/C++, R, and Java [16] - The primary reasons for build failures were identified as inconsistencies in the build process, missing dependencies, and mismatched compilers or system libraries, highlighting the need for improved deployment strategies [16][17] Group 4: Implications for the Future - Deploy-Master provides a foundational infrastructure for community agents, enabling them to share verified tools and ensuring a stable action space for planning and execution [19][20] - The methodology established through Deploy-Master can be applied to broader software ecosystems, indicating that deployment challenges are not limited to scientific tools but are prevalent across various software types [20] - The article concludes that in the era of Agentic Science, execution is a prerequisite for all capabilities, and establishing a robust execution infrastructure is essential for future advancements [20][21]