CodeFuse

Search documents
不靠Agent,4步修复真Bug!蚂蚁CGM登顶SWE-Bench开源榜
机器之心· 2025-06-27 06:44
Core Viewpoint - The article discusses the advancements in AI code generation, particularly focusing on the performance of the Code Graph Model (CGM) developed by Ant Group, which has achieved significant success in code repair tasks compared to existing models. Group 1: Performance Metrics - The first fully automated AI software engineer, Devin, solved 13.86% of problems in the SWE-Bench benchmark, significantly outperforming GPT-4 at 1.7% and Claude2 at 4.8% [3] - Genie later improved the score to 30.08%, briefly becoming the top AI programmer globally [4] - The CGM achieved a problem-solving rate of 44% in the SWE-Bench Lite leaderboard, surpassing all open-source models and ranking first [10][11] Group 2: Benchmarking and Testing - SWE-Bench is a testing suite developed by Princeton University, designed to reflect real-world coding challenges faced by developers [5][6] - The benchmark includes tasks derived from actual GitHub projects, ensuring a high level of complexity and relevance [6][7] - A simpler subset, SWE-Bench Lite, was also created, but it remains challenging [8] Group 3: Innovations in AI Code Repair - CGM is notable for breaking the closed-source model monopoly by achieving SOTA performance using an open-source model [13] - The model employs a lightweight GraphRAG process, eliminating the need for complex agent architectures [14] - CGM uniquely allows large models to understand repository-level code structures, linking code and graph modalities for better context comprehension [15] Group 4: Technical Advancements - CGM utilizes a multi-granularity code graph modeling approach to capture structural information within code repositories [42] - The model incorporates a two-stage training process that aligns structure and semantics, enhancing its ability to reason about code [45] - The GraphRAG framework streamlines the process into four key modules, improving efficiency in bug fixing [51] Group 5: Market Implications - CGM offers enterprises greater freedom by ensuring data security and reducing reliance on expensive API services [54] - The architecture is expected to attract companies looking for customizable and deployable solutions [55] - The advancements in AI code generation are anticipated to significantly transform software engineering by the end of 2025 [56]