GLM-5真够顶的：超24小时自己跑代码，700次工具调用、800次切上下文

Core Insights - The release of GLM-5 marks a significant advancement in open-source AI, bringing it into the era of long-task capabilities [1] - GLM-5 has demonstrated its ability to perform complex engineering tasks, such as creating a Game Boy Advance emulator from scratch [2][7] - The model has achieved impressive results in various benchmarks, positioning it alongside proprietary models like Claude Opus 4.5 [10][12][18] - The emergence of GLM-5 signifies a shift in the SaaS industry, as it allows developers to create sophisticated applications without relying on traditional software solutions [29] Group 1 - GLM-5 can run code continuously for over 24 hours, performing 700 tool calls and 800 context switches, showcasing its stability and reliability [2][7] - The model's programming capabilities have been validated against established benchmarks, achieving the top score among open-source models [18][20] - Users have already begun to leverage GLM-5 for various applications, including a 3D version of Monopoly and an academic version of TikTok, with multiple apps submitted for App Store approval [24][29] Group 2 - The open-source nature of GLM-5 disrupts the market previously dominated by closed-source models, empowering developers with new tools [20][29] - The performance of GLM-5 has led to concerns in the SaaS sector, with significant stock declines for companies like FactSet and S&P Global as investors reassess the future of software sales [29] - The model's capabilities represent a transformation from AI as a mere assistant to an independent engineer, potentially reshaping the landscape of software development [29]