AI Code Generation
Fixing Real Bugs in 4 Steps, No Agent Needed! Ant Group's CGM Tops the SWE-Bench Open-Source Leaderboard
机器之心· 2025-06-27 06:44
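机器之心 report. Editor: 吴昕

Agentless plus an open-source model can also complete repository-level code repair tasks at high quality, with results on par with the industry SOTA.

1. Agentless, 44%, and No. 1

When it comes to how well AI can write code, the question people care about most is still this: can it actually fix bugs?

Devin, the first fully autonomous AI software engineer, set the tech world alight the moment it appeared, and its standing was further cemented on the authoritative benchmark SWE-Bench: it independently resolved 13.86% of the issues, far ahead of GPT-4's mere 1.7%, while Claude 2 managed only 4.8%. Not long after, Genie pushed the score on the same test up to 30.08% and for a time ranked as the world's strongest AI programmer.

Why has SWE-Bench drawn such broad attention from industry, academia, and startup teams? Because it is realistic. The tasks in this test set, proposed by Princeton University, all come from real GitHub projects: the issues are either bugs that developers ran into in production or typical requirements from feature development. They are difficult and context-heavy, and they reproduce as closely as possible how programmers work in real development.

In other words, a model that scores high on SWE-Bench must have the complex skills and experience of a seasoned software engineer, and these are precisely what traditional code-generation benc ...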
AI Application Wave Sweeps the Globe! "OpenAI Rival" Anthropic Triples Its Revenue in Five Months
智通财经网· 2025-05-31 03:41
Core Insights
- Anthropic, a leader in generative AI, has achieved an annualized revenue of approximately $3 billion, indicating strong early validation for the commercial application of generative AI software [1]
- The company's revenue has surged from nearly $1 billion in December 2024 to $3 billion by May 2025, reflecting a threefold increase in just five months [1]
- The growth is primarily driven by the sale of customized "AI large model as a service" to enterprises, enhancing operational efficiency [1]

Company Performance
- Anthropic's rapid revenue growth positions it as one of the fastest-growing SaaS companies, with a notable increase in demand for AI code generation capabilities [2]
- The company has outpaced traditional SaaS firms, achieving a revenue growth rate that is unprecedented according to industry experts [2][3]
- In contrast, OpenAI is projected to reach over $12 billion in total revenue by the end of 2025, significantly higher than its previous year's revenue of $3.7 billion [4]

Market Dynamics
- The demand for enterprise-level AI applications is on the rise, with companies increasingly interested in deploying AI solutions internally, although some remain in experimental phases [1][2]
- The competitive landscape shows that while both Anthropic and OpenAI offer enterprise and consumer AI applications, OpenAI is focusing more on consumer products, particularly through its ChatGPT platform [4][5]
- The overall market for AI applications is expected to expand significantly, with companies like C3.ai and Palantir reporting strong performance and optimistic future outlooks [6]

Future Trends
- The introduction of new paradigms in AI training and inference is anticipated to lower costs and drive explosive growth in generative AI applications across various sectors [7]
- The evolution of AI applications is shifting towards "AI agents" capable of executing complex tasks autonomously, which could significantly enhance productivity across industries [7]
Meituan Opens Up Its AI Coding Tool: Full-Stack Capability with Zero Code, as the Project Lead Reveals Architecture Details
机器之心· 2025-05-30 04:16
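机器之心 report. Editor: 泽南

One sentence, and what you imagine takes shape.

No one expected that such a practical AI code generation tool would come from Meituan.

Last week, media reports revealed Meituan's AI zero-code tool NoCode, which requires no programming background or experience and quickly generates applications through natural language and conversation alone.

As the name suggests, NoCode helps people create personal productivity tools, product prototypes, interactive pages, and more in a "zero-code" way. Beyond generating code, it supports real-time preview, partial edits, and one-click deployment, which greatly lowers the barrier to development and helps more people realize their ideas.

NoCode is also completely free; users can log in by scanning a QR code with the Meituan app or WeChat.

Product link: https://nocode.cn/

NoCode is the latest practice in Meituan's open AI ecosystem. It aims to help small and medium-sized merchants upgrade their IT and digital capabilities by opening up Meituan's accumulated AI technology for free, while letting more users experience the efficiency gains and creative fun that AI brings. Inside the company, people have already used it to build a large number of different applications, from web pages to productivity tools, data analysis, and simple games.

Although it has not yet been officially released, examples of products built with NoCode have already appeared on social networks.

NoCode was developed by Meituan's R&D Quality and Efficiency team, which belongs to Meituan's basic R&D platform ...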
Roundup: Daily Tech News Briefing (May 27)
news flash· 2025-05-26 23:36
New Energy Vehicles
- Lithium carbonate futures have fallen below 60,000 yuan per tonne [1]
- Concerns arise over a new price war initiated by BYD, with industry insiders suggesting that "hidden price cuts" may persist long-term [1]

Technology Developments
- Tencent is set to release the world's first multimodal model "Hunyuan-O" [2]
- Microsoft has open-sourced a browser agent that can track and control intelligent agents in real-time [2]
- Apple is expected to undergo a design revolution for its all-platform operating system [2]
- A new myasthenia gravis drug, Udis, has been launched in China by UCB [2]
- Apple is rumored to adjust its release strategy to launch two new iPhone models each year [2]
- OpenAI plans to establish an office in Seoul within the next few months [2]
- Xiaomi has denied rumors that its Xuanjie O1 is a chip custom-built by Arm [2]
- Samsung's HBM3E has nearly passed Nvidia's single-chip certification, although final product certification may be delayed until the second half of the year [2]

E-commerce and Delivery Services
- Meituan reported that the average monthly income for high-frequency delivery riders in first-tier cities is 10,010 yuan [2]
- Meituan's CEO Wang Xing responded to JD.com's 10 billion yuan subsidy for food delivery, stating that the company will spare no effort to win the competition [2]
- Approximately 52% of Meituan's new code is generated by AI [2]