Workflow
AI编程
icon
Search documents
Claude Opus 4.5夺回编程王座,超Gemini 3 Pro和GPT-5.1
AI前线· 2025-11-25 05:03
目前测试版(Beta 版)已上线,开发者可直接通过 Claude API 调用。 | | Opus 4.5 | Sonnet 4.5 | Opus 4.1 | Gemini 3 Pro | GPT-5.1 | | | --- | --- | --- | --- | --- | --- | --- | | Agentic coding | | | | | 76.3% | | | SWE-bench Verified | 80.9% | 77.2% | 74.5% | 76.2% | 77.9% | | | | | | | | Codex-Max | | | Agentic terminal | | | | | 47.6% | | | coding | 59.3% | 50.0% | 46.5% | 54.2% | | | | Terminal-bench 2.0 | | | | | 58.1% | | | | | | | | Codex-Max | | | | Retail | Retail | Retail | Retail | - | | | Agentic tool use | 88.9% | 86.2% ...
中国AI编程赛道,谁能跑到最后?
3 6 Ke· 2025-11-20 11:34
众多人工智能(AI)应用中,AI编程被普遍视为技术迭代速度最快、商业化路径最清晰、用户渗透率最高、资本认可度最强的AI应用之一。 火遍全球的AI编程工具Cursor就是典型例子。这家公司2022年创立,创立之初就从OpenAI的创业基金和知名风险投资机构Khosla Ventures获得了种 子轮融资。2023年9月到2025年5月20个月间,这家公司估值飙升至99亿美元。根据该公司披露的数据,公司年化经常性收入(ARR)突破5亿美 元,付费用户超36万,日活用户达100万,覆盖英伟达、Uber等1.4万家企业客户。 关于AI编程的商业价值,知名硅谷风险投资公司Andreessen Horowitz(a16z)投资人公开表示,全球约3000万软件开发者,若按每人年创造10万美 元经济价值计算,当前AI编码工具可提升至少20%生产力,最优部署场景下生产力可翻倍,相当于每年将创造3万亿美元GDP(国内生产总值)贡 献,堪比法国GDP(2024年法国GDP为31620.8亿美元)。 a16z投资人还认为,AI编程已经形成了一个生态系统,这个生态系统有潜力支持数十家数十亿美元的公司,甚至是一个万亿美元级巨头。 AI编 ...
狙击Gemini 3!OpenAI发布GPT-5.1-Codex-Max
量子位· 2025-11-20 07:01
Core Insights - The article discusses the competitive landscape of AI programming models, highlighting the release of OpenAI's new model, GPT-5.1-Codex-Max, which aims to outperform Gemini 3 and other models in the market [1][34]. Model Performance - GPT-5.1-Codex-Max has achieved a new state-of-the-art (SOTA) in METR, indicating its ability to complete software engineering tasks with a 50% success rate in a time frame that previously required human intervention of 2 hours and 42 minutes, now reduced by 25 minutes compared to its predecessor [11][12]. - The new model demonstrates improved efficiency in task execution, particularly in software engineering tasks such as PR creation and code review, and is the first OpenAI model capable of operating in a Windows environment [16][18]. Long-Running Tasks - GPT-5.1-Codex-Max can operate independently for over 24 hours, processing millions of tokens continuously, which is a significant advancement for handling long-duration tasks without losing context [25][21]. - The model's ability to compress dialogue when approaching context window limits allows it to maintain coherence over extended tasks, making it suitable for analyzing lengthy documents without information loss [22][27]. Competitive Landscape - The article notes that other AI models, such as Claude, are also evolving, with Claude Code being faster in execution compared to OpenAI's offerings [32][31]. - The rapid advancements in AI programming models indicate a highly competitive environment, with multiple companies releasing new versions and features in quick succession [34][13]. Additional Releases - OpenAI has also introduced GPT-5.1 Pro, which reportedly excels in instruction following, although details are limited [36][38].
AI编程迎来“加速时刻” 互联网大厂“码”力全开|人工智能AI瞭望台
证券时报· 2025-11-18 00:12
花一杯咖啡的钱,就能让AI帮你写一个月的代码,程序员用AI"码"力全开的时代正加速到来。 近期,互联网大厂纷纷加码AI编程领域。火山引擎近日发布豆包编程模型Doubao-Seed-Code,并通过火山方舟平台全量开放API。面向个人开发者,订阅制套餐包 首月低至9.9元。据了解,该模型在Terminal Bench等多项权威基准测试中取得领先成绩。而美团旗下首款AI IDE(集成开发环境,用于提供程序开发环境的应用程 序)产品Meituan CatPaw(以下简称"CatPaw")近日也进入公测。 AI编程是否已成为"兵家必争之地"?广州眺远营销咨询公司总监高承飞接受证券时报记者采访时表示:"写代码是IT支出最刚性、数据最富矿、付费意愿最强的'三 高地带',谁拿到AI IDE入口,谁就拿到未来十年的'云税'权。" "咖啡价"用AI编程 豆包编程掀起"普惠浪潮" 豆包编程模型,正在掀起AI编程领域的"普惠浪潮",加速AI编程的规模化应用。证券时报记者了解到,在价格上,豆包编程模型综合使用成本相比业界平均水平降 低62.7%,达到了国内最低。目前,豆包编程模型Doubao-Seed-Code通过火山方舟面向开发者 ...
AI编程迎来"加速时刻"互联网大厂"码"力全开
Zheng Quan Shi Bao· 2025-11-18 00:09
Core Insights - The rise of AI programming tools is accelerating, with major internet companies investing heavily in this sector, indicating a competitive landscape for AI IDEs [1][3] - The introduction of the Doubao programming model by Huoshan Engine at a low subscription price of 9.9 yuan is expected to democratize access to AI programming tools, significantly reducing costs for individual developers and small teams [1][2] - The AI programming market is projected to exceed $29.5 billion by 2032, highlighting its vast potential despite current commercialization challenges [5] Group 1: Market Dynamics - Major internet companies are rapidly entering the AI programming space, creating a competitive environment characterized by a "three-legged" structure of international giants, domestic leaders, and niche applications [3] - The pricing strategy of Doubao programming model, set at the cost of a cup of coffee, aims to attract a wide range of users, from individual developers to enterprise teams, by making AI programming tools more accessible [2][3] - The AI programming market is facing a "sandwich" dilemma, where enterprise clients demand security and compliance, CTOs require clear ROI, and individual users seek low-cost or free solutions [5] Group 2: Strategic Implications - The push into AI programming by large firms is driven by the need to enhance development efficiency and reduce costs, as even a 10% improvement in productivity can lead to significant savings [4] - Companies without cloud infrastructure may quickly become marginalized as the industry shifts from selling models to offering a combination of cloud services, models, and data [2][4] - The competitive landscape is intensifying, with domestic tech giants and startups facing challenges of product differentiation and the need to establish unique value propositions to survive price wars [6]
AI编程迎来“加速时刻” 互联网大厂“码”力全开
Zheng Quan Shi Bao· 2025-11-17 16:57
Core Insights - The AI programming sector is rapidly evolving, with major internet companies intensifying their investments and product offerings in this area [1][3][4] - The introduction of low-cost AI programming models, such as Doubao-Seed-Code, is expected to democratize access to AI tools for individual developers and small teams [2][5] - The market for AI programming tools is projected to exceed $29.5 billion by 2032, indicating significant growth potential despite existing commercialization challenges [5][6] Group 1: Market Dynamics - Major internet companies are entering the AI programming space, creating a competitive landscape characterized by a "three-legged" structure of international giants, domestic leaders, and niche applications [3][4] - The pricing strategy of Doubao-Seed-Code, set at approximately 9.9 yuan for a month, is designed to lower barriers for entry and stimulate demand among developers [2][5] - The AI programming market is facing a "sandwich" dilemma, where enterprise clients demand security and compliance, CTOs seek clear ROI, and individual users expect low or no costs [5][6] Group 2: Competitive Landscape - The competition among domestic tech giants, including ByteDance, Meituan, Alibaba, Tencent, and others, is intensifying, with a focus on differentiating their AI programming products [3][4] - The low pricing of Doubao-Seed-Code is seen as a strategic move to capture market share and establish a customer base, potentially forcing competitors to follow suit [2][4] - Companies without cloud infrastructure may struggle to compete in the long term, as the industry shifts from selling models to offering integrated solutions that include cloud services and data [2][5] Group 3: Future Outlook - The AI programming tools market is expected to cover the entire lifecycle from code generation to system maintenance, indicating a broad scope for future development [5][6] - Companies are encouraged to explore vertical applications in AI programming, which may offer niche opportunities for growth without the need to compete directly with larger players [6] - The next two years are anticipated to see rapid segmentation within the AI programming sector, with companies needing to address issues of trust and multi-language legacy systems to capture significant enterprise budgets [5][6]
从酷炫功能到真实产业应用,AI卡在了哪里?
3 6 Ke· 2025-11-17 04:20
自2022年11月ChatGPT发布以来,生成式人工智能高速发展,大模型竞赛白热化,性能指标不断刷新,多模态能力持 续提升。AI智能体能自主调用工具,完成越来越复杂的任务。AI大模型厂商纷纷声称,通用人工智能(Artificial General Intelligence,AGI)时代即将到来。 与技术高歌猛进形成鲜明对比的是商业落地的滞后。美国Ramp AI Index数据显示,美国公司采用付费AI产品的比例近 期有停滞迹象,甚至出现下滑。 麻省理工学院在2025年7月的一份研究报告(The GenAI Divide: State of AI in Business 2025)中指出:95%的生成式AI 应用项目效果不佳或中途夭折。这份报告甚至引发了美股震荡。 当"所有行业都需用AI重做一遍"的豪言遭遇"AI项目高失败率"的现实,我们不得不追问:AI从酷炫的功能到真实的产 业应用,究竟卡在了哪里?又该如何穿越迷雾,实现真正的价值闭环? 01 根据我的观察,目前多数企业仍停留在直接套用AI工具的阶段,既未拆解工作流,也未评估AI能力与业务需求的适配 性,未能形成投入-数据-效益的飞轮,结果自然不如预期。 02 ...
专为智能体编程优化 豆包编程模型入局AI编程千亿赛道
当全民目光聚焦于"双11"电商盛宴之际,科技领域迎来一记重磅发布。 在成本方面,火山引擎通过技术创新和全量透明Cache能力,经由火山方舟平台提供的API服务,使得 豆包编程模型的实际使用成本相较行业水平大幅降低62.7%。综合来看,Doubao-Seed-Code输入输出单 价已达国内低价,配合全量透明Cache能力,在多轮对话中进一步降低成本。 举例来说,创建一个美观的交互式英语学习网站,相同tokens量下,在0-32k输入区间,Claude Sonnet 4.5成本约4.05元,GLM-4.6约0.77元,而Doubao-Seed-Code仅约0.34元。 此外,对个人开发者,火山引擎推出Coding Plan订阅服务,支持Claude Code、veCLI、Cursor、Cline、 Codex CLI等主流编程工具,最低9.9元即可享受首月服务。 11月11日,火山引擎发布豆包编程模型 Doubao-Seed-Code。这是一款专为智能体编程任务深度优化而 生的编程模型,其集成开发环境TRAE CN企业版同日也开放公测,为企业级开发场景带来更强大的AI 编程体验。 此举不仅标志着AI编程工具赛道迎 ...
TRAE SOLO正式版发布,面向所有用户开放
Bei Jing Shang Bao· 2025-11-14 10:02
北京商报讯(记者 魏蔚)11月14日,北京商报记者获悉,TRAE推出SOLO正式版,该版定位于旨在为 专业开发者提供实时有感知、随时可掌握、多任务并行的AI编程体验。即日起,SOLO正式版面向 TRAE国际版用户全面开放。7月,TRAE推出SOLO Beta版,内置智能体SOLO Builder,能够结合多模 态上下文进行需求感知、任务分解、工具调度与执行反馈,完整交付软件结果,帮助用户快速搭建端到 端应用。在Beta版的基础上,SOLO 正式版增加了更擅长处理复杂任务的智能体SOLO Coder,并新增多 任务并行、上下文压缩和代码变更等能力。 ...
面向所有用户开放!字节TRAE SOLO正式版发布
Xin Lang Ke Ji· 2025-11-14 09:31
随着Beta版迭代为正式版,TRAE SOLO的定位也从最初的"The First Context Engineer"升级为"The Responsive Coding Agent"。SOLO正式版更加关注人与Agent在整个开发流程中的协同响 应,"Responsive"代表在高度自动化的同时真正做到实时有感知、随时可掌握、多任务并行。基于多项 功能升级,SOLO正式版更加适应专业开发场景,并在整个开发过程中实现了更快的响应速度、更好的 交付质量。(罗宁) 责任编辑:刘万里 SF014 即日起,SOLO正式版面向TRAE国际版用户全面开放,限时免费体验活动同步开启。 今年7月,TRAE推出SOLO Beta版,内置智能体SOLO Builder,能够结合多模态上下文进行需求感知、 任务分解、工具调度与执行反馈,完整交付软件结果,帮助用户快速搭建端到端应用。在Beta版的基础 上,SOLO 正式版增加了更擅长处理复杂任务的智能体SOLO Coder,并新增多任务并行、上下文压缩 和代码变更等核心能力。 新浪科技讯 11月14日晚间消息,近日,TRAE宣布推出SOLO正式版。SOLO正式版定位于"The Res ...