Code review
Search documents
Automatic code reviews with OpenAI Codex
OpenAI· 2025-11-04 17:54
Code review of two comment mark. >> Hey everyone, I'm Roma >> and I'm Maya. >> Codex needs to do two things really well to be an effective coding teammate.First, it needs to work with all your tools and second, it also needs to plug into all of your team workflows. Code review is one of the most important workflows for any engineering team and we want to help there as well. With GPT5 and now GP5 codeex, we train these models specifically to find bugs and investigate some issues.Maya, why don't you tell us m ...
Shipping with Codex
OpenAI· 2025-10-08 17:01
Product Updates & Enhancements - OpenAI's Codex has undergone a significant overhaul, featuring an improved agent with a smarter reasoning model (GPT5 codecs) and a rewritten harness for better planning and interaction capabilities [3][4][5] - The Codex CLI has been revamped with simplified approval modes, a more legible UI, and default sandboxing for safety, with frequent updates based on user feedback [6] - Codex is now natively integrated into IDEs like VS Code and Cursor as an extension, attracting 100,000 users within the first week [7] - Codex Cloud has been upgraded to run tasks 90% faster, enabling parallel task execution and remote command through mobile devices [8] Usage & Impact - 92% of OpenAI's technical staff uses Codex daily, a significant increase from 50% in July [14] - Engineers using Codex submit 70% more pull requests (PRs) per week [14] - Code review, powered by GPT5 codeex, is now frequently enabled by default due to its high signal in identifying critical issues [13][14] Code Review & Testing - Codex has been specifically trained for thorough code review, capable of identifying complex bugs and suggesting fixes [12][44] - Codex supports test-driven development (TDD) by running tests, fixing code, and re-running tests until they pass, and can also verify UI visually using snapshots [21][22] - Local code reviews can be performed using slash commands in the CLI, allowing developers to review and fix code before submitting PRs [47][48] Workflow & Scalability - Codex can be used to create detailed plans and specifications for complex features, acting as a "senior engineer" capable of handling its own documentation [32][31] - Codex can sustain productive sessions for over seven hours, processing more than 150 million tokens for large projects [27] - The workflow from idea to pull request can be streamlined into a few steps with Codex, involving rigorous planning and thorough testing [41]
X @Sam Altman
Sam Altman· 2025-09-19 01:36
Product Update - Codex CLI工具推出,用于本地代码更改的快速审查 [1] - 未来几天将大幅扩展Codex CLI的功能 [1] - 鼓励用户向Daniel反馈使用体验 [1]
GPT-5 Codex is nuts...
Matthew Berman· 2025-09-15 22:31
Product Overview - OpenAI releases GPT5 Codeex, optimized for agentic coding, available in various environments like terminal, IDE, GitHub, and ChatGPT iOS app [1][2] - GPT5 Codeex is included with ChatGpt Plus Pro business edu and enterprise plans [3] Performance Benchmarks - GPT5 Codeex achieves 74.5% on SWEBench verified, a slight improvement over GPT5's 72.8% [3] - Code refactoring sees a significant improvement with GPT5 Codeex at 51.3% compared to GPT5's 33.9% [3] - GPT5 Codeex can work independently for over 7 hours on complex tasks [4] - GPT5 Codeex uses 93.7% fewer tokens than GPT5 for simpler tasks but spends twice as long on complex use cases [6] - GPT5 Codeex reduces incorrect comments to 4.4% compared to GPT5's 13.7% and increases high impact comments to 52.4% from 39.4% [7] Features and Capabilities - Codeex is trained for code reviews, identifying critical flaws by navigating codebase, reasoning through dependencies, and running code and tests [6] - Codeex CLI updates include better formatted tool calls and diffs, simplified approval modes, and conversation state compaction [12][13] - Codeex automates environment setup by scanning for setup scripts and fetching dependencies at runtime [15] - Codeex can spin up its own browser, iterate on its builds, and attach screenshots to tasks and GitHub PRs [15] - Codeex reviews PRs by matching stated intent to the actual diff, reasoning over the codebase, and executing code and tests [16] Windsurf Integration - Windsurf is highlighted as a powerful agentic IDE, especially after being acquired by Cognition [9] - Windsurf offers features like deep wiki, vibe, replace, one-click MCP store, sophisticated memory, and deep integration with Devon [10][11] Pricing and Availability - Pro plan at $200 per month can support a full work week across multiple projects, positioning it as an additional developer [19] - Business plans offer credit purchases for exceeding included limits, while enterprise plans provide a shared credit pool [20] Infrastructure Improvements - Cloud infrastructure performance is improved by caching containers, reducing medium completion time for new tasks and follow-ups by 90% [14]
AI powered entomology: Lessons from millions of AI code reviews — Tomas Reimers, Graphite
AI Engineer· 2025-07-22 19:50
[Music] Thank you all so much for coming to this talk. Um, thank you for being at this conference. Generally, my name is Tomas.I'm one of the co-founders of Graphite and I'm here to talk to you around AI power entomology. If you don't know, entomology is the study of bugs. It's something that we do.We is very near and dear to our heart and part of what our product does. So, Graphite, for those of you that don't know, builds a product called Diamond. Diamond is an AI powered code reviewer.You go ahead, you u ...
Don’t get one-shotted: Use AI to test, review, merge, and deploy code — Tomas Reimers, Graphite
AI Engineer· 2025-06-27 10:25
Industry Trends - Software development has two loops: an inner loop focused on development and an outer loop focused on review [1] - AI adoption is increasing among developers, with nearly every developer surveyed using AI tools [2] - 46% of code on GitHub is being written by AI, indicating a significant shift in code generation [3] - The inner loop is changing due to AI, making developers more productive and producing higher volumes of code [3][4] - The outer loop is becoming a bottleneck as developers have to review, test, merge, and deploy higher volumes of code [5] Graphite's Solution (Diamond) - Graphite aims to create a new outer loop to address the challenges posed by increased code volume [6] - Graphite's AI code review platform, Diamond, focuses on high signal, low noise, deep understanding of codebase and change history [13] - Diamond summarizes, prioritizes, and reviews each change, integrating with CI and testing infrastructure [13] - Diamond aims to reduce code review cycles, enforce quality and consistency, and keep code private and secure [13] - AI-generated feedback from Diamond's comments are accepted at a 52% rate, higher than human comments (45-50%) [15][16]