Workflow
Code review
icon
Search documents
Automatic code reviews with OpenAI Codex
OpenAI· 2025-11-04 17:54
Code review of two comment mark. >> Hey everyone, I'm Roma >> and I'm Maya. >> Codex needs to do two things really well to be an effective coding teammate.First, it needs to work with all your tools and second, it also needs to plug into all of your team workflows. Code review is one of the most important workflows for any engineering team and we want to help there as well. With GPT5 and now GP5 codeex, we train these models specifically to find bugs and investigate some issues.Maya, why don't you tell us m ...
Shipping with Codex
OpenAI· 2025-10-08 17:01
I'm >> here at OpenAI and I build Codex. With Codeex, we're building an AI software engineer. I personally like to think about it as a little bit like a human teammate.You can pair program with it on your computer. You can delegate to it or as you'll see, you can give it a job without explicit prompting. There's been recently a massive vibe shift.This has started from August where we had pretty decent usage and since then, thanks to all of you, we've grown tenfold. Today I want to start by sharing some of t ...
X @Sam Altman
Sam Altman· 2025-09-19 01:36
RT Tibo (@thsottiaux)/review now in Codex CLIUse it to get a quick review from codex on your code changes locally. Will expand a lot in coming days, send Daniel your feedback! ...
GPT-5 Codex is nuts...
Matthew Berman· 2025-09-15 22:31
Product Overview - OpenAI releases GPT5 Codeex, optimized for agentic coding, available in various environments like terminal, IDE, GitHub, and ChatGPT iOS app [1][2] - GPT5 Codeex is included with ChatGpt Plus Pro business edu and enterprise plans [3] Performance Benchmarks - GPT5 Codeex achieves 74.5% on SWEBench verified, a slight improvement over GPT5's 72.8% [3] - Code refactoring sees a significant improvement with GPT5 Codeex at 51.3% compared to GPT5's 33.9% [3] - GPT5 Codeex can work independently for over 7 hours on complex tasks [4] - GPT5 Codeex uses 93.7% fewer tokens than GPT5 for simpler tasks but spends twice as long on complex use cases [6] - GPT5 Codeex reduces incorrect comments to 4.4% compared to GPT5's 13.7% and increases high impact comments to 52.4% from 39.4% [7] Features and Capabilities - Codeex is trained for code reviews, identifying critical flaws by navigating codebase, reasoning through dependencies, and running code and tests [6] - Codeex CLI updates include better formatted tool calls and diffs, simplified approval modes, and conversation state compaction [12][13] - Codeex automates environment setup by scanning for setup scripts and fetching dependencies at runtime [15] - Codeex can spin up its own browser, iterate on its builds, and attach screenshots to tasks and GitHub PRs [15] - Codeex reviews PRs by matching stated intent to the actual diff, reasoning over the codebase, and executing code and tests [16] Windsurf Integration - Windsurf is highlighted as a powerful agentic IDE, especially after being acquired by Cognition [9] - Windsurf offers features like deep wiki, vibe, replace, one-click MCP store, sophisticated memory, and deep integration with Devon [10][11] Pricing and Availability - Pro plan at $200 per month can support a full work week across multiple projects, positioning it as an additional developer [19] - Business plans offer credit purchases for exceeding included limits, while enterprise plans provide a shared credit pool [20] Infrastructure Improvements - Cloud infrastructure performance is improved by caching containers, reducing medium completion time for new tasks and follow-ups by 90% [14]
AI powered entomology: Lessons from millions of AI code reviews — Tomas Reimers, Graphite
AI Engineer· 2025-07-22 19:50
[Music] Thank you all so much for coming to this talk. Um, thank you for being at this conference. Generally, my name is Tomas.I'm one of the co-founders of Graphite and I'm here to talk to you around AI power entomology. If you don't know, entomology is the study of bugs. It's something that we do.We is very near and dear to our heart and part of what our product does. So, Graphite, for those of you that don't know, builds a product called Diamond. Diamond is an AI powered code reviewer.You go ahead, you u ...
Don’t get one-shotted: Use AI to test, review, merge, and deploy code — Tomas Reimers, Graphite
AI Engineer· 2025-06-27 10:25
Industry Trends - Software development has two loops: an inner loop focused on development and an outer loop focused on review [1] - AI adoption is increasing among developers, with nearly every developer surveyed using AI tools [2] - 46% of code on GitHub is being written by AI, indicating a significant shift in code generation [3] - The inner loop is changing due to AI, making developers more productive and producing higher volumes of code [3][4] - The outer loop is becoming a bottleneck as developers have to review, test, merge, and deploy higher volumes of code [5] Graphite's Solution (Diamond) - Graphite aims to create a new outer loop to address the challenges posed by increased code volume [6] - Graphite's AI code review platform, Diamond, focuses on high signal, low noise, deep understanding of codebase and change history [13] - Diamond summarizes, prioritizes, and reviews each change, integrating with CI and testing infrastructure [13] - Diamond aims to reduce code review cycles, enforce quality and consistency, and keep code private and secure [13] - AI-generated feedback from Diamond's comments are accepted at a 52% rate, higher than human comments (45-50%) [15][16]