How to Train Your Agent: Building Reliable Agents with RL - Kyle Corbitt, OpenPipe
AI Engineer · 2025-07-19 21:12
[Music] Hey everyone. Glad you're all here. This is the reasoning and reinforcement learning track on the afternoon of the last day of the AI Engineer World's Fair. Glad you're all here. Glad you're sharing it with us. Today, what I'm going to talk about is a very specific case study that we did. In this case study, I'm going to talk about lessons learned very concretely: what did and didn't work, and how we were able to build an agent that worked well with reinforcement learning. All of this eve ...