AI Engineer
AI Engineering with the Google Gemini 2.5 Model Family - Philipp Schmid
AI Engineer· 2025-07-11 19:00
Event Overview - Workshop focused on learning to use Gemini 2.5 Pro with Agentic tooling and MCP Servers [1] - Workshop was recorded at the AI Engineer World's Fair in San Francisco [1] Speaker Information - Philipp Schmid is a Senior AI Developer Relations Engineer at Google DeepMind [1] - Philipp Schmid's mission is to help developers create and benefit from AI responsibly [1] Resources - Newsletter available for updates on upcoming events and content [1] - Newsletter signup link: https://www.ai.engineer/newsletter [1]
The New Code — Sean Grove, OpenAI
AI Engineer· 2025-07-11 16:00
Core Argument - In the age of AI-driven software development, the ability to precisely communicate intent through specifications is paramount, surpassing the importance of coding itself [1] - Specifications, rather than prompts or code, are emerging as the fundamental unit of programming, positioning spec-writing as a critical skill [1] - Rigorous, versioned specifications serve as the single source of truth, compiling into documentation, evaluations, model behaviors, and potentially code [1] Technical Focus - The industry emphasizes the need for executable specifications in AI systems to align human teams and machine intelligence, drawing a parallel to the US Constitution [1] - OpenAI's Model Spec is presented as a real-world example of executable specifications [1] Future Implications - The industry anticipates a shift in developer tooling, where communication becomes the most important artifact in engineering [1]
Boris explains Claude Code
AI Engineer· 2025-07-10 20:30
Product Development & Engineering - Anthropic's Claude Code aims for a more general model with exponential capability increase [1] - Claude is used to summarize weekly git commits, aiding in tracking progress [2] - Claude facilitates Test-Driven Development (TDD) [2] AI & Automation - Claude is now available on GitHub [1] - AI coding tools, specifically models, are improving TDD effectiveness [2]
Production software keeps breaking and it will only get worse — Anish Agarwal, Traversal.ai
AI Engineer· 2025-07-10 16:29
Problem Statement - The current software engineering workflow is inefficient, with too much time spent on troubleshooting production incidents [2][9] - Existing approaches to automated troubleshooting, such as AIOps and LLMs, have fundamental limitations [10][11][12][13][14][15][16][17][18] - Troubleshooting is becoming harder because of AI-generated code and increasingly complex systems [3][4] Solution: Traversal's Approach - Traversal combines causal machine learning (statistics), reasoning models (semantics), and a novel agentic control flow (swarms of agents) for autonomous troubleshooting [19][20][21][22][23][24] - Causal machine learning helps identify cause-and-effect relationships in data, addressing the issue of correlated failures [20][21] - Reasoning models provide semantic understanding of logs, metrics, and code [22] - Swarms of agents enable an efficient exhaustive search through telemetry data [23][24] Results and Impact - Traversal has achieved a 40% reduction in mean time to resolution (MTTR) for DigitalOcean, a cloud provider serving hundreds of thousands of customers [32][37] - Traversal AI orchestrates a swarm of expert AIs to sift through petabytes of observability data in parallel, providing users with the root cause of incidents within five minutes [39][40] - Traversal integrates with various observability tools, processing trillions of logs [45] Future Applications - The principles of exhaustive search and swarms of agents can be applied to other domains such as network observability and cybersecurity [47]
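The "swarm of agents" idea in the summary above, an exhaustive search over telemetry fanned out in parallel, can be sketched roughly as follows. This is only an illustration under assumptions: the shard layout, the `score_shard` heuristic, and every name here are hypothetical and are not Traversal's implementation. A real agent would presumably call a reasoning model and a causal analysis step rather than count error lines; the sketch shows only the fan-out and aggregate shape.

```python
from concurrent.futures import ThreadPoolExecutor


def score_shard(shard):
    """Hypothetical 'agent': score one telemetry shard by its error rate."""
    name, lines = shard
    errors = sum(1 for line in lines if "ERROR" in line)
    rate = errors / len(lines) if lines else 0.0
    return name, rate


def rank_suspects(shards, max_workers=8):
    """Run one worker per shard in parallel, then rank shards by score."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        scores = list(pool.map(score_shard, shards.items()))
    # Highest error rate first: the most suspicious shard leads the list.
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    # Made-up telemetry, keyed by a hypothetical service name.
    telemetry = {
        "checkout": ["INFO ok", "ERROR timeout", "ERROR timeout"],
        "search": ["INFO ok", "INFO ok"],
        "auth": ["INFO ok", "ERROR denied", "INFO ok", "INFO ok"],
    }
    for service, rate in rank_suspects(telemetry):
        print(f"{service}: {rate:.2f}")
```

Every shard is examined (the exhaustive part) while the work runs concurrently, so total latency is bounded by the slowest worker rather than the sum of all of them.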
Thinking Deeper in Gemini — Jack Rae, Google DeepMind
AI Engineer· 2025-07-10 16:00
Model Development & Architecture - Gemini Thinking is presented as a solution to address limitations in test-time compute, marking progress towards general intelligence [1] - The industry focuses on identifying fundamental intelligence bottlenecks within existing models and developing solutions to improve architecture or training objectives [1] Capabilities & Steerability - Recent progress in Thinking is highlighted, emphasizing both capability and steerability improvements [1] Future Directions - The document outlines the future direction of the models, indicating ongoing development and evolution [1]
A year of Gemini progress + what comes next — Logan Kilpatrick, Google DeepMind
AI Engineer· 2025-07-10 07:00
Over the last year, Google and Gemini models have shown rapid progress across all dimensions (model, product, etc). Let's highlight all the work that has happened, how we got the world's best models, and where we are going next (across both the model landscape and our AI products). About Logan Kilpatrick Logan leads product for Google AI Studio and works on the Gemini API. Before Google, Logan led developer relations at OpenAI. Recorded at the AI Engineer World's Fair in San Francisco. Stay up to date on our ...
The Wild World of AI: 6 Months That Changed Everything
AI Engineer· 2025-07-10 03:23
There are all of these benchmarks full of numbers. I don't like the numbers. There are the leaderboards. I'm kind of beginning to lose trust in the leaderboards as well. So for my own work, I've been leaning increasingly into my own little benchmark, which started as a joke and has actually turned into something that I rely on quite a lot. And that's this. I prompt models with "generate an SVG of a pelican riding a bicycle". I have good reasons for this. Um, firstly, these are not image models. These are text ...
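A minimal harness for this kind of benchmark might look like the sketch below. It is an assumption-laden illustration, not the speaker's tooling: each model is represented by a hypothetical callable that takes a prompt and returns text, and the only automated check is that the returned SVG parses as XML. Judging how good the pelican looks is still left to a human.

```python
import re
import xml.etree.ElementTree as ET

PROMPT = "Generate an SVG of a pelican riding a bicycle"


def extract_svg(response_text):
    """Return the first <svg>...</svg> span in a model response, or None."""
    match = re.search(r"<svg\b.*?</svg>", response_text, re.DOTALL | re.IGNORECASE)
    return match.group(0) if match else None


def is_well_formed(svg_text):
    """Check only that the SVG parses as XML; says nothing about how it looks."""
    try:
        root = ET.fromstring(svg_text)
    except ET.ParseError:
        return False
    # The tag may be namespaced, e.g. "{http://www.w3.org/2000/svg}svg".
    return root.tag.split("}")[-1].lower() == "svg"


def run_benchmark(models):
    """models maps a label to a hypothetical callable(prompt) -> str."""
    results = {}
    for label, ask in models.items():
        svg = extract_svg(ask(PROMPT))
        results[label] = svg if svg and is_well_formed(svg) else None
    return results


if __name__ == "__main__":
    # Stand-in "models"; a real run would wrap actual API clients here.
    fake_models = {
        "model-a": lambda p: '<svg xmlns="http://www.w3.org/2000/svg"><circle r="5"/></svg>',
        "model-b": lambda p: "Sorry, I can only produce text.",
    }
    for label, svg in run_benchmark(fake_models).items():
        print(label, "ok" if svg else "no usable SVG")
```

Saving each valid result to a `.svg` file and opening it in a browser would be the natural next step for comparing models side by side.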
2025 in LLMs so far, illustrated by Pelicans on Bicycles — Simon Willison
AI Engineer· 2025-07-09 16:00
LLM Advancements - The field of LLMs has experienced significant advancements in the past 12 months [1] - The report reviews the latest models, free from vendor or employer influence [1] Speaker Information - Simon Willison is the creator of Datasette, an open source tool for exploring and publishing data [1] - Simon Willison was an engineering director at Eventbrite [1] - Simon Willison is a co-creator of the Django Web Framework [1] Event Information - The recording took place at the AI Engineer World's Fair in San Francisco [1] - Readers can stay updated on upcoming events and content by joining the newsletter [1]
Trends Across the AI Frontier — George Cameron, ArtificialAnalysis.ai
AI Engineer· 2025-07-08 16:00
Company Overview - Artificial Analysis is an independent benchmarking and insights company focused on helping developers and companies select appropriate AI models and technologies for application development [1] - The company provides extensive benchmarking results on its website, covering intelligence, performance, cost, and other factors [1] - Artificial Analysis develops reports to inform key strategic decisions related to AI [1] AI Industry Trends - The entire AI stack, from chips to infrastructure to models, is developing rapidly [1] - It is important to differentiate the signal from the noise in the rapidly evolving AI landscape [1] Expertise - Artificial Analysis' CEO, Micah Hill-Smith, has a background in AI engineering and strategy consulting with McKinsey & Company [1] - George Cameron is the CPO of Artificial Analysis [1] Events and Content - Artificial Analysis presented at the AI Engineer World's Fair in San Francisco [1] - The company encourages individuals to subscribe to its newsletter for updates on upcoming events and content [1]
Claude Code & the evolution of agentic coding - Boris Cherny
AI Engineer· 2025-07-04 16:00
[Music] Hello. This is awesome. This is a big crowd. Who here has used Claude Code before? Jesus. Awesome. That's what I like to see. Cool. So, my name is Boris. I'm a member of technical staff at Anthropic and creator of Claude Code. And um I was struggling with what to talk about for an audience that already knows Claude Code, already knows AI and all the coding tools and agentic coding and stuff like that. So, I'm going to zoom out a little bit and then we'll zoom back in. So here's my TL;DR. The model is moving really ...