AI Engineer
Search documents
Skills at Scale — Nick Nisi and Zack Proser, WorkOS
AI Engineer· 2026-05-06 17:00
Chat interfaces are no longer limited to walls of text. In this talk, Liad Yosef and Ido Salomon explain how MCP Apps turn tools into interactive UI inside hosts like ChatGPT, Claude, VS Code, Cursor, and Copilot, letting companies send branded, functional app experiences instead of plain text responses. The session covers the core architecture behind MCP Apps, how UI is passed over MCP, how interactions stay in context through the host, and why this changes how applications get distributed in an agent-firs ...
Multi-Agent Systems That Ship for Days — Luke Alvoeiro, Factory
AI Engineer· 2026-05-06 15:00
[music] >> Hi everyone. My name is Luke and my goal is that 20 minutes from now you'll be able to assemble agent teams that can complete tasks orders of magnitude harder than what you can complete with a single agent today. A little bit about me.So I come from a background in dev tools. About 2 and 1/2 years ago I started a project at Block which is where I was working at the time. And that project evolved into Goose.Goose is now one of the leading coding agents is open source and it's recently was was dona ...
MCP UI: Extending the frontier — Liad Yosef and Ido Salomon, MCP Apps
AI Engineer· 2026-05-06 13:00
Chat interfaces are no longer limited to walls of text. In this talk, Liad Yosef and Ido Salomon explain how MCP Apps turn tools into interactive UI inside hosts like ChatGPT, Claude, VS Code, Cursor, and Copilot, letting companies send branded, functional app experiences instead of plain text responses. The session covers the core architecture behind MCP Apps, how UI is passed over MCP, how interactions stay in context through the host, and why this changes how applications get distributed in an agent-firs ...
The Small Model Infrastructure Nobody Built (So We Did) — Filip Makraduli, Superlinked
AI Engineer· 2026-05-05 17:00
Most embedding infrastructure assumes you know exactly which model you want ahead of time. This talk starts where that assumption breaks. Filip Makraduli walks through the real profiling mistakes, infrastructure gaps, and production constraints that led to building an embedding inference engine designed for dynamic model loading, hot-swapping, and memory-aware eviction instead of brittle one-model-per-container deployments. If you're working on small-model inference, embeddings, or GPU infrastructure, this ...
Accelerating AI on Edge — Chintan Parikh and Weiyi Wang, Google DeepMind
AI Engineer· 2026-05-05 15:00
As models get smaller and more capable, more AI workloads can move onto the device itself. In this talk, Chintan Parikh from Google DeepMind walks through what that looks like in practice, from Gemma 4 edge models and on-device agent skills to the real tradeoffs around latency, privacy, cost, and cross-platform deployment. The session covers LiteRT, the Google AI Edge stack for running models across Android, iOS, desktop, web, and IoT, along with demos of local tool calling, structured output, reasoning, be ...
Let AI Agents Tell You What They Need — Raj Navakoti, IKEA
AI Engineer· 2026-05-05 13:00
Enterprise teams spend a lot of time trying to guess what AI agents need to know. This workshop flips that around. Instead of curating context top-down, Raj Navakoti shows how to build a demand-driven context base by giving agents real problems, watching where they fail, and using those failures to reveal exactly what knowledge is missing. Using practical exercises and real examples from IKEA Digital, the session walks through how to grow a knowledge base problem by problem, structure it in Markdown, and us ...