Workflow
Future of Evals
Greylockยท2025-09-30 19:43

AI Model Evaluation (Eval) Industry Trends - Eval remains a core driver for building great AI software, expected to be relevant in the future [2] - The implementation of running evals has changed significantly and will continue to evolve [2] - Updates based on eval results have transitioned from slow and manual to fast and manual [3] - The industry anticipates a shift towards faster updates that are partially or entirely automatic [3] Future of Human-AI Interaction in Evals - Human interaction with evals will evolve from analyzing dashboards to a collaborative process with LLM systems suggesting changes [3][4] - LLM systems may contextualize why changes should be made based on eval results [4] Brain Trust's Perspective - Brain Trust was founded partly due to the lack of significant changes in evals prior to its inception [1] - Brain Trust is excited about the anticipated shift in how humans interact with evals [4]