Production Systems
Search documents
Resolve AI's Spiros Xanthos on Building AI Agents that Keep Software Running
Greylock· 2025-11-04 23:48
AI in Software Engineering - AI models have solved coding, but not software engineering, as production speed is crucial [4] - Building AI to accelerate production is challenging due to reliability requirements and the need for multi-agent orchestration [5][6][7] - Resolve AI focuses on using AI to address the complexities of production systems, which involves more than just code [13] Resolve AI's Solution - Resolve AI provides AI site reliability engineer agents to troubleshoot alerts and incidents [11] - Resolve's agents can understand production systems from code to backend databases, offering a faster solution [11] - Customers are using Resolve AI for "vibe debugging," indicating usage beyond incidents and alerts, leading to increased product usage [12] Talent Acquisition - Resolve AI competes with companies like Meta, OpenAI, and Anthropic for AI engineers [14] - Resolve AI attracts top researchers by offering the opportunity to significantly impact the company and change software engineering [16] Future of Software Engineering - Humans will operate at a higher level of abstraction, with agents handling much of the work [17] - Underlying infrastructure and tools will adapt to be more suitable for agents [17]
Resolve AI's Spiros Xanthos on Building Agents that Keep Software Running
Greylock· 2025-11-03 16:30
Today's conversation is with Spiro Zanthos. Spiros is a serial entrepreneur who's achieved two successful exits to Splunk and VMware. Now he and his co-founder May Agaral are on a mission to rethink software engineering tools for an AI first world with their new company Resolve AI.In our conversation, Spiro shares the status of productivity gains from AI and software engineering. How Resolve was born after realizing he spent 15 years in infrastructure and observability software just creating more work for h ...
Optimizing inference for voice models in production - Philip Kiely, Baseten
AI Engineer· 2025-06-28 02:39
Key Optimization Goal - Aims to achieve Time To First Byte (TTFB) below 150 milliseconds for voice models [1] Technology and Tools - Leverages open-source TTS models like Orpheus, which have an LLM backbone [1] - Employs tools and optimizations such as TensorRT-LLM and FP8 quantization [1] Production Challenges - Client code, network infrastructure, and other outside-the-GPU factors can introduce latency [1] - Common pitfalls exist when integrating TTS models into production systems [1] Scalability and Customization - Focuses on scaling TTS models in production [1] - Extends the system to serve customized models with voice cloning and fine-tuning [1]