AI agents

Ship Production Software in Minutes, Not Months — Eno Reyes, Factory
AI Engineer· 2025-07-25 23:11
Hi everybody, my name is Eno. I really appreciate that introduction. Maybe I can start with a bit of background. I started working on LLMs about two and a half years ago, when GPT-3.5 was coming out and it became increasingly clear that agentic systems were going to be possible with the help of LLMs. At Factory we believe that the way that we use agents, in particular to build software, is going to radically change the field of software development. We're transitioning from the era ...
Beyond the Prototype: Using AI to Write High-Quality Code - Josh Albrecht, Imbue
AI Engineer· 2025-07-25 23:10
It's great to be here. So, I'm Josh Albrecht. I'm the CTO of Imbue. Our focus is on making more robust, useful AI agents. In particular, we're focusing on software agents right now. And the main product that we're working on today is called Sculptor. So, the purpose of Sculptor is to help us with something that we've all experienced. We've all tried these vibe coding tools: you tell it to go off and do something, and it goes off and creates a bunch of code for you. ...
X @Anthropic
Anthropic· 2025-07-24 17:21
New Anthropic research: Building and evaluating alignment auditing agents. We developed three AI agents to autonomously complete alignment auditing tasks. In testing, our agents successfully uncovered hidden goals, built safety evaluations, and surfaced concerning behaviors. https://t.co/HMQhMaA4v0 ...
Structuring a modern AI team — Denys Linkov, Wisedocs
AI Engineer· 2025-07-24 15:45
All right, thanks everybody for joining today. My name is Denys Linkov. I lead the machine learning team at Wisedocs and I'll be talking about hiring a modern AI team. So, who's heard this message before? We are now an AI-first company. We've seen companies like Shopify, Duolingo, and Zapier all make these announcements saying that they're AI-first companies and that there are new expectations: before you hire a person, you need to make the claim that you can't hire an AI agent or ...
X @Avalanche🔺
Avalanche🔺· 2025-07-24 15:03
AI agents are coming fast. But without their own L1, they’ll be locked in private silos. Youmio puts agents on-chain with transparent identity, memory, and provenance. Users stay in control. Developers get composable rails. It all starts here: https://t.co/cLTuR1t323 ...
Building Applications with AI Agents — Michael Albada, Microsoft
AI Engineer· 2025-07-24 15:00
Agentic Development Landscape
- The adoption of agentic technology is rapidly increasing, with a 254% increase in companies self-identifying as agentic in the last three years based on Y Combinator data [5]
- Agentic systems are complex, and while initial prototypes may achieve around 70% accuracy, reaching perfection is difficult due to the long tail of complex scenarios [6][7]
- The industry defines an agent as an entity that can reason, act, communicate, and adapt to solve tasks, viewing the foundation model as a base for adding components to enhance performance [8]
- The industry emphasizes that agency should not be the ultimate goal but a tool to solve problems, ensuring that increased agency maintains a high level of effectiveness [9][11][12]
Tool Use and Orchestration
- Exposing tools and functionalities to language models enables agents to invoke functions via APIs, but requires careful consideration of which functionalities to expose [14]
- The industry advises against a one-to-one mapping between APIs and tools, recommending grouping tools logically to reduce semantic collision and improve accuracy [17][18]
- Simple workflow patterns, such as single chains, are recommended for orchestration to improve measurability, reduce costs, and enhance reliability [19][20]
- For complex scenarios, the industry suggests considering a move to more agentic patterns and potentially fine-tuning the model [22][23]
Multi-Agent Systems and Evaluation
- Multi-agent systems can help scale the number of tools by breaking them into semantically similar groups and routing tasks to appropriate agents [24][25]
- The industry recommends investing more in evaluation to address the numerous hyperparameters involved in building agentic systems [27][28]
- AI architects and engineers should take ownership of defining the inputs and outputs of agents to accelerate team progress [29][30]
- Tools like IntellAgent, Microsoft's PyRIT, and Label Studio can aid in generating synthetic inputs, red teaming agents, and building evaluation sets [33][34][35]
Observability and Common Pitfalls
- The industry emphasizes the importance of observability using tools like OpenTelemetry to understand failure modes and improve systems [38]
- Common pitfalls include insufficient evaluation, inadequate tool descriptions, semantic overlap between tools, and excessive complexity [39][40]
- The industry stresses the importance of designing for safety at every layer of agentic systems, including building tripwires and detectors [41][42]
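The summary's advice on grouping tools and routing tasks to the right agent can be illustrated with a minimal sketch. Everything here is hypothetical and not taken from the talk: the `ToolGroup` type, the `billing` and `calendar` groups, their tool names, and the keyword-overlap router, which stands in for whatever model-based router a real system would use.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class ToolGroup:
    """A small set of semantically related tools owned by one agent."""
    name: str
    keywords: frozenset
    tools: Dict[str, Callable[[str], str]]


def route(task: str, groups: List[ToolGroup]) -> Optional[ToolGroup]:
    """Pick the group whose keywords overlap most with the task text.

    Keyword overlap is only a placeholder for a model-based router.
    """
    words = set(task.lower().split())
    best = max(groups, key=lambda g: len(g.keywords & words), default=None)
    if best is None or not (best.keywords & words):
        return None  # no group matches; the caller should fall back or escalate
    return best


# Hypothetical tools, grouped by domain instead of one tool per API endpoint.
billing = ToolGroup(
    name="billing",
    keywords=frozenset({"invoice", "refund", "payment"}),
    tools={"lookup_invoice": lambda q: f"invoice result for: {q}"},
)
calendar = ToolGroup(
    name="calendar",
    keywords=frozenset({"meeting", "schedule", "reschedule"}),
    tools={"find_slot": lambda q: f"open slot for: {q}"},
)

GROUPS = [billing, calendar]

if __name__ == "__main__":
    chosen = route("Please refund the last invoice", GROUPS)
    print(chosen.name if chosen else "no match")
```

Keeping each group small means each agent sees only a handful of tool descriptions, which is the mechanism the summary credits with reducing semantic collision between tools.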
Introducing LlamaIndex FlowMaker, an open source GUI for building LlamaIndex Workflows
LlamaIndex· 2025-07-24 14:00
Core Functionality
- LlamaIndex introduces FlowMaker, an experimental open-source visual agent builder enabling AI agent creation via drag-and-drop without coding [1]
- FlowMaker automatically generates TypeScript code for visual flows [1]
- The platform integrates with LlamaCloud indexes and tools [1]
- It offers an interactive browser testing environment for real-time feedback [1]
Key Features
- FlowMaker features a visual drag-and-drop interface for no-code agent development [1]
- It supports complex flow patterns with loops and conditional logic [1]
Use Cases
- FlowMaker facilitates basic agent creation by connecting user input nodes to language models [1]
- It enables tool integration, demonstrated by a resume-searching agent using LlamaCloud indexes [1]
- The platform allows implementing decision logic, conditional branching, and loop-back mechanisms for intelligent conversation routing [1]
Feedback
- LlamaIndex is actively seeking user feedback on FlowMaker [1]
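The conditional branching and loop-back pattern mentioned in the use cases can be sketched in plain Python. This is not FlowMaker's generated TypeScript and does not use any LlamaIndex API; `run_flow`, `is_complete`, and the `max_turns` cap are hypothetical names chosen for illustration.

```python
from typing import Callable, List


def run_flow(
    messages: List[str],
    is_complete: Callable[[str], bool],
    max_turns: int = 3,
) -> str:
    """Walk user messages through a branch-and-loop conversation flow."""
    collected: List[str] = []
    for message in messages[:max_turns]:
        collected.append(message)
        request = " ".join(collected)
        if is_complete(request):
            # Conditional branch: enough information, hand off to the answer step.
            return f"answer({request})"
        # Otherwise loop back and wait for the next user message.
    # Turn budget exhausted without a complete request.
    return "escalate"


if __name__ == "__main__":
    # Hypothetical completeness check for a resume-search request.
    def needs_met(request: str) -> bool:
        return "resume" in request and "python" in request

    print(run_flow(["find a resume", "with python skills"], needs_met))
```

Capping the number of loop-backs is a design choice of this sketch, so that a conversation which never supplies enough detail ends in an explicit escalation instead of looping indefinitely.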
X @The Wall Street Journal
The Wall Street Journal· 2025-07-24 13:53
Exclusive: Walmart built so many AI agents, things started to get confusing. Now the retail giant is looking to simplify. https://t.co/FxdgFZF1OC ...