Cloud Scale

Search documents
Building Agents at Cloud Scale — Antje Barth, AWS
AI Engineer· 2025-08-02 18:15
Let's explore practical strategies for building and scaling agents in production. Discover how to move from local MCP implementations to cloud-scale architectures and how engineering teams leverage these patterns to develop sophisticated agent systems. Expect a mix of demos, use case discussions, and a glimpse into the future of agentic services! About Antje Barth Antje Barth is a Principal Developer Advocate at AWS, based in San Francisco. She frequently speaks at AI engineering conferences, events, and me ...
Ask the Experts Multi Tenancy Final
DDN· 2025-07-25 10:19
AI Infrastructure Challenges - AI workloads (inference, training, RAG) competing for resources can cause performance bottlenecks and delays [1] - Mixed-tenant AI loads can lead to noisy-neighbor issues, impacting performance [1] Solutions & Benefits - Next-gen AI infrastructure provides full control over the environment, regardless of workload complexity [1] - Dynamic resource isolation prevents noisy-neighbor issues [1] - Efficient scaling of AI infrastructure while maintaining performance is achievable [1] Key Learning Objectives - Guarantee performance under heavy, mixed-tenant AI loads [1] - Prevent noisy-neighbor issues with dynamic resource isolation [1] - Scale AI infrastructure efficiently while maintaining performance [1]
Ship it! Building Production Ready Agents — Mike Chambers, AWS
AI Engineer· 2025-06-27 10:45
Generative AI and Agent Technology - Amazon Web Services (AWS) specializes in generative AI, evolving from machine learning [1] - The presentation focuses on deploying generative AI agents to cloud scale, targeting both developers and leaders [1] - The core components of an agent include a model for natural language understanding, a prompt defining the agent's role, an agentic loop for processing input and using tools, history for maintaining context, and tools for external interaction [1][2] - AWS Bedrock offers a suite of capabilities for building generative AI components, including models from Anthropic, Meta, and Mistral [2] - Amazon Bedrock Agents is a fully managed service for deploying agents without infrastructure management [2] Practical Implementation and Tools - The demonstration uses a simple Python agent with a dice rolling tool, initially running locally on a laptop with the Llama 3 8 billion parameter model [1] - The agent is configured with instructions (similar to a prompt) and action groups, which connect to tools [2] - Lambda functions are used to host the tools, enabling them to perform various actions, including interacting with other AWS services [2] - The AWS console provides a user interface for creating and configuring agents, including defining parameters and descriptions for tools [3][4][5][6][7][8][9][10][11][12][13][14][15] - Amazon Q developer is integrated into the console's code editor, offering code suggestions [17][18][19][20][21] Deployment and Scalability - The presentation emphasizes deploying agents to a production-ready, cloud-scale environment [1] - Infrastructure as code frameworks like Terraform, Palumi, and CloudFormation can be used for deployment [3] - AWS offers free courses on deeplearning.ai with AWS environments for experimenting with Amazon Bedrock Agents [25]