Agents

Search documents
X @Avi Chawla
Avi Chawla· 2025-08-06 19:13
12 MCP, RAG, and Agents cheat sheets covering:- Function calling & MCP for LLMs- 4 stages of training LLMs from scratch- Training LLMs using other LLMs- Supervised & Reinforcement fine-tuning- RAG vs Agentic RAG, and more.Check the detailed thread below 👇 https://t.co/erWhHLhldqAvi Chawla (@_avichawla):12 MCP, RAG, and Agents cheat sheets for AI engineers (with visuals): ...
企业级LLM:性能为王,开源采用趋于平缓 | Jinqiu Select
锦秋集· 2025-08-03 04:31
在性能至上的铁律下,多数人眼中"开源"的未来正面临前所未有的挑战。 Me n l o Ve n t u r e s最新发布的2 0 2 5年中期报告基于对1 5 0家以上企业及初创公司的技术决策 者调查揭示了一个令人震惊的行业转向: 当市场沉迷于开源模型的成本优势时,企业级LLM API支出却在过去六个月内从3 5亿美元 增长至8 4亿美元,实现翻倍增长,而闭源模型正在这个价值千亿的市场中建立难以逾越的 性能护城河。 这份报告的核心观点如下: 锦 秋 基 金 ( 公 众 号 : 锦 秋 集 ;ID : j q c a p it a l ) 认 为 , 虽 然 Me n l o Ve n t u r e s 是 An t h r o p i c 的 投资人,可能立场上有倾向,但这篇报告在一定程度上反映了企业用户对AI产品采购的核 心考量与实际偏好,因此也做了编译。 " 企 业 级 市 场 中 , 开 源 模 型 的 采 用 率 趋 于 平 稳 , 因 其 性 能 仍 落 后 闭 源 模 型 9 到 1 2 个 月。" 尽 管 开 源 模 型 具 备 成 本 和 定 制 优 势 , 但 性 能 差 距 、 部 署 ...
Building Agents at Cloud Scale — Antje Barth, AWS
AI Engineer· 2025-08-02 18:15
Let's explore practical strategies for building and scaling agents in production. Discover how to move from local MCP implementations to cloud-scale architectures and how engineering teams leverage these patterns to develop sophisticated agent systems. Expect a mix of demos, use case discussions, and a glimpse into the future of agentic services! About Antje Barth Antje Barth is a Principal Developer Advocate at AWS, based in San Francisco. She frequently speaks at AI engineering conferences, events, and me ...
The 2025 AI Engineering Report — Barr Yaron, Amplify
AI Engineer· 2025-08-01 22:51
AI Engineering Landscape - The AI engineering community is broad, technical, and growing, with the "AI Engineer" title expected to gain more ground [5] - Many seasoned software developers are AI newcomers, with nearly half of those with 10+ years of experience having worked with AI for three years or less [7] LLM Usage and Customization - Over half of respondents are using LLMs for both internal and external use cases, with OpenAI models dominating external, customer-facing applications [8] - LLM users are leveraging them across multiple use cases, with 94% using them for at least two and 82% for at least three [9] - Retrieval-Augmented Generation (RAG) is the most popular customization method, with 70% of respondents using it [10] - Parameter-efficient fine-tuning methods like LoRA/Q-LoRA are strongly preferred, mentioned by 40% of fine-tuners [12] Model and Prompt Management - Over 50% of respondents are updating their models at least monthly, with 17% doing so weekly [14] - 70% of respondents are updating prompts at least monthly, and 10% are doing so daily [14] - A significant 31% of respondents lack any system for managing their prompts [15] Multimodal AI and Agents - Image, video, and audio usage lag text usage significantly, indicating a "multimodal production gap" [16][17] - Audio has the highest intent to adopt among those not currently using it, with 37% planning to eventually adopt audio [18] - While 80% of respondents say LLMs are working well, less than 20% say the same about agents [20] Monitoring and Evaluation - Most respondents use multiple methods to monitor their AI systems, with 60% using standard observability and over 50% relying on offline evaluation [22] - Human review remains the most popular method for evaluating model and system accuracy and quality [23] - 65% of respondents are using a dedicated vector database [24] Industry Outlook - The mean guess for the percentage of the US Gen Z population that will have AI girlfriends/boyfriends is 26% [27] - Evaluation is the number one most painful thing about AI engineering today [28]
X @Polyhedra
Polyhedra· 2025-07-29 07:22
2/ We’re building @EggDotParty as a workspace where creators use agents to build, automate, and monetize, without needing crypto knowledge or 10 disconnected apps. ...
How to build Enterprise Aware Agents - Chau Tran, Glean
AI Engineer· 2025-07-24 09:22
[Music] Thanks Alex for the introduction. That was a very impressive LLM generated summary of me. Uh I've never heard it before but uh nice.Um so um today I'm going to talk to you about something that has been keeping me up at night. Uh probably some of you too. So how to build enterprise aware agents.How to bring the brilliance of AI into the messy complex realities of uh how your business operated. So let's jump straight to the hottest question of the month for AI builders. Uh should I build workflows or ...
X @Avi Chawla
Avi Chawla· 2025-07-23 19:16
RT Avi Chawla (@_avichawla)Building front-end Agentic apps just got 10x easier (open-source)!If you're building apps where Agents are part of the interface, not just running in the background, AG-UI protocol has become the standard.For context:- MCP connects agents to tools- A2A connects agents to other agents- AG-UI connects agents to usersIt defines a common interface between Agents and the UI layer.AG-UI itself is Agent framework agnostic, and it lets you:- stream token-level updates- show tool progress ...
X @Avi Chawla
Avi Chawla· 2025-07-23 06:30
If you found it insightful, reshare it with your network.Find me → @_avichawlaEvery day, I share tutorials and insights on DS, ML, LLMs, and RAGs.Avi Chawla (@_avichawla):Building front-end Agentic apps just got 10x easier (open-source)!If you're building apps where Agents are part of the interface, not just running in the background, AG-UI protocol has become the standard.For context:- MCP connects agents to tools- A2A connects agents to https://t.co/Sg41TjXiiw ...
X @Avi Chawla
Avi Chawla· 2025-07-23 06:30
Building front-end Agentic apps just got 10x easier (open-source)!If you're building apps where Agents are part of the interface, not just running in the background, AG-UI protocol has become the standard.For context:- MCP connects agents to tools- A2A connects agents to other agents- AG-UI connects agents to usersIt defines a common interface between Agents and the UI layer.AG-UI itself is Agent framework agnostic, and it lets you:- stream token-level updates- show tool progress in real time- share mutable ...
美股AI巨头&季报:值得关注的产业变化
2025-07-16 06:13
Summary of Conference Call Industry Overview - The conference focused on the U.S. AI industry and stock market changes, highlighting significant movements by major companies like NVIDIA and Microsoft [1][2] - The discussion emphasized the evolving landscape of AI, particularly the introduction of next-generation Internet concepts such as Agents Network and Agent Web [2][10] Key Companies and Developments NVIDIA - NVIDIA's NVLink Fusion product was a major highlight, showcasing advancements in AI chip architecture and model training capabilities [3][4] - NVLink has evolved from version 1.0 in 2016 to version 5.0, enhancing interconnectivity between different computing units [3] - The company is adapting to customer needs by offering customized solutions for AI and IoT applications, indicating a shift towards more tailored products [4][5] - NVIDIA's partnerships with companies like Boton and Marvell are expanding its market reach, particularly in customized chip solutions [5][6] - The anticipated launch of larger AI clusters (up to 500,000 cards) for model training is expected by the end of the third quarter [9] Microsoft - Microsoft's Build conference emphasized the concept of Agents as a core focus, with a shift towards a more integrated Internet experience [10][11] - The company is primarily targeting B2B markets, but its progress in AI model development is perceived as average compared to competitors like Google [11][12] - Microsoft’s token generation in the last quarter was about 1 million tokens, significantly lower than Google's performance [11] Google - Google’s IoT conference was noted for its comprehensive approach to AI, with a focus on its Gemini model and various cloud-based products [12][14] - The company is leading in AI commercialization, with a monthly token processing capacity significantly higher than Microsoft’s [15][17] - Google’s AI strategy includes a robust framework for developers and a strong emphasis on integrating AI into its existing products [14][15] OpenAI - OpenAI's acquisition of the design company IOU for $6.5 billion aims to enhance its hardware product offerings, indicating a strategic move towards the next generation of Internet [18][20] - The focus on hardware development is seen as crucial for maintaining competitive advantage in the evolving AI landscape [21][22] Additional Insights - The conference highlighted the competitive dynamics between major players in the AI space, with NVIDIA, Google, and OpenAI positioned as leaders [15][20] - The discussion also touched on the importance of product design and innovation, particularly in the context of hardware development in Silicon Valley [22] - Future trends in AI are expected to revolve around the integration of agents and the development of platforms that unify various AI applications [31] Conclusion - The AI industry is rapidly evolving, with significant advancements in technology and strategic partnerships among leading companies. The focus on customized solutions and the integration of AI into various sectors will likely shape the future landscape of the industry [31]