Workflow
Avi Chawla
icon
Search documents
X @Avi Chawla
Avi Chawla· 2025-09-25 06:34
General Overview - The author encourages readers to reshare the content if they found it insightful [1] - The author shares tutorials and insights on Data Science (DS), Machine Learning (ML), Large Language Models (LLMs), and Retrieval-Augmented Generation (RAGs) daily [1] Author Information - The author can be found on Twitter/X with the handle @_avichawla [1]
X @Avi Chawla
Avi Chawla· 2025-09-25 06:34
Building Agents is about engineering “behavior” at scale. So you cannot vibe-prompt an Agent and expect it to work.Parlant gives the structure to build Agents that behave exactly as instructed.GitHub repo: https://t.co/kjVj5Rp7Xm(don't forget to star it ⭐) ...
X @Avi Chawla
Avi Chawla· 2025-09-25 06:33
AI Risk & Failure - AI failures can lead to job loss [1] - Building robust Agents is crucial to avoid production failures [1] Financial Losses due to AI - Zillow experienced a $304 million loss due to its home-buying AI [1] - iTutor paid $365 thousand when AI rejected older applicants [1] Examples of AI Failures - Replit's Agent wiped out a production database [1]
X @Avi Chawla
Avi Chawla· 2025-09-24 21:05
RT Avi Chawla (@_avichawla)Pytest for LLM Apps is finally here!DeepEval turns LLM evals into a two-line test suite to help you identify the best models, prompts, and architecture for AI workflows (including MCPs).Works with all frameworks like LlamaIndex, CrewAI, etc.100% open-source with 11k stars! https://t.co/Xayu1aFGFV ...
X @Avi Chawla
Avi Chawla· 2025-09-24 06:33
LLM Evaluation Tools - DeepEval transforms LLM evaluations into a two-line test suite [1] - DeepEval helps identify the best models, prompts, and architecture for AI workflows, including MCPs (Multi-Choice Preference) [1] - DeepEval is 100% open-source with 11 thousand stars [1] Framework Compatibility - DeepEval works with all frameworks like LlamaIndex, CrewAI, etc [1] Community Engagement - The author encourages readers to reshare the information [1] - The author shares tutorials and insights on DS (Data Science), ML (Machine Learning), LLMs (Large Language Models), and RAGs (Retrieval-Augmented Generation) daily [1]
X @Avi Chawla
Avi Chawla· 2025-09-24 06:33
Repository Information - GitHub repository is available at https://t.co/LfM6AdsO74 [1] - Encouragement to star the GitHub repository [1]
X @Avi Chawla
Avi Chawla· 2025-09-24 06:33
Pytest for LLM Apps is finally here!DeepEval turns LLM evals into a two-line test suite to help you identify the best models, prompts, and architecture for AI workflows (including MCPs).Works with all frameworks like LlamaIndex, CrewAI, etc.100% open-source with 11k stars! https://t.co/Xayu1aFGFV ...
X @Avi Chawla
Avi Chawla· 2025-09-23 20:05
RT Avi Chawla (@_avichawla)Researchers from AssemblyAI built a state-of-the-art model that:- transcribes speech across 99 languages.- works even if the audio has many speakers.- outperforms Deepgram and OpenAI models.And much more.(2-step setup below) https://t.co/7eg0zpE4pM ...
X @Avi Chawla
Avi Chawla· 2025-09-23 06:35
That's a wrap!If you found it insightful, reshare it with your network.Find me → @_avichawlaEvery day, I share tutorials and insights on DS, ML, LLMs, and RAGs.AAvi Chawla (@_avichawla):Researchers from AssemblyAI built a state-of-the-art model that:- transcribes speech across 99 languages.- works even if the audio has many speakers.- outperforms Deepgram and OpenAI models.And much more.(2-step setup below) https://t.co/7eg0zpE4pM ...
X @Avi Chawla
Avi Chawla· 2025-09-23 06:35
Try it here with zero setup: https://t.co/ZIfq4ugtkYAssemblyAI can:- Process 1 hr of speech in ~35s- Provide industry-leading accuracy of 93.3%- Support diarization to detect multiple speakers- Detect speech in 99 languagesThanks to AssemblyAI for working with me today! ...