Avi Chawla
Search documents
X @Avi Chawla
Avi Chawla· 2026-02-12 06:30
GitHub repo: https://t.co/gVKnEp8AI6(don't forget to star it ⭐ ) ...
X @Avi Chawla
Avi Chawla· 2026-02-12 06:30
Context engineering is the new bottleneck!Most of the work in building production agents today has nothing to do with the model.Instead, it's:→ Setting up PostgreSQL for conversation history→ Wiring S3 for file storage→ Writing custom logic to compress long contexts→ Building format converters between OpenAI and Anthropic→ Stitching together monitoring from scratchAnd you do this separately for every agent you build.A smarter approach is now actually implemented in Acontext, which is an open-source context ...
X @Avi Chawla
Avi Chawla· 2026-02-12 01:50
RT Avi Chawla (@_avichawla)Google.OpenAI.Anthropic.They're all working on the same problem for agents.How to let agents control the UI layer at runtime, rather than just output text.That's Generative UI, and it's built on three parts:Anthropic's MCP Apps + Google's A2UI + CopilotKit's AG-UIThese are the building blocks that power Generative UI behind agentic apps like Claude.Until now, bringing them into your app has been complex, with no clear resources to follow.But I found 2 resources that cover everythi ...
X @Avi Chawla
Avi Chawla· 2026-02-11 08:10
Developers guide: https://t.co/Uvxo09LZQEGitHub repo: https://t.co/GjASqxZrmF ...
X @Avi Chawla
Avi Chawla· 2026-02-11 08:10
Google.OpenAI.Anthropic.They're all working on the same problem for agents.How to let agents control the UI layer at runtime, rather than just output text.That's Generative UI, and it's built on three parts:Anthropic's MCP Apps + Google's A2UI + CopilotKit's AG-UIThese are the building blocks that power Generative UI behind agentic apps like Claude.Until now, bringing them into your app has been complex, with no clear resources to follow.But I found 2 resources that cover everything you need to get started. ...
X @Avi Chawla
Avi Chawla· 2026-02-10 06:30
Mini-SGLang GitHub:(don't forget to star 🌟)https://t.co/sf4ZbktapY ...
X @Avi Chawla
Avi Chawla· 2026-02-10 06:30
Learn how LLM inference actually works under the hood.vLLM has 100k+ lines of code. Mini-SGLang does the same core job in 5,000.It's a compact codebase that serves as both a capable inference engine and a transparent reference for researchers and devs. Something you can actually finish reading over a weekend.Here's what makes it special:↳ Clean, type-annotated code you can actually read↳ Radix cache to reuse KV cache across shared prefixes↳ Chunked prefill for long contexts without memory blowup↳ Tensor par ...
X @Avi Chawla
Avi Chawla· 2026-02-10 01:13
RT Avi Chawla (@_avichawla)Vector search is not always the answer.A 30-year-old algorithm with zero training, zero embeddings, and zero fine-tuning still powers Elasticsearch, OpenSearch, and most production search systems today.It's called BM25, and it's worth understanding why it refuses to die.Let's say you're searching for "transformer attention mechanism" in a library of ML papers.BM25 scores documents using three core ideas:1) Word rarity matters more than word frequencyEvery paper contains "the" and ...
X @Avi Chawla
Avi Chawla· 2026-02-09 06:30
This hybrid search stack I mentioned in the post is actually implemented in this open-source context retrieval layer for agents.GitHub repo: https://t.co/iU6P0KoaRf ...
X @Avi Chawla
Avi Chawla· 2026-02-09 06:30
Vector search is not always the answer.A 30-year-old algorithm with zero training, zero embeddings, and zero fine-tuning still powers Elasticsearch, OpenSearch, and most production search systems today.It's called BM25, and it's worth understanding why it refuses to die.Let's say you're searching for "transformer attention mechanism" in a library of ML papers.BM25 scores documents using three core ideas:1) Word rarity matters more than word frequencyEvery paper contains "the" and "is" so those words carry n ...