Avi Chawla
X @Avi Chawla
Avi Chawla· 2025-10-16 06:31
A great tool to estimate how much VRAM your LLMs actually need.
Alter the hardware config, quantization, etc., and it tells you about:
- Generation speed (tokens/sec)
- Precise memory allocation
- System throughput, etc.
No more VRAM guessing! https://t.co/FlaeMVaWmK ...
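For intuition about where the memory goes, here is a rough back-of-the-envelope sketch. It is not the linked tool's method. It only counts weights (parameters times bytes per weight) and the KV cache (two tensors per layer, one key and one value, per cached token), and it ignores activations and framework overhead. The function name and the model shape used in the example (32 layers, 32 KV heads, head dimension 128) are illustrative assumptions.

```python
GIB = 1024 ** 3


def estimate_vram_gib(n_params, bits_per_weight, n_layers, n_kv_heads,
                      head_dim, context_len, batch_size=1, kv_bits=16):
    """Rough lower bound on inference memory: weights plus KV cache only."""
    # Weights: one value per parameter at the chosen quantization width.
    weight_bytes = n_params * bits_per_weight / 8
    # KV cache: a key and a value vector per layer, per KV head, per cached token.
    kv_bytes = (2 * n_layers * n_kv_heads * head_dim
                * context_len * batch_size * kv_bits / 8)
    return {
        "weights_gib": weight_bytes / GIB,
        "kv_cache_gib": kv_bytes / GIB,
        "total_gib": (weight_bytes + kv_bytes) / GIB,
    }


if __name__ == "__main__":
    # Hypothetical 7B-parameter model shape, compared at 16-bit and 4-bit weights.
    for bits in (16, 4):
        est = estimate_vram_gib(7e9, bits, n_layers=32, n_kv_heads=32,
                                head_dim=128, context_len=4096)
        print(bits, {k: round(v, 2) for k, v in est.items()})
```

Quantization shrinks only the weight term here, so at long contexts or large batch sizes the KV cache can dominate the total.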
X @Avi Chawla
Avi Chawla· 2025-10-15 06:54
A time-complexity cheat sheet of 10 ML algorithms:
What's the inference time-complexity of KMeans? https://t.co/8qlDxpDubA ...
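One way to reason about the KMeans question: at inference time, a point is assigned to its nearest centroid, so each prediction compares the point against k centroids in d dimensions, which is O(k·d) per sample and O(n·k·d) for n samples. The sketch below is a plain-Python illustration of that loop, not any library's implementation; the function name is hypothetical.

```python
def kmeans_predict(point, centroids):
    """Return the index of the nearest centroid (squared Euclidean distance)."""
    best_index, best_dist = -1, float("inf")
    for index, centroid in enumerate(centroids):  # k iterations
        # d multiply-adds per centroid
        dist = sum((p - c) ** 2 for p, c in zip(point, centroid))
        if dist < best_dist:
            best_index, best_dist = index, dist
    return best_index


if __name__ == "__main__":
    centroids = [(0.0, 0.0), (10.0, 10.0)]
    print([kmeans_predict(p, centroids) for p in [(1.0, 1.0), (9.0, 8.0)]])
```

The cost does not depend on the size of the training set, only on the number of clusters and features.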
X @Avi Chawla
Avi Chawla· 2025-10-14 06:31
Finally, Python 3.14 lets you disable the GIL (in the free-threaded build)!
It's a big deal because earlier, even if you wrote multi-threaded code, Python could only execute Python code in one thread at a time, giving no performance benefit for CPU-bound work.
But now, Python can run your multi-threaded code in parallel.
And uv fully supports it! https://t.co/pfqh58En3K ...
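A small sketch for checking this on your own interpreter. It assumes Python 3.13 or newer for `sys._is_gil_enabled()` and falls back to "GIL enabled" on older versions. The helper names and the prime-counting workload are illustrative choices. On a regular build the threaded run should take roughly as long as doing the jobs one after another; on a free-threaded build (typically installed as `python3.14t`) it can spread across cores.

```python
import sys
import sysconfig
import time
from concurrent.futures import ThreadPoolExecutor


def gil_enabled():
    """True if the GIL is active in this process (always True before 3.13)."""
    check = getattr(sys, "_is_gil_enabled", None)
    return True if check is None else bool(check())


def free_threaded_build():
    """True if this interpreter was compiled with free-threading support."""
    return bool(sysconfig.get_config_var("Py_GIL_DISABLED"))


def count_primes(limit):
    """CPU-bound work: count primes below `limit` by trial division."""
    count = 0
    for n in range(2, limit):
        if all(n % d for d in range(2, int(n ** 0.5) + 1)):
            count += 1
    return count


def timed_run(workers, jobs, limit):
    """Run `jobs` copies of the workload on a thread pool and time it."""
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(count_primes, [limit] * jobs))
    return results, time.perf_counter() - start


if __name__ == "__main__":
    print("free-threaded build:", free_threaded_build())
    print("GIL enabled:", gil_enabled())
    for workers in (1, 4):
        _, seconds = timed_run(workers, jobs=4, limit=20_000)
        print(f"{workers} thread(s): {seconds:.2f}s")
```

Timings are deliberately not shown, since they depend on the build, the machine, and the core count.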
X @Avi Chawla
Avi Chawla· 2025-10-13 06:55
Cleanlab's GitHub repo: https://t.co/IiAR1sFSdJ (don't forget to star it ⭐) ...
X @Avi Chawla
Avi Chawla· 2025-10-13 06:55
This should be impossible!
You can clean any ML dataset in just three lines of code. Flag outliers, find label errors, and more, across:
- Any data (tabular, text, image, etc.)
- Any task (classification, entity recognition, etc.)
100% open-source, built by MIT researchers. https://t.co/xAaKjK4zIM ...
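To give a feel for how label errors can be flagged from model outputs, here is a simplified sketch in the spirit of confident learning, the research idea Cleanlab builds on. It is not Cleanlab's API or its exact algorithm. The function name, the thresholding rule as written, and the toy data are assumptions made for illustration. An example is flagged when the model is confidently in favor of a class other than its given label, where "confident" means the predicted probability reaches that class's average self-confidence.

```python
def find_label_issues(labels, pred_probs):
    """Return indices whose given label disagrees with a confident prediction."""
    n_classes = len(pred_probs[0])

    # Per-class threshold: mean predicted probability of class c
    # over the examples that are labeled c.
    thresholds = []
    for c in range(n_classes):
        confs = [p[c] for y, p in zip(labels, pred_probs) if y == c]
        thresholds.append(sum(confs) / len(confs) if confs else 1.0)

    issues = []
    for i, (y, p) in enumerate(zip(labels, pred_probs)):
        # Classes the model is confident about for this example.
        confident = [c for c in range(n_classes) if p[c] >= thresholds[c]]
        if confident:
            likely = max(confident, key=lambda c: p[c])
            if likely != y:
                issues.append(i)
    return issues


if __name__ == "__main__":
    labels = [0, 0, 0, 1, 1, 1]
    pred_probs = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9],
                  [0.2, 0.8], [0.1, 0.9], [0.3, 0.7]]
    print(find_label_issues(labels, pred_probs))
```

In the toy data, the third example is labeled 0 while the model assigns 0.9 to class 1, so it is the one that gets flagged. In practice the probabilities should come from out-of-sample predictions (for example, cross-validation), otherwise the model may simply have memorized the noisy labels.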
X @Avi Chawla
Avi Chawla· 2025-10-12 19:29
Core Problem of Traditional RAG
- Most retrieved chunks in traditional RAG setups do not effectively aid the LLM, which increases computational cost, latency, and the amount of context to process [1][5]
- Classic RAG fetches similar chunks from a vector database and feeds the retrieved context directly into the LLM [5]
REFRAG Solution by Meta AI
- Meta AI's REFRAG compresses and filters context at the vector level, focusing on relevance [1][2]
- REFRAG combines chunk compression, an RL-trained relevance policy, and selective expansion so that only essential information is processed [2]
- The process involves encoding documents, finding relevant chunks, using the relevance policy to select chunks, and concatenating token-level representations [3][4]
Performance Metrics of REFRAG
- REFRAG outperforms LLaMA on 16 RAG benchmarks [5][7]
- REFRAG achieves 30.85x faster time-to-first-token [5][7]
- REFRAG handles 16x larger context windows [5][7]
- REFRAG uses 2-4x fewer tokens, reducing compute consumption [5][7]
- REFRAG shows no accuracy loss across RAG, summarization, and multi-turn conversation tasks [7]
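The compress, select, expand flow described above can be sketched as a toy. This is not Meta's implementation: the bag-of-words "encoder" and the cosine-similarity ranking are stand-ins for the real chunk encoder and the RL-trained relevance policy, and all names here are hypothetical. The point is only to show why the decoder sees fewer input positions: each unselected chunk contributes one compressed vector slot, while selected chunks are expanded back into their tokens.

```python
import math
from collections import Counter


def embed(text):
    """Stand-in chunk encoder: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a)  # Counter returns 0 for missing words
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def build_decoder_input(query, chunks, expand_k=1):
    """Expand the top-k chunks to tokens; keep the rest as one slot each."""
    query_vec = embed(query)
    scores = [cosine(query_vec, embed(chunk)) for chunk in chunks]
    ranked = sorted(range(len(chunks)), key=lambda i: scores[i], reverse=True)
    expanded = set(ranked[:expand_k])

    positions = []
    for i, chunk in enumerate(chunks):
        if i in expanded:
            positions.extend(("token", tok) for tok in chunk.split())
        else:
            positions.append(("chunk_vector", i))  # one compressed slot
    return positions, expanded


if __name__ == "__main__":
    chunks = ["the gil limits python threads",
              "kmeans assigns points to centroids",
              "vram depends on model size"]
    positions, expanded = build_decoder_input("how does kmeans pick centroids", chunks)
    full_tokens = sum(len(c.split()) for c in chunks)
    print(f"expanded chunks: {sorted(expanded)}")
    print(f"decoder positions: {len(positions)} instead of {full_tokens}")
```

With three five-word chunks and one expansion, the decoder input shrinks from 15 token positions to 7, which is the kind of saving the "fewer tokens" and "faster time-to-first-token" claims refer to.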