Workflow
Avi Chawla
icon
Search documents
X @Avi Chawla
Avi Chawla· 2025-07-21 06:40
Those were the 4 stages of training an LLM from scratch.- Start with a randomly initialized model.- Pre-train it on large-scale corpora.- Use instruction fine-tuning to make it follow commands.- Use preference & reasoning fine-tuning to sharpen responses.Check this 👇 https://t.co/y273TGiFGM ...
X @Avi Chawla
Avi Chawla· 2025-07-21 06:39
4 stages of training LLMs from scratch, clearly explained (with visuals): ...
X @Avi Chawla
Avi Chawla· 2025-07-20 19:18
RT Avi Chawla (@_avichawla)I have been training neural networks for 9 years now.Here are 16 ways I actively use to optimize model training: ...
X @Avi Chawla
Avi Chawla· 2025-07-20 06:34
That's a wrap!If you found it insightful, reshare it with your network.Find me → @_avichawlaEvery day, I share tutorials and insights on DS, ML, LLMs, and RAGs.Avi Chawla (@_avichawla):I have been training neural networks for 9 years now.Here are 16 ways I actively use to optimize model training: ...
X @Avi Chawla
Avi Chawla· 2025-07-20 06:34
Those were 16 techniques that I actively use to optimize neural network training.If I missed something, please drop that in the replies.Here's the visual again for your reference 👇 https://t.co/7sGyTJAHtT ...
X @Avi Chawla
Avi Chawla· 2025-07-20 06:33
I have been training neural networks for 9 years now.Here are 16 ways I actively use to optimize model training: ...
X @Avi Chawla
Avi Chawla· 2025-07-19 06:37
That's a wrap!If you found it insightful, reshare it with your network.Find me → @_avichawlaEvery day, I share tutorials and insights on DS, ML, LLMs, and RAGs.Avi Chawla (@_avichawla):Andrew Ng's team once made a big mistake in a research paper.And it happened due to randomly splitting the data.Here's what happened: ...
X @Avi Chawla
Avi Chawla· 2025-07-19 06:37
This led to data leakage, and validation scores looked much better than they should have.A few days later, the team updated the paper after using the group shuffle split strategy to ensure the same patients did not end up in both the training and validation sets.Check this 👇 https://t.co/1UM6PRYdQz ...
X @Avi Chawla
Avi Chawla· 2025-07-19 06:36
Andrew Ng's team once made a big mistake in a research paper.And it happened due to randomly splitting the data.Here's what happened: ...
X @Avi Chawla
Avi Chawla· 2025-07-18 19:12
First MCP, then A2A and AG-UI......now ACP is added to the Agent protocol stack. It's a fully open-source protocol by IBM.The thread below gives a detailed walkthrough on ACP (with implementation)👇 https://t.co/efc996DTsaAvi Chawla (@_avichawla):After MCP, A2A, & AG-UI, there's another Agent protocol.It's fully open-source and launched by IBM Research.Here's a complete breakdown (with code): https://t.co/GAbnvOGIdU ...