X @Avi Chawla
Avi Chawlaยท2025-10-23 06:30
Fine-tuning LLM Agents without Fine-tuning LLMs!Imagine improving your AI agent's performance from experience without ever touching the model weights.It's just like how humans remember past episodes and learn from them.That's precisely what Memento does.The core concept:Instead of updating LLM weights, Memento learns from experiences using memory.It reframes continual learning as memory-based online reinforcement learning over a memory-augmented MDP.Think of it as giving your agent a notebook to remember wh ...