Workflow
Why Synthetic Data Is Overrated
20VC with Harry Stebbingsยท2025-08-07 05:00

So I think synthetic data is actually really useful in some places, but I think people overestimate what it can do. I'll give a couple examples. So right now there are a bunch of models that have been trained really heavily on synthetic data, but like I mentioned earlier, it means that they're only good at very academic homework style benchmark style problems.They're actually terrible at real world use cases. So yeah, synthetic data, it's made models good at synthetic problems, not real ones. And we actuall ...