Zero-shot voice cloning
X @Avi Chawla
Avi Chawla · 2026-03-04 06:48
clone any voice with a 5-second audio clip.

VoxCPM is an open-source project that takes a fundamentally different approach to text-to-speech.

most TTS systems convert speech into discrete tokens. this creates a bottleneck that limits how natural the output can sound.

VoxCPM skips tokenization entirely. it models audio in continuous space using an end-to-end diffusion autoregressive architecture.

the result is speech that actually sounds human.

here's what makes it special:

> context-aware generation: it reads you ...