Duplex model

Search documents
Passing the Turing Test w/ ElevenLabs' Mati Staniszewski #ai #nextgenai #machinelearning
Sequoia Capitalยท 2025-07-01 20:46
Goal & Timeline - The company aims to achieve human-like conversational AI, potentially passing the Turing test with an agent, possibly by the end of the year or early 2026 [1][2] - The timeline depends on whether the model will be cascading (speech-to-text-to-speech) or a truly duplex "omni model" [3] Model Architecture - The company is developing both cascading and duplex models, with the cascading model currently in production and the duplex model soon to be deployed [4] - The industry faces a reliability versus expressivity trade-off between the two models [5] Trade-offs & Challenges - The duplex model is expected to be quicker and more expressive but potentially less reliable, while the cascaded model is more reliable and can be extremely expressive but may lack contextual responsiveness [5] - Latency is a significant engineering challenge, especially in fusing modalities of language models with audio [5] - No company has successfully fused language models with audio well, and the company hopes to be the first [5]