Core Insights - Cartesia, a voice AI company, has recently launched its new voice model Sonic-3 and completed a $100 million Series B funding round, with NVIDIA among the investors [1][3][12] Company Overview - Cartesia was founded by Karan Goel, a talented individual from Stanford AI Lab, who has previously excelled in the field of state space models (SSM) [2][10] - The company has a strong academic foundation, with its core team primarily composed of members from Stanford AI Lab, including co-founder Albert Gu, a notable figure in the development of the Mamba architecture [3][4] Product Development - Cartesia has rapidly progressed since its inception, launching its first product, the Sonic voice model, shortly after securing seed funding. The company has since released multiple iterations, including Sonic-2.0 and the latest Sonic-3 [6][12] - Sonic-3 features significant upgrades, including improved emotional expression and faster response times, with a latency of only 90 milliseconds and an end-to-end response time of 190 milliseconds, making it one of the fastest voice generation systems available [8][12] Technology Differentiation - Unlike traditional voice AI models that rely on Transformer architecture, Sonic-3 is built on SSM, allowing for more natural and context-aware interactions without the need to revisit the entire conversation history [8][12] - This innovative approach enhances the model's ability to capture emotional nuances and respond more fluidly, positioning Cartesia as a leader in real-time voice AI technology [8][12] Market Context - The voice AI sector is witnessing significant advancements, with other companies like MiniMax also launching competitive products, indicating a growing market for voice models that can handle diverse languages and accents [14]
黄仁勋投了家复刻马斯克声音的AI公司
Sou Hu Cai Jing·2025-11-03 04:14