Agora (声网) Launches Conversational AI Engine: Letting Any Large Model Speak
Agora (US:API) · 36Kr · 2025-03-07 09:37

Core Viewpoint
- The article covers the launch of Agora's conversational AI engine, which lets any text-based large model be upgraded into a conversational multimodal model, with an emphasis on affordable and efficient AI voice interaction [2][4].

Group 1: Product Features
- The conversational AI engine works with a wide range of large model providers, including DeepSeek and ChatGPT, so developers can choose freely [4].
- It offers low latency, with a median voice conversation delay of 650ms, and intelligent interruption handling that responds in as little as 340ms [5].
- The engine filters out 95% of environmental noise to keep voice recognition accurate, and it keeps conversations stable even under poor network conditions [5].

Group 2: Development and Cost Efficiency
- Developers can deploy the AI engine with just two lines of code in about 15 minutes, significantly lowering the development barrier [6].
- AI voice interaction is priced at 0.098 yuan per minute, and new users receive an initial bonus of 1000 minutes [7].
- The average cost works out to about 0.03 yuan per interaction, making it economical for frequent use [8].

Group 3: Application Scenarios
- The conversational AI engine can be used in applications such as smart assistants, virtual companionship, language practice, customer service, and smart hardware [10].
- It extends smart devices with voice control and personalized services, and applies to AI toys, educational hardware, and home assistants [10].
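The pricing figures reported under Group 2 can be checked with a little arithmetic. The sketch below is illustrative only: the rate (0.098 yuan per minute), the 1000 bonus minutes, and the 0.03 yuan average per interaction come from the article, while the function names and the assumption that bonus minutes are simply deducted before billing are hypothetical and not taken from Agora's documentation.

```python
from decimal import Decimal

# Figures quoted in the article.
PRICE_PER_MINUTE = Decimal("0.098")         # yuan per minute of AI voice interaction
BONUS_MINUTES = Decimal("1000")             # initial bonus for new users
AVG_COST_PER_INTERACTION = Decimal("0.03")  # yuan per interaction


def implied_interaction_seconds() -> Decimal:
    """Average interaction length implied by the quoted per-minute rate."""
    minutes = AVG_COST_PER_INTERACTION / PRICE_PER_MINUTE
    return minutes * 60


def billable_cost(minutes_used) -> Decimal:
    """Cost in yuan for a new user.

    Assumption (not from the article): bonus minutes are consumed first
    and only the remainder is billed at the flat per-minute rate.
    """
    billable = max(Decimal(minutes_used) - BONUS_MINUTES, Decimal(0))
    return billable * PRICE_PER_MINUTE


if __name__ == "__main__":
    print(f"Implied average interaction: {implied_interaction_seconds():.1f} s")
    print(f"Cost for 1500 minutes: {billable_cost(1500)} yuan")
```

Under these assumptions, an average of 0.03 yuan per interaction corresponds to roughly 18 seconds of conversation at the quoted rate.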