声网对话式AI引擎

Search documents
声网母公司2025年Q2财报:单季度净利润超千万,超一季度3倍以上
IPO早知道· 2025-08-19 01:50
Core Viewpoint - Agora, Inc. has achieved GAAP profitability for three consecutive quarters, indicating strong revenue growth and operational efficiency improvements [3]. Financial Performance - In Q2 2025, Agora, Inc. reported total revenue of $34.26 million, a year-over-year increase of 11.0%, excluding low-margin businesses terminated since Q3 2024 [3]. - The company provided guidance for Q3 revenue in the range of $34.00 million to $36.00 million, representing a year-over-year growth of 7.6% to 13.9% compared to Q3 2024 [3]. - The net profit for Q2 2025 was $1.46 million (approximately 10.5 million RMB), which is over three times the profit from Q1 2025 [3]. Cash Reserves - As of June 30, 2025, Agora, Inc. had cash, cash equivalents, bank deposits, and bank wealth management products totaling $377 million [4]. AI Product Development - Since the launch of the conversational AI engine in March 2025, Agora has collaborated with clients to develop voice dialogue agents in various scenarios, including call centers and AI companion hardware [4]. - The company has upgraded its conversational AI engine to include features such as voiceprint recognition, digital humans, and visual understanding, enhancing the audio-visual interaction experience [6][9]. Technological Advancements - The voiceprint recognition feature allows the AI to accurately identify user voice characteristics, effectively filtering out 95% of background noise for improved dialogue accuracy [8]. - The digital human interaction feature enables real-time, lifelike conversations with highly realistic digital avatars, suitable for virtual customer service, educational companionship, and social entertainment [8]. - The visual understanding capability allows the AI to interpret visual cues and respond intelligently to user gestures and environmental objects, expanding the potential for human-AI collaboration [8]. Industry Applications - Agora's conversational AI capabilities are being applied across various sectors, including AI assistants, AI companion robots, and multi-modal AI agents [11]. - The company has seen successful implementations in platforms like MiniMax, which leverages advanced multi-modal AIGC technology for real-time voice interactions [11]. - Other applications include educational robots like Miko3, which can engage in natural conversations with children and recognize their emotions [12]. Future Outlook - Agora aims to deepen its innovation in scenarios and technology iterations, enhancing real-time interaction experiences to integrate AI into various industries effectively [13].
WAIC现场最“聪明”展台!AI对话眼睛耳朵能力全打开
量子位· 2025-07-28 06:42
Core Viewpoint - The article highlights the advancements in Agora's conversational AI engine, showcasing its new features that enhance real-time interaction and user experience in various applications [4][5][31]. Group 1: Upgrades of the Conversational AI Engine - The upgraded conversational AI engine includes a selective attention locking feature that allows it to accurately capture user commands in noisy environments, filtering out 95% of background noise [12][16]. - The engine now has visual understanding capabilities, enabling it to recognize and interpret images in real-time, enhancing its contextual awareness during interactions [18][23]. - Integration with mainstream digital human solutions allows for more human-like interactions, where digital avatars can express emotions and gestures, making conversations feel more natural [25][30]. Group 2: Applications and Market Position - The conversational AI engine has been successfully implemented across various sectors, including education and smart hardware, demonstrating its versatility and reliability [38][44]. - Agora's long-standing expertise in Real-Time Engagement (RTE) technology positions it favorably in the growing market for multimodal AI interactions, which combine audio and visual inputs [49][50]. - The focus on user experience rather than just technical specifications is expected to enhance the competitive edge of Agora's products in the evolving AI landscape [51][52].
声网发布对话式AI引擎:让任意大模型开口说话
36氪· 2025-03-07 09:37
Core Viewpoint - The article highlights the launch of Agora's conversational AI engine, which enables any text-based large model to be upgraded into a conversational multimodal model, emphasizing affordability and efficiency in AI voice interaction [2][4]. Group 1: Product Features - The conversational AI engine supports a wide range of large model providers, including DeepSeek and ChatGPT, allowing developers to choose freely [4]. - It features low latency with a median voice conversation delay of 650ms and an intelligent interruption technology that allows for responses as low as 340ms [5]. - The engine can filter out 95% of environmental noise, ensuring accurate voice recognition, and maintains stable conversations even under poor network conditions [5]. Group 2: Development and Cost Efficiency - Developers can deploy the AI engine with just two lines of code in about 15 minutes, significantly lowering the development barrier [6]. - The cost for AI voice interaction is set at 0.098 yuan per minute, with an initial bonus of 1000 minutes for new users [7]. - Average conversation costs are calculated to be around 0.03 yuan per interaction, making it highly economical for frequent use [8]. Group 3: Application Scenarios - The conversational AI engine can be utilized in various applications such as smart assistants, virtual companionship, language practice, customer service, and smart hardware [10]. - It enhances the functionality of smart devices by enabling voice control and personalized services, applicable in AI toys, educational hardware, and home assistants [10].
2行代码与DeepSeek语音对话,1分钟不到一毛钱,所有大模型都能开口说话
量子位· 2025-03-07 07:12
Core Viewpoint - The article discusses the launch of Agora's conversational AI engine, DeepSeek, which offers low-latency, real-time voice interaction capabilities at an extremely low cost, making it accessible for developers to integrate AI into applications [1][4][17]. Pricing and Cost Efficiency - The cost of using the AI engine is remarkably low at 0.098 yuan per minute, with an initial offer of 1000 free minutes for new users [3][5]. - Average conversation length is approximately 21.1 seconds, resulting in a cost of only 0.03 yuan per interaction, leading to a monthly cost of less than 0.5 yuan for 15 interactions [5]. Technical Performance - The engine achieves a median response latency of 650 milliseconds, significantly below the 1.7 seconds threshold for natural conversation [7][8]. - It supports interruption of responses with a low latency of 340 milliseconds, mimicking human conversation dynamics [9]. - The engine can filter out 95% of background noise, ensuring high-quality voice recognition even in noisy environments [9]. Network and Compatibility - Agora has established over 200 data centers globally, allowing for stable connections even in poor network conditions, with the ability to maintain communication despite 80% packet loss [10]. - The engine is compatible with various large models, including DeepSeek and ChatGPT, and supports over 30,000 device types, ensuring broad accessibility [10][16]. Developer Accessibility - The integration process for developers is simplified to just two lines of code, allowing for deployment of a conversational AI agent within 15 minutes [11][12]. - Developers can easily switch between different underlying models and voice synthesis providers without altering the front-end logic [13][14]. New Service Model - The launch of the conversational AI engine signifies the emergence of a "voice interaction as a service" model, decoupling RTC technology from large model development [17][18]. - Agora positions itself as a middleware provider in the AI voice interaction ecosystem, facilitating the integration of RTC technology into various AI applications [19][21].