Conversational AI

Search documents
Realtime Conversational Video with Pipecat and Tavus — Chad Bailey and Brian Johnson, Daily & Tavus
AI Engineer· 2025-06-27 10:30
Core Technology & Products - Tavis offers a conversational video interface, an end-to-end pipeline for conversations with AI replicas, with a response time around 600 milliseconds [9] - Tavis's proprietary models, Sparrow Zero and Raven Zero, are being integrated into Pipecat [10][11] - Pipecat is an open-source framework designed as an orchestration layer for real-time AI, handling input, processing, and output of media [15][18] - Pipecat uses frames, processors, and pipelines to manage data flow, with processors handling frames of audio, video, or voice activity detection [23][24] Strategic Partnership & Integration - Tavis and Pipecat are partnering to enhance conversational AI, leveraging Pipecat's capabilities for real-time observability and control [8] - Enterprise customers are using Pipecat and want to integrate Tavis's technology within it, leading Tavis to move its best models into Pipecat [39] - Tavis is integrating its Phoenix rendering model, turn-taking, response timing, and perception models into Pipecat [39][40] Future Development & Deployment - Tavis is developing a multilingual turn detection model to improve conversational AI speed and prevent interruptions [41] - Tavis is working on a response timing model to adjust response speed based on conversation context [42][43] - Tavis's multimodal perception model will analyze emotions and surroundings to provide more nuanced conversational flow [44] - Pipecat Cloud offers a solution for deploying bots at scale, simplifying the process without requiring Kubernetes expertise [49]
Cerence AI Powers In-Car Experience in Premier German Automaker’s New Electric Sedan
Globenewswire· 2025-06-26 12:00
Core Insights - Cerence Inc. announced the integration of its AI technologies into the next-generation MBUX system, debuting in the all-electric Mercedes-Benz CLA, marking a significant milestone in their collaboration with Mercedes-Benz [1][2] - The MBUX Virtual Assistant will feature advanced conversational capabilities, including a new "living" avatar and enhanced emotional interaction through Cerence's neural text-to-speech technology [2][3] - Cerence AI's solutions support seamless interaction across 25 languages, enhancing the user experience for drivers and passengers [2][5] Company Overview - Cerence Inc. is a leader in creating AI-powered experiences in the automotive sector, with over 500 million cars equipped with its technology [5] - The company focuses on voice interaction technologies, generative AI, and large language models to improve safety and connectivity in vehicles [5] - Cerence is headquartered in Burlington, Massachusetts, and collaborates with major automakers and technology firms to innovate user experiences [5]
How Pigment Built an AI-Powered Business Planning Platform with LangGraph
LangChain· 2025-06-20 15:30
Pigment's Business and Technology - Pigment is an enterprise planning and performance management platform that helps companies build strategic plans and adapt to changing market conditions [1] - Pigment AI consists of conversational AI and autonomous agents that accelerates insight generation and scenario creation across the organization [2] - Pigment's autonomous agents framework allows users to schedule and automate reports and scenario creation, saving hundreds of hours of manual work [3] Challenges with Previous AI Architecture - Linear chain pipelines limited flexibility and made experimentation with agent-based workflows complex and cumbersome [4] - Managing graphs, memory, state transitions, and interruptions for custom agents was too complex [5] - Strong control over tools and agents, simple state management, and asynchronous processing were critical needs for financial use cases [5] Benefits of Long Graph - Long Graph offers graph-based orchestration, long-term memory, streaming, and interrupt capabilities [6] - Graph orchestration is easy to set up, allowing easy definition and tweaking of agent iteration and collaboration [6] - Full visibility and control over message flow between agents enables building reliable and testable logic [7] - Agent topologies can be abstracted into configuration files, enabling rapid prototyping and deployment of new workflows [7] Impact of Long Graph - Reduced time to insight from hours to seconds using natural language search and agent analysis [8] - Faster decision-making by surfacing anomalies and key performance gaps in real time [8] - Users can focus on higher value work by automating routine analysis and planning tasks [9] - Engineering team has more time to experiment and innovate, focusing on higher impact features [9] - Significantly less time is spent implementing key site capabilities like persistent, long-term memory [9]
SoundHound Stock's Lofty Valuation: Still Worth the Price?
ZACKS· 2025-06-19 14:36
Core Insights - SoundHound AI (SOUN) maintains a high valuation with a forward 12-month price-to-sales (P/S) ratio of 20.45, exceeding the industry average of 19.17, reflecting fluctuating investor interest in voice AI [1][2] - The company has established itself in conversational AI, particularly in the automotive and restaurant sectors, but faces caution from investors due to its high valuation and competition from major tech firms [2] Valuation and Stock Performance - SOUN's stock has decreased by 1.7% over the past three months, underperforming the industry gain of 6.3% and the broader technology sector's increase of 11.2% [4] - Currently, SOUN trades at a 62.3% discount to its 52-week high of $24.98, yet remains above its low of $3.82, indicating that investor sentiment is focused on long-term growth rather than immediate earnings [5][7] Growth Drivers - The Polaris platform and the new agentic AI offering, Amelia 7.0, are central to SoundHound's growth strategy, enabling real-time voice recognition and autonomous task execution [9] - SoundHound's voice AI solutions are now implemented in over 13,000 restaurant locations, with a recent partnership with Mastercard enhancing its position in AI-enabled payment flows [10] - The U.S. healthcare market, valued at $4.9 trillion in 2023, presents significant growth opportunities for SoundHound, particularly through its partnership with Allina Health [11] Revenue and Guidance - In Q1 2025, SoundHound reported revenue of $29.1 million, a 151% year-over-year increase, driven by the adoption of its voice AI solutions across various sectors [12] - The company has reaffirmed its 2025 revenue guidance of $157–$177 million, expecting a stronger revenue contribution in the first half of the year [13] Competitive Landscape - SoundHound faces intense competition from major tech companies like Alphabet, Amazon, and Apple, which dominate the AI-powered voice assistant market [15] - The company must differentiate itself by offering more customizable and lightweight solutions to compete effectively against these established players [16] Challenges and Margin Pressures - SoundHound's automotive business has experienced softness due to geopolitical and macroeconomic uncertainties, impacting unit volumes despite rising average selling prices [17] - Integration costs from acquisitions and legacy contracts are exerting pressure on gross margins, although management is working to improve this over the next 18–24 months [18] Market Sentiment and Future Outlook - The Zacks Consensus Estimate for a full-year loss in 2025 remains unchanged, indicating limited near-term upside potential for the stock [19] - Despite recent underperformance, SoundHound's expanding platform, debt-free balance sheet, and reaffirmed profitability guidance by year-end 2025 provide a stable outlook for long-term investors [21]
Is SoundHound Ready to Challenge Big Tech in Automotive AI?
ZACKS· 2025-06-18 16:06
Core Insights - SoundHound AI (SOUN) is emerging as a significant player in the automotive AI sector, traditionally led by major tech companies, with a reported revenue of $29.1 million in Q1 2025, reflecting a 151% year-over-year increase driven by growth in restaurant and automotive voice AI solutions [1][10] Group 1: Company Developments - SoundHound is expanding its voice commerce capabilities, allowing drivers to perform tasks such as ordering food and booking parking hands-free, which is attracting attention from automakers [2] - The company has over 13,000 restaurant locations utilizing its system and is conducting multiple large OEM pilots, indicating a scalable voice ecosystem [3] - The launch of Amelia 7.0 enhances SoundHound's offerings by enabling AI agents to perform complex tasks autonomously [3] Group 2: Financial Performance - Despite slightly missing revenue expectations and facing margin pressures from recent acquisitions, SoundHound maintains its full-year revenue guidance of $157–$177 million and aims for profitability by year-end [4] - SOUN's Q1 revenue growth of 151% is attributed to advancements in its voice AI platforms, Polaris and Amelia 7.0, which enhance in-car voice capabilities [10] Group 3: Competitive Landscape - SoundHound faces competition from well-funded rivals like Alphabet Inc. (GOOGL) and Aurora Innovation (AUR), with GOOGL leveraging its Android Automotive OS and deep ecosystem integration [5][6] - Aurora Innovation focuses on autonomous driving and human-machine interaction, aligning with SoundHound's goals for seamless in-vehicle voice experiences [7] - SoundHound's specialization in end-to-end conversational AI and rapid deployment across OEMs provides it with a differentiated edge in the competitive landscape [8] Group 4: Market Performance and Valuation - SOUN's stock has declined by 5% over the past three months, underperforming the Zacks Computers - IT Services industry, which rose by 3.3% [9] - The company's forward 12-month price-to-sales (P/S) ratio stands at 20.29, slightly above the industry's 19.26 [12]
The way you program an AI is like the way you program a person, says Nvidia's Huang
CNBC· 2025-06-09 10:09
Core Insights - Nvidia CEO Jensen Huang describes artificial intelligence as the "great equalizer," enabling programming through everyday language rather than traditional programming languages [1][3] - Conversational AI models, such as OpenAI's ChatGPT, have gained significant traction, with ChatGPT reporting 400 million weekly active users as of February 2023 [2] - Huang emphasizes that programming AI is akin to programming a human, making it accessible to a broader audience [3][4] Group 1 - AI technology allows users to interact with computers in a conversational manner, making it easier for non-programmers to generate content and perform tasks [3][4] - Companies like Shopify, Duolingo, and Fiverr are encouraging employees to integrate AI into their workflows, reflecting a growing trend in the industry [4][5] - OpenAI has reported 3 million paying business users, indicating a strong demand for AI solutions in the business sector [4] Group 2 - Huang advocates for the adoption of AI to enhance workplace efficiency, countering fears of job displacement due to automation [5][6] - The new method of interacting with computers is seen as transformative, particularly for younger generations who are naturally engaging with AI [6]
Innodata vs. SoundHound: Which AI Stock Has More Upside Potential?
ZACKS· 2025-06-02 17:21
Core Insights - The article highlights two companies, Innodata Inc. (INOD) and SoundHound AI (SOUN), that are finding success in the AI sector despite being less prominent than larger competitors [1] - Both companies are experiencing significant revenue growth and forming partnerships with major industry players, but they present different risk-reward profiles [1] Innodata (INOD) - Innodata specializes in data engineering and AI model assurance, serving major tech clients including Microsoft, Alphabet, and Amazon, which are expected to invest billions in generative AI infrastructure by 2025 [3][4] - In 2024, Innodata's revenues nearly doubled to $170.5 million, with adjusted EBITDA surging 250% to $34.6 million; Q1 2025 saw revenues increase 120% year-over-year to $58.3 million [4] - The company launched a Generative AI Test & Evaluation Platform in partnership with Nvidia, addressing enterprise concerns about AI safety and bias [5] - Innodata has a strong financial position with $56.6 million in cash and no debt, allowing for investment in growth [6] - A significant risk is customer concentration, with 48% of revenues in 2024 coming from a single client; management is working to diversify this through new contracts [7] - The company targets revenue growth of over 40% for 2025, indicating a scalable business model [8] SoundHound AI (SOUN) - SoundHound focuses on conversational AI, achieving a record 151% year-over-year revenue increase in Q1 2025, reaching $29.1 million, driven by acquisitions and partnerships [9][10] - The company has expanded its customer base through acquisitions, with expected contributions of $45 million in recurring revenues from Amelia in 2025 [11] - SoundHound's diversified customer base mitigates risk, with no single customer accounting for more than 10% of revenues [12] - Financially, SoundHound has $246 million in cash and no debt, but it remains unprofitable, reporting a $22.2 million adjusted EBITDA loss in Q1 2025 [12][13] - The company faces margin pressure, with GAAP gross margins declining from 59.7% to 36.5% year-over-year due to acquisition costs [13] Comparative Analysis - Analysts have maintained a steady outlook for SOUN's earnings, while sentiment for INOD has turned more bearish recently [14] - For 2025, INOD's sales are expected to grow by 41.76%, while SOUN's sales are projected to increase by 91.07% [15] - INOD's stock has seen a slight decline of 0.1% this year, while SOUN has dropped 49%, although SOUN has recently bounced back by 10% [17] - INOD trades at a forward price-to-sales multiple of 4.77X, while SOUN's multiple is significantly higher at 22.23X [18] Conclusion - Both companies hold a Zacks Rank 3 (Hold), making the choice between them challenging [20] - SoundHound is seen as a strong growth story in voice AI, but faces challenges with profitability and competition [21] - Innodata, while less visible, offers a balanced growth profile with strong profitability metrics and deep integration with major tech players [22]
Agora(API) - 2025 Q1 - Earnings Call Transcript
2025-05-28 02:02
Financial Data and Key Metrics Changes - Total revenue for Q1 reached $33.3 million, up 12% year over year, excluding revenues from certain low-margin products, indicating a recovery in business momentum [3][11] - GAAP net profit for the quarter was $400,000, representing a 1.2% net income margin, a significant turnaround from a net loss margin of 28.7% in Q1 last year [16][17] - Operating cash flow was $17.6 million in Q1 compared to negative $6.5 million last year, reflecting improved financial health [17] Business Line Data and Key Metrics Changes - Core revenues reached $18.6 million in Q1, growing 17.7% year over year, driven by market expansion in high-potential verticals like live shopping and entertainment [12] - Shunlong revenues were RMB 105.5 million in Q1, with a year-over-year growth of 6.7%, although it declined 13.7% sequentially due to seasonal factors [12][13] - The net retention rate improved to 96% for Agora and 85% for Shunlong, indicating better customer retention [13] Market Data and Key Metrics Changes - The growth rate in the U.S. and global markets is around 18% to 20%, with increasing adoption of live video-based shopping and entertainment apps, particularly in North America and Europe [28][30] - Demand recovery is noted in Asian markets, including India, for both education and entertainment use cases [30] Company Strategy and Development Direction - The company is focusing on conversational AI, particularly in education, IoT, and customer service sectors, which are seen as key growth drivers [25][26] - The strategy includes maximizing penetration in live commerce and improving video quality for better customer experiences [35] - The company aims to maintain profitability while investing in future growth opportunities, including share repurchase programs [18][19] Management's Comments on Operating Environment and Future Outlook - Management expressed confidence in maintaining GAAP profitability for the remainder of the year, supported by current business momentum [4][17] - The competitive landscape in China is consolidating, with fewer players remaining, which may enhance the company's market position [46][48] Other Important Information - The company launched a Conversational AI Device Kit, enabling manufacturers to integrate conversational AI into IoT devices, which could reduce R&D costs and time to market [7] - The open-source project for building real-time conversational AI agents is gaining traction, with significant support from leading cloud providers [8] Q&A Session Summary Question: What are the key areas for AI application? - Management highlighted education, IoT, and customer service as the most active use cases for AI applications [25][26] Question: What is the demand trend for China and overseas business? - Management noted a solid demand recovery in the U.S. and global markets, with stable pricing trends in developed markets [28][30] Question: What is the strategy for overseas e-commerce platforms? - The company is focusing on maximizing penetration in live commerce and improving video quality for better user experiences [35] Question: What is the timing for massive adoption of conversational AI? - Management indicated that the tipping point for adoption will depend on the maturity of product-market fit across various verticals [40][42] Question: How is the competitive landscape in China affecting pricing trends? - The competitive landscape is consolidating, with fewer players, and the company is focusing on higher value use cases to maintain margins [46][49]
Agora(API) - 2025 Q1 - Earnings Call Transcript
2025-05-28 02:00
Financial Data and Key Metrics Changes - Total revenue in Q1 2025 reached $33.3 million, up 12% year over year, excluding revenues from certain low-margin products [3][11] - GAAP net profit for the quarter more than doubled from the previous quarter, marking the second consecutive quarter of GAAP profitability [4][15] - Operating cash flow was $17.6 million in Q1, compared to negative $6.5 million last year [16] - Gross margin for Q1 was 68%, with an increase of 0.6% year over year and 1.4% quarter over quarter [13] Business Line Data and Key Metrics Changes - Core revenues reached $18.6 million in Q1, growing 17.7% year over year [12] - Shunlong revenues were RMB 105.5 million in Q1, with a year-over-year growth of 6.7% [12] - The base net retention rate improved to 96% for Agora and 85% for Shunlong [12] Market Data and Key Metrics Changes - Growth rate in the US and global markets is around 18% to 20%, with increasing adoption of live video-based shopping and entertainment apps, particularly in North America and Europe [26][28] - Demand recovery is observed in Asian markets, including India, for both education and entertainment use cases [28] Company Strategy and Development Direction - The company is focusing on conversational AI, particularly in education, IoT, and customer service sectors [24][25] - The launch of the Conversational AI Device Kit aims to enable device manufacturers to integrate conversational AI into various IoT devices [7] - The company is committed to maintaining profitability while investing in future growth opportunities through share repurchase programs [17][18] Management's Comments on Operating Environment and Future Outlook - Management expressed confidence in maintaining GAAP profitability throughout 2025, supported by current business momentum [4][16] - The company anticipates a revenue range of $33 million to $35 million for Q2 2025, reflecting a year-over-year growth rate of 6.8% to 13.3% [18] Other Important Information - The conversational AI engine launched in March is expected to unlock innovation across multiple verticals, particularly in education [6] - The company has seen record-high product registrations and inquiries following recent product launches [9] Q&A Session Summary Question: AI demand and future growth drivers - Management highlighted key use cases for AI applications, including education, IoT, and customer service [24][25] Question: Breakdown of China and overseas business demand trends - Management noted a growth rate of about 18% in the US and global markets, with demand recovery in Asian markets [26][28] Question: Overseas e-commerce business strategy - Management discussed ongoing efforts to penetrate the live commerce space and improve video quality for better customer experience [33] Question: Shifts in downstream demand for AI-powered interaction capabilities - Management emphasized the importance of product market fit for various verticals to achieve growth [40][41] Question: Timing for massive adoption of conversational AI - Management indicated that the tipping point for adoption will depend on the maturity of product market fit across different use cases [40][41] Question: Competitive landscape in China and pricing trends - Management acknowledged the competitive nature of the Chinese market but noted a trend towards consolidation among major players [44][46]
Agora(API) - 2025 Q1 - Earnings Call Presentation
2025-05-27 22:49
Business Highlights - Agora launched a Conversational AI Engine, an enterprise-grade solution for human-AI voice experiences[5] - The company offers a ConvoAI Device Kit, a turnkey solution for adding voice AI to any device, combining software, cloud services, and chips from Beken[8, 10] - TEN (a voice activities detection) continues gaining traction and adds VAD and Turn Detection, outperforming existing open-source alternatives[11, 14] - Agora's Conversational AI Extension is available on the Dify Marketplace[15] Customer Base & Revenue - Agora had 3,800 active customers in March 2025, including 1,994 Agora customers and 1,806 Shengwang customers[35] - Total revenues for Q1 2025 were $33.3 million, with a year-over-year growth of 12.1%[39] - Agora revenues in Q1 2025 were $18.6 million[42] - Shengwang revenues in Q1 2025 were ¥105.5 million, equivalent to $14.7 million[43] Financial Performance - The dollar-based net retention rate for Agora was 96% and for Shengwang was 85% in March 2025[46] - Gross profit for Q1 2025 was $22.6 million, resulting in a gross margin of 68.0%[50] - Loss from operations in Q1 2025 was $3.7 million, representing a loss from operations margin of (11.1%) [57] Share Repurchase Program - As of March 31, 2025, Agora repurchased 33.0 million ADSs for approximately $116.4 million, representing 58% of the $200 million share repurchase program[63]