Core Insights
- The company is accelerating its "one foundation, two wings" technology strategy amid the rise of intelligent agents, recently launching the "Shan Hai · Zhi Yin" model 2.0 after upgrading the "Shan Hai · Zhi Yi" medical model 5.0 [1]

Group 1: Model Capabilities
- The "Shan Hai · Zhi Yin" model 2.0 focuses on three major capability evolutions: understanding professional and local dialects, expressing warmth and emotional connection, and achieving rapid responsiveness [1]
- In terms of "understanding," the model's ASR (Automatic Speech Recognition) capabilities have demonstrated leading performance in both public test sets and proprietary full-scene test sets, surpassing mainstream domestic open-source and closed-source speech models, reaching the highest industry standards [1]
- For the "expression" aspect, the Shan Hai · Zhi Yin-TTS (Text-to-Speech) features a "highly human-like and creatively diverse" core, currently supporting 12 dialects (including Cantonese, Sichuanese, and Shanghainese) and 10 foreign languages, with the ability to switch between 12 styles of Mandarin [1]

Group 2: Technological Foundation
- The capabilities are underpinned by the company's proprietary "Shan Hai · Atlas" intelligent computing foundation, which deeply integrates a general multimodal model base with the Atlas architecture, serving as the foundation for professional intelligent agents and the core of perceptual AI [2]
Unisound (09678) Launches Shan Hai · Zhi Yin 2.0, Reshaping the Paradigm of Human-Machine Interaction