Large Speech Recognition Models


Zheng Quan Shi Bao Wang· 2025-08-22 08:33

Core Viewpoint
- Alibaba Tongyi has launched Fun-ASR, a new end-to-end speech recognition model with stronger contextual awareness and high-accuracy transcription across a range of industries [1]

Group 1: Model Features
- Fun-ASR has improved speech recognition accuracy by more than 15% in multiple industry scenarios, including home decoration and insurance [1]
- The model is currently used for meeting subtitles, simultaneous interpretation, smart minutes, and voice assistants [1]

Group 2: Future Developments
- Fun-ASR is set to be further integrated into Alibaba Cloud's Bai Lian platform [1]

DingTalk and Tongyi Laboratory Release Fun-ASR Large Speech Recognition Model, Supporting Enterprise-Specific Custom Model Training

Xin Lang Ke Ji· 2025-08-22 05:21

Core Insights
- DingTalk and Tongyi Laboratory have jointly launched Fun-ASR, a new speech recognition model that understands industry-specific jargon across ten sectors, including home decoration and animal husbandry [1][2]
- Fun-ASR has been integrated into DingTalk features such as meeting subtitles, simultaneous interpretation, smart minutes, and voice assistants [1]

Technical Highlights
- Fun-ASR recognizes industry-specific vocabulary more accurately; it was trained on over one hundred million hours of audio data and co-developed using real scenarios from DingTalk's clients across multiple industries [1]
- The model has improved contextual awareness and understanding: it draws on information already available within DingTalk, such as contact lists and schedules, to optimize inference and provide reliable transcription results [1]
- For enterprises with advanced needs, Fun-ASR supports customized speech recognition model training, with algorithms optimized on real-scenario voice data supplied by the company [1]

Future Plans
- The head of Tongyi Laboratory's voice team expressed excitement about the partnership with DingTalk and said the aim is to expand Fun-ASR's data and model scale so that voice intelligence solutions become more replicable for enterprise clients [2]
- DingTalk's CTO highlighted Fun-ASR's rapid development within three months of collaboration, noting that it has won recognition from leading clients and marks a significant breakthrough towards industry leadership [2]