AI多人有声剧自动化方案 - filings, earnings calls, financial reports, news

AI多人有声剧自动化方案

Search documents

Shang Hai Zheng Quan Bao· 2025-10-28 11:34

"AI多人有声剧"自动化方案支持从小说文本到完整成品有声剧的全自动生产。该方案可以自动进行角色划分，准确率超过98%，同时，其语音大模型通过对海量文本与语音的多模态预训练，原生地将文本和语音模态融合，引入思维链信息，具备强大的文本理解能力和语音演绎能力，多人演播效果发音自然、情感丰富。此外，方案中的画本预测模型在多角色演播音频基础上，实现了从小说文本到带有音效、人声特效、环境音、配乐的画本预测，在得到画本信息之后进行音频召回并合成、智能动态调整音频参数，并结合多角色TTS最终合成"有声剧"成品。目前，首批通过"AI多人有声剧"方案端到端创作的作品已经在番茄小说App上线，效果超出预期，并得到书友良好反馈，为听书行业注入全新活力。未来，"AI多人有声剧"方案仍将不断升级，覆盖更多有声内容，小说更新即可让用户同步享受精品有声剧。来源：上海证券报·中国证券网上证报中国证券网讯（记者罗茂林）近日，豆包语音团队发布"AI多人有声剧"自动化方案。方案支持多角色、高表现力的TTS（语音合成）演播，同时，实现了全自动AI后期链路，从小说文本到高质量的多人有声剧成品，全部由AI端到端完成。据了解， ...

Xin Lang Ke Ji· 2025-10-28 08:23

新浪科技讯 10月28日下午消息，豆包语音团队发布了"AI多人有声剧"自动化方案。方案支持多角色、高表现力的TTS（语音合成）演播，同时实现了全自动AI后期的链路，从小说文本到高质量的多人有声剧成品，全部由AI端到端完成。目前，首批通过"AI多人有声剧"方案端到端创作的作品已经在番茄小说 APP上线。责任编辑：何俊熹 ...

小说一键转有声剧！豆包语音团队提出「AI多人有声剧」方案，沉浸感拉满了

机器之心· 2025-10-27 10:40

Core Viewpoint - The article discusses the advancements in AI-generated audio dramas, specifically highlighting the "AI Multi-Character Audio Drama" automation solution developed by Doubao Voice, which significantly reduces production costs and time while achieving high-quality audio outputs [3][5][13]. Group 1: AI Multi-Character Audio Drama Solution - The "AI Multi-Character Audio Drama" solution automates the entire process from novel text to high-quality audio drama, utilizing the upgraded multi-character Seed-TTS-2.0 model, which supports multi-role, expressive TTS performances [3][5]. - This solution allows for a drastic reduction in production costs and timelines, as traditional audio drama production typically takes several months and involves multiple manual steps [5][12]. - The automation includes features such as intelligent sound effects, music, and mixing, which enhance the overall listening experience and make it comparable to professional human-produced audio dramas [3][8]. Group 2: Technical Innovations - The solution boasts over 98% accuracy in character voice matching and dialogue attribution, thanks to its advanced text and voice integration capabilities [8][10]. - Key innovations include chapter-level context awareness, historical long audio modeling, and multi-turn reasoning, which improve the understanding of characters and emotions, resulting in a more immersive listening experience [10][12]. - The system also features automated predictions for voice effects, action sounds, environmental sounds, and music, ensuring a cohesive and engaging audio experience that aligns with the narrative [12][13].

多角色Seed - TTS - 2.0模型

多角色Seed - TTS - 2.0模型