Workflow
Mureka V7.5模型
icon
Search documents
人工智能龙头“开花结果”:昆仑万维发布多款前沿模型,厚积薄发迎商业收获期
Mei Ri Jing Ji Xin Wen· 2025-08-15 12:45
Core Insights - Kunlun Wanwei is experiencing a critical window for technological and commercial advancement in the rapidly accelerating global AI industry [1] - The company has launched six cutting-edge models during the SkyWork AI Technology Release Week, showcasing its long-term R&D investments translating into market competitiveness [1][7] - In 2024, Kunlun Wanwei's R&D expenses reached 1.54 billion yuan, a year-on-year increase of 59.5%, reflecting ongoing investments in AI computing chips, large models, and applications [1][13] R&D and Technological Advancements - The Mureka V7.5 model, launched on August 15, is a significant milestone in Kunlun Wanwei's AI commercialization efforts, generating over $12 million in annual revenue by March 2025 [2][3] - The Mureka V7.5 model features a breakthrough in music audio understanding, capable of accurately capturing the essence of various Chinese music styles [3][4] - The MoE-TTS framework, a novel voice synthesis technology, integrates pre-trained large language models with voice expert modules, achieving superior performance in generating natural-sounding speech [4][6] Product Development and Applications - The SkyReels-A3 model enables audio-driven video generation, while the Matrix-Game 2.0 model offers real-time interactive generation capabilities, enhancing user experience in various applications [7][9] - The Matrix-3D model allows for high-quality panoramic video generation from single images, revolutionizing content production in gaming, film, and architecture [9] - Skywork UniPic 2.0 addresses challenges in multi-modal generation, providing a unified model for efficient content creation [10] Business Strategy and Market Position - Kunlun Wanwei's strategy of "All in AGI and AIGC" is evident in its substantial R&D investments, which are expected to continue into 2025 with a projected increase of 23.4% [13] - The company has transitioned from a "technology exploration phase" to a "commercial harvest phase," with a stable global monthly active user base of nearly 400 million and overseas revenue accounting for 91% [14] - The dual model of driving business through technology and using commercial success to reinvest in R&D is positioning Kunlun Wanwei to build a trillion-level ecosystem in the AI industry [14]
昆仑万维Mureka V7.5模型上线 AI音乐创作水平再迎新高度
Core Insights - Kunlun Wanwei Technology Co., Ltd. has launched the SkyWorkAI technology release week from August 11 to August 15, introducing a new model each day, culminating in the release of the Mureka V7.5 model on August 15 [1] Group 1: Model Releases - The company has released several models during the event, including SkyReels-A3, Matrix-Game2.0, Matrix-3D, SkyworkUniPic2.0, and SkyworkDeepResearchAgent [1] - Mureka V7.5 significantly enhances the performance of Chinese songs, improving both the tonal quality and emotional expression [1] Group 2: Technical Innovations - Mureka's understanding model has a deep comprehension of various Chinese music styles, allowing for accurate representation of artistic essence and emotional nuances in music generation [1] - The company has optimized ASR technology to enhance the authenticity and emotional depth of generated vocals, focusing on micro-level singing details such as breath control and emotional fluctuations [2] - The MoE-TTS framework, the first of its kind based on MOE, combines pre-trained large language model capabilities with specialized speech expert modules, ensuring independent optimization of text and speech [2]
昆仑万维:Mureka V7.5模型正式上线 AI音乐创作水平再迎新高度
Core Insights - Kunlun Wanwei officially launched the Mureka V7.5 model on August 15, enhancing the performance of Chinese song interpretation significantly [2] - The Mureka V7.5 model demonstrates a deep understanding of various Chinese music styles, allowing for accurate emotional and artistic expression in generated music [2] - The company also introduced MoE-TTS, a novel speech synthesis framework that combines pre-trained large language model capabilities with specialized speech expert modules [3] Group 1 - Mureka V7.5 has improved the timbre and performance techniques of Chinese songs, as well as the articulation and emotional expression [2] - The model's deep accumulation of knowledge regarding Chinese music diversity enables it to convey unique artistic essence and emotional nuances [2] - The ASR technology has been optimized to enhance the authenticity and emotional depth of vocal performances in generated music [2] Group 2 - MoE-TTS innovatively integrates pre-trained large language model text capabilities with speech expert modules, ensuring independent optimization of each modality [3] - The release of MoE-TTS provides a reproducible open descriptive TTS solution for academia and demonstrates the potential of decoupled modalities and knowledge freezing in speech synthesis [3] - Future plans for MoE-TTS include integration into the Mureka-Speech platform, offering customizable descriptive speech synthesis capabilities for global developers and creators [3]