Mureka TTS V1 - filings, earnings calls, financial reports, news

Zhong Zheng Wang · 2025-10-30 04:39

Core Insights
- The company reported revenue of 5.8 billion yuan for the first three quarters of 2025, a 52% year-on-year increase [1]
- Overseas business revenue reached 5.4 billion yuan, up 58% year-on-year; overseas revenue accounted for 93% of total revenue, an increase of 3.6 percentage points [1]
- The overall gross margin stood at 69.9% [1]
- The company achieved a net profit attributable to shareholders of 190 million yuan in Q3 2025, reversing previous losses [1]

AGI and AIGC Developments
- The company is making steady progress in its AGI and AIGC business, with ongoing technology research and product iteration [1]
- A significant academic breakthrough was achieved, with a paper selected as a Spotlight paper at NeurIPS 2025 [2]
- The company launched the Skywork Deep Research Agent V2, which integrates multi-modal deep research capabilities [2]

AI Video and Music Innovations
- The company introduced the SkyReels-A3 model, enabling high-quality video generation for various applications [3]
- In the music domain, Mureka V7 and Mureka V7.5 were launched, enhancing music generation and voice synthesis capabilities [3]

AI Gaming and Social Applications
- The AI gaming business is progressing well, focusing on content generation and intelligent interaction [4]
- The company’s short drama platform, DramaWave, ranked third among short drama platforms by overseas revenue, with over 4 million downloads in August 2025 [4]

Future Outlook
- The company aims to seize opportunities in the AI era, focusing on technological breakthroughs and deployment in real-world application scenarios [6]
- It plans to deepen the integration of AI technology with information distribution, social entertainment, and content creation [6]

Tencent Research Institute (腾讯研究院) · 2025-07-23 11:14

Group 1: AI Compute Competition
- OpenAI plans to bring 1 million GPUs online by the end of the year, competing against Musk's xAI, which aims to deploy 50 million GPUs over five years, indicating an intensifying compute arms race [1]
- OpenAI is pursuing compute autonomy through self-developed chips, the Stargate project, and collaboration with Microsoft, aiming to shift 75% of its compute sources to the Stargate project by 2030 [1]
- AI capital expenditure in Silicon Valley is expected to reach $360 billion by 2025, equivalent to 2.5 trillion RMB, with leading cloud companies controlling core industry resources [1]

Group 2: Talent Acquisition in AI
- Meta has recruited three Chinese scientists from DeepMind who were involved in the IMO gold medal project: Tianhe Yu, Cosmo Du, and Weiyue Wang, who previously worked on Google's Gemini [2]
- Microsoft has also hired over 20 employees from Google DeepMind in the past six months, including Amar Subramanya, the former VP of engineering for the Gemini chatbot [2]
- Zuckerberg attempted to recruit OpenAI's Chief Research Officer Mark Chen for $1 billion but was unsuccessful, reflecting Meta's aggressive talent acquisition strategy and the establishment of Meta Superintelligence Labs [2]

Group 3: Open Source AI Models
- Alibaba has open-sourced the Qwen3-Coder-480B-A35B-Instruct model, which has 480 billion parameters, supports 256K context, and can output up to 65,000 tokens [3]
- The model is designed for tasks in intelligent programming, browser usage, and tool invocation, competing with both open-source models like Kimi K2 and closed-source models like GPT-4.1 [3]
- Pre-training used 7.5 trillion tokens of data (70% of which was code) and involved reinforcement learning in 20,000 independent environments [3]

Group 4: AI Audio Generation
- Tsinghua University and Shengshu Technology developed FreeAudio, which enables precise, controllable AI audio generation of up to 90 seconds; the research was selected for ACM MM 2025 [4][5]
- FreeAudio employs a training-free method to overcome industry bottlenecks, using an LLM for time planning and generating audio based on non-overlapping time windows [5]
- The system includes Decoupling & Aggregating Attention Control modules and excels at generating audio for tasks of 10 seconds, 26 seconds, and 90 seconds [5]

Group 5: Voice Recognition Technology
- ima has integrated Tencent's self-developed ASR (Automatic Speech Recognition) model, enabling direct voice input, which is now available in its mobile apps [6]
- The mixed ASR model is the first in the industry based on dual encoders and can recognize 300 characters per minute, which is four times faster than manual input [6]
- The voice input feature can be applied in scenarios such as knowledge base Q&A, note-taking, and writing continuation, and iOS users can add desktop widgets for quicker voice queries [6]

Group 6: Music Generation Models
- Kunlun Wanwei launched the Mureka V7 music model, improving the yield rate from 43.4% in V6 to 57.7%, with a 44% improvement in vocal realism and nearly double the overall sound quality [7]
- Mureka V7 uses MusiCoT technology to first generate a global music structure before producing audio, mimicking human creative thought processes [7]
- The company also introduced Mureka TTS V1, a text-to-speech model that lets users customize voice tones from text descriptions, achieving a voice quality score of 4.6 and surpassing ElevenLabs' score of 4.36 [7]

Group 7: Quadruped Robots Market
- Zhiyuan Robotics has launched its first industry-grade small quadruped robot, Zhiyuan D1 Ultra, with a maximum running speed of 3.7 m/s and the ability to jump 35 cm high [8]
- Magic Atom has released a wheeled quadruped robot, MagicDog-W, starting at 75,000 RMB and claimed to be the strongest in its class; both products are set to be showcased at the 2025 World Artificial Intelligence Conference [8]
- The quadruped robot market is growing rapidly, with an estimated market size of 470 million RMB in China for 2023, projected to reach 850 million RMB by 2025, while Yushu Technology (Unitree) currently holds a 60-70% global market share [8]

Group 8: Robotics Safety Concerns
- The American robot fighting champion DeREK, based on the Yushu (Unitree) G1, malfunctioned and entered a walking mode, causing it to "go crazy" and kick surrounding objects [9]
- The emergency braking system failed to respond in time, and the wireless emergency stop device took five seconds to activate; the robot stopped only when the Ethernet cable was disconnected [9]
- Analysis highlighted multiple safety hazards, including difficult access to the battery, powerful motor torque (120-160 Nm), wireless communication unsuited to safety-critical systems, and a lack of redundant safety mechanisms [9]

Group 9: AI Platform Competition
- According to a16z, competition among platforms is shifting from cost and speed to the control of contextual permissions [10]
- Models are becoming the fourth layer of infrastructure in software development, alongside computing, networking, and storage, evolving from "callable components" into central control systems [10]
- The reasoning layer is emerging as a new battleground for system sovereignty, with platforms redefining development paradigms and business models through interface definitions, context management, and task scheduling capabilities [10]

Group 10: ChatGPT Agent Development
- The ChatGPT Agent consists of Deep Research (intelligent agents), Operator (computer operation agents), and other tools, integrated through shared state [11]
- OpenAI uses reinforcement learning to train the Agent, integrating all tools into a virtual machine so that the model can autonomously explore optimal tool combinations without pre-defined usage rules [11]
- The team comprises 20-35 members from research and application teams and has implemented multiple safety measures (real-time monitoring, user confirmation, etc.), with plans to evolve the Agent into a general superintelligent agent [11]

Users surge by nearly 3 million: Chinese AI music tool Mureka gets a major V7 upgrade, and we used it to recreate a viral "Indian hit song"

机器之心 (Jiqizhixin) · 2025-07-23 08:57

Core Viewpoint
- The article discusses the rapid advancement of AI-generated music, focusing on the capabilities of the new music model Mureka V7 developed by Kunlun Wanwei, which significantly surpasses its predecessors and competitors in various performance metrics [6][8][51].

Group 1: Mureka V7 Performance
- Mureka V7 has been released as the strongest Chinese-developed music model, outperforming the overseas AI music platform Suno in key metrics such as average performance rating and overall audio quality [6][8].
- Compared to its predecessor Mureka V6, Mureka V7 shows substantial improvements in music quality, including melody and arrangement, as well as vocal and instrumental realism [7][8].
- The performance metrics for Mureka V7 include an average performance rating of 57.7%, mixing quality of 39.0%, and vocal realism of 70.0% [8].

Group 2: Features and Innovations
- Mureka V7 introduces a feature that lets users upload audio or video links to create songs mimicking specific artists, enhancing personalization in music creation [12][13].
- The model can analyze user-uploaded music to generate original works in similar styles, demonstrating its versatility in music generation [17].
- Mureka V7 can now also generate music videos alongside audio, expanding its creative offerings [20].

Group 3: MusiCoT Technology
- The MusiCoT technology has been optimized in Mureka V7, allowing for a structured approach to music creation that aligns with human creative processes [25][28].
- MusiCoT enables the model to generate music with clear structure and coherence, enhancing the overall quality of the output [29][33].
- The technology has shown superior performance in both subjective and objective evaluations, establishing a new standard in the industry [32][34].

Group 4: Voice Model Development
- Kunlun Wanwei has also introduced Mureka TTS V1, an audio model that allows for customizable voice generation based on user-defined characteristics [39][40].
- This model surpasses competitors in various aspects of voice synthesis, indicating a strong position in the voice generation market [41].
- Mureka TTS V1 can create voices for various applications, including film, gaming, and advertising, broadening its market potential [45].

Group 5: Industry Trends
- The article notes an industry shift towards the commercialization of AI models, with vertical models such as music and video generation becoming the new competitive landscape [47][48].
- Kunlun Wanwei's strategy aligns with this trend, aiming to create a comprehensive ecosystem for AI-generated content across multiple domains [49][50].
- Mureka's growing user base, with nearly 3 million new users since March, highlights its acceptance and impact on music creation [51].