Mureka v7

Search documents
腾讯研究院AI每周关键词Top50
腾讯研究院· 2025-07-25 10:21
AI前沿每周关键词Top50 (0721-0725 ) 每周50关键词 把握全局AI动态 点击 关键词 可查看资讯概述 点个 "在看" 分享洞见 生数科技 | 应用 | 混元ASR | 腾讯 | | --- | --- | --- | | 应用 | Mureka V7 | 昆仑万维 | | 应用 | Lovable Agent | Lovable | | 应用 | AI同传 | 字节 | | 应用 | Higgs Audio V2 | 李沐团队 | | 应用 | DeepRare | 上交大等 | | 应用 | 罕见病推理 | DeepMind | | 科技c | 星动L7 | 星动纪元 | | 科技 | 四足机器人 | 智元等 | | 科技 | 机器人故障 | DeREK | | 科技 | 进军无人机 | 影石 | | 科技 | 蓝河操作系统 | vivo | | 观点 | AI工具控制权 | Boris Cherny | | 观点 | 复盘 K2 | Kimi | | 观点 | 多智能体发展 | OpenAI | | 观点 | AI Agent投资 | 硅谷101 | | 观点 | 学习闭环 | Eric ...
腾讯研究院AI速递 20250724
腾讯研究院· 2025-07-23 11:14
Group 1: AI Compute Competition - OpenAI plans to launch 1 million GPUs by the end of the year, competing against Musk's xAI which aims to deploy 50 million GPUs over five years, indicating an intensifying compute arms race [1] - OpenAI is pursuing compute autonomy through self-developed chips, the Stargate project, and collaboration with Microsoft, aiming to shift 75% of its compute sources to the Stargate project by 2030 [1] - AI capital expenditure in Silicon Valley is expected to reach $360 billion by 2025, equivalent to 2.5 trillion RMB, with leading cloud companies controlling core industry resources [1] Group 2: Talent Acquisition in AI - Meta has recruited three Chinese scientists from DeepMind who were involved in the IMO gold medal project, including Tianhe Yu, Cosmo Du, and Weiyue Wang, who previously worked on Google's Gemini [2] - Microsoft has also hired over 20 employees from Google DeepMind in the past six months, including the former VP of engineering for the Gemini chatbot, Amar Subramanya [2] - Zuckerberg attempted to recruit OpenAI's Chief Researcher Mark Chen for $1 billion but was unsuccessful, indicating Meta's aggressive talent acquisition strategy and the establishment of Meta Superintelligence Labs [2] Group 3: Open Source AI Models - Alibaba has open-sourced the Qwen3-Coder-480B-A35B-Instruct model, which has 480 billion parameters, supports 256K context, and can output up to 65,000 tokens [3] - The model is designed for tasks in intelligent programming, browser usage, and tool invocation, competing with both open-source models like Kimi K2 and closed-source models like GPT-4.1 [3] - Pre-training utilized 75 trillion tokens of data (70% of which was code) and involved reinforcement learning training in 20,000 independent environments [3] Group 4: AI Audio Generation - Tsinghua University and Shengshu Technology developed FreeAudio, which allows for precise and controllable generation of AI audio for up to 90 seconds, with the research selected for ACM MM 2025 [4][5] - FreeAudio employs a "no training" method to overcome industry bottlenecks, using LLM for time planning and generating audio based on non-overlapping time windows [5] - The system includes Decoupling & Aggregating Attention Control modules and excels in generating audio for tasks of 10 seconds, 26 seconds, and 90 seconds [5] Group 5: Voice Recognition Technology - ima has integrated Tencent's self-developed ASR (Automatic Speech Recognition) model, enabling direct voice input functionality, which is now available on mobile apps [6] - The mixed ASR model is the first in the industry based on dual encoders, capable of recognizing 300 characters per minute, which is four times faster than manual input [6] - This voice input feature can be applied in various scenarios such as knowledge base Q&A, note-taking, and writing continuation, with iOS users able to add desktop widgets for quicker voice queries [6] Group 6: Music Generation Models - Kunlun Wanwei launched the Mureka V7 music model, improving the yield rate from 43.4% in V6 to 57.7%, with a 44% enhancement in vocal realism and nearly double the overall sound quality [7] - Mureka V7 utilizes MusiCoT technology to first generate a global music structure before producing audio, mimicking human creative thought processes [7] - The company also introduced Mureka TTS V1, a text-to-speech model that allows users to customize voice tones based on text descriptions, achieving a voice quality score of 4.6, surpassing Elevenlabs' score of 4.36 [7] Group 7: Quadruped Robots Market - Zhiyuan Robotics has launched its first industry-grade small quadruped robot, Zhiyuan D1 Ultra, with a maximum running speed of 3.7 m/s and the ability to jump 35 cm high [8] - Magic Atom has released a wheeled quadruped robot, MagicDog-W, starting at 75,000 RMB, claiming to be the strongest in its class, with both products set to be showcased at the 2025 World Artificial Intelligence Conference [8] - The quadruped robot market is rapidly growing, with an estimated market size of 470 million RMB in China for 2023, projected to reach 850 million RMB by 2025, while Yushu Technology currently holds a 60-70% global market share [8] Group 8: Robotics Safety Concerns - The American robot fighting champion DeREK, based on Yushu G1, malfunctioned and entered a walking mode, causing it to "go crazy" and kick surrounding objects [9] - The emergency braking system failed to respond in time, and the wireless emergency stop device took five seconds to activate, only stopping when the Ethernet cable was disconnected [9] - Analysis highlighted multiple safety hazards, including difficult access to the battery, powerful motor torque (120-160 Nm), unsuitable wireless communication for safety-critical systems, and a lack of multiple safety mechanisms [9] Group 9: AI Platform Competition - According to a16z, competition among platforms is shifting from cost and speed to the control of contextual permissions [10] - Models are becoming the fourth layer of infrastructure in software development, alongside computing, networking, and storage, evolving from "callable components" to central control systems [10] - The reasoning layer is emerging as a new battleground for system sovereignty, with platforms redefining development paradigms and business models through interface definitions, context management, and task scheduling capabilities [10] Group 10: ChatGPT Agent Development - The ChatGPT Agent consists of Deep Research (intelligent agents), Operator (computer operation agents), and other tools, integrating through shared states [11] - OpenAI employs reinforcement learning to train the Agent, integrating all tools into a virtual machine, allowing the model to autonomously explore optimal tool combinations without pre-defined usage rules [11] - The team comprises 20-35 members from research and application teams, implementing multiple safety measures (real-time monitoring, user confirmation, etc.), with plans to evolve into a general superintelligent agent [11]
用户暴涨近300万,国产AI音乐神器Mureka重磅升级V7,我们拿它复刻了「印度神曲」
机器之心· 2025-07-23 08:57
机器之心原创 机器之心编辑部 AI 正悄悄「攻占」你的歌单。 目前,该歌曲拿下了 15 万小红心。本想看看是哪位大神的作品,没想到底下一水的评论:这是 AI 生成的! 其实细听之下还是能找出「端倪」的,比如音质糊的像画面马赛克、人声跟牙齿漏风似的。但经过持续的进化,AI 音乐越来越真假难辨。 前几天在网易云音乐上瞎逛,被意外种草一首歌,真一开口就是月色迷蒙的味道。 现在,AI 音乐的这把火,越烧越旺。 7 月 23 日, 大模型厂商昆仑万维正式发布了新一代音乐大模型 Mureka V7 ,成为了当前国产最强,并在多个关键指标上显著超越海外 AI 音乐平台 Suno (V4.5),包括平均表现评分、混音质量与质感、人声真实感与表现力、整体音质评价。 不仅如此,与上一版本 V6 相比,Mureka V7 生成的音乐品质更高,不仅大幅提升旋律动机和编曲质量,还进一步增强了人声与乐器真实度。 | Category | Mureka V7 | Mureka V6 | Suno V4.5 | | --- | --- | --- | --- | | Average Performance Rating | 57.7% | ...
手把手教你用最新的AI音乐模型,创造一首属于你自己的歌。
数字生命卡兹克· 2025-07-23 08:43
前两天昆仑万维的朋友来找我吃饭。 跟我说,他们的新音乐模型Mureka v7马上要上线了,他们觉得质量能跟Suno 4.5打一打,而且也是国内为数不多的AI音乐产品,问我要不要一拍即合一 下。 当时吃着饭呢,我筷子抖了一下。 我说AI音乐我现在写的真的非常少,上次写完suno 3.5我就没写了,我都快一年没提过AI音乐了,我感觉拍不了啊。 他们说,你放屁。 然后掏了这篇文章给我看。 说,你前一段时间不刚提吗。 我说...啊。。。 然后他们就非常快速当场的给我开了Mureka v7新音乐模型的内测,开了会员,给了400积分,说,你先试试? 我说行,那试试吧。。。 当天晚上回家后,我看着那个音乐生成的界面,我脑子里真的一团浆糊。 真的,感觉这块的知识,完全丢干净了。 我好像不知道该写一首什么样的歌,或者,该怎么创作一首AI的音乐了。 就这么浆糊着半个小时,我说算了,休息一下吧,于是打开了最近天天摸鱼的时候,最爱看的小说:《十日终焉》 看到即将全员反攻列车,七十年的布局要进入最高潮的时候。 我突然有了灵感。 我就想做一首,《十日终焉》的主题曲。 但是理想很丰满,但是吧,真到上手的时候,发现犯了难了。 我好像,这一年 ...