AI Music Generation
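Google Gemini Learns to Compose Music from Images, and Your WeChat Moments Can Have Their Own BGM Too
QbitAI · 2026-02-19 07:03
By Kelexi, reporting from Aofeisi. QbitAI | WeChat official account QbitAI

Gemini has just reinvented itself as a professional singer-songwriter. Google has built its latest Lyria 3 model into Gemini: just ask in the chat box, and Gemini will assemble a band for you on the spot.

The feature is designed so that anyone can use it. Type in a piece of free-wheeling text, or simply send over a photo you just took, and within seconds it turns your idea into a complete piece with lyrics, melody, and even sung vocals. The whole process is strikingly fast.

It also brings in the Nano Banana model as a helper: as soon as the track is generated, a stylistically matching album cover is produced along with it. In short, the steps between having an idea and getting your own BGM with cover art have been reduced to a minimum.

Users commented that the 48 kHz stereo quality, together with the ability to generate music from photos, shows that DeepMind is paying close attention to creative workflows in this release.

Your photos can now sing

On hard specifications, Lyria 3's audio sampling rate reaches a high-fidelity 48 kHz. This gives the generated tracks real body, with every instrument note sounding solid, and that audio foundation is what gives the image-to-song feature room to shine.

Upload a photo from a forest hike, and the AI instantly picks up on the tranquility and pairs it with a fitting folk tune, giving a still landscape a voice of its own. Now your WeChat Moments can also ...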
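To put the 48 kHz stereo figure in concrete terms, the arithmetic can be sketched in a few lines of Python. The sample rate and channel count come from the article; the 16-bit depth and the 30-second clip length are assumptions chosen only for illustration, since the article does not state them.

```python
def nyquist_hz(sample_rate_hz: int) -> float:
    """Highest frequency a given sample rate can represent (half the rate)."""
    return sample_rate_hz / 2


def pcm_bytes(sample_rate_hz: int, channels: int, bit_depth: int, seconds: int) -> int:
    """Size in bytes of uncompressed PCM audio with the given parameters."""
    bytes_per_sample = bit_depth // 8
    return sample_rate_hz * channels * bytes_per_sample * seconds


SAMPLE_RATE_HZ = 48_000  # from the article
CHANNELS = 2             # stereo, from the article
BIT_DEPTH = 16           # assumption: not stated in the article
CLIP_SECONDS = 30        # assumption: illustrative clip length

print(nyquist_hz(SAMPLE_RATE_HZ))                                   # 24000.0
print(pcm_bytes(SAMPLE_RATE_HZ, CHANNELS, BIT_DEPTH, CLIP_SECONDS))  # 5760000
```

Under these assumptions, a 48 kHz signal can represent frequencies up to 24 kHz, and a 30-second uncompressed stereo clip occupies about 5.76 MB.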
Free Level Officially Launches Yin Chao V3.0
Core Insights
- Free Level, a domestic music AI startup, has launched V3.0 of its music model "Yin Chao". The release is a full technical reconstruction of the "encoding-generating-decoding" process and marks a significant breakthrough in both musicality and quality [1][2].

Industry Trends
- The AI music sector is predicted to move from "wild growth" to "professional advancement" by early 2026, as multiple domestic and international music models complete technological upgrades and evolve from merely generating music to creating music that is emotionally resonant and enjoyable [1].
- Competition in AI music generation is shifting away from simply increasing parameter scale. The focus is now on enabling machines to understand the physical space and emotional logic behind sounds [1].

Product Features
- Yin Chao V3.0 significantly improves singing quality through dual-track modeling and multi-stage reinforcement learning. It handles intricate singing techniques such as humming, pitch bending, and breathy tones, and is more emotionally expressive [1].
- V3.0 makes melodies more pleasant and memorable, aiming to give each song a "highlight" or hook and addressing the common complaint that AI-generated music is forgettable [1][2].
- Arrangement diversity and overall musicality are significantly enhanced, with better instrument timbre selection and smoother transitions between sections, plus independent modeling of phase and reverb to recreate the physical texture of sound [2].

Company Philosophy
- Free Level CTO Jiang Tao emphasized that the model is developed to lower the barriers to music creation, so that technological advances deliver "music equity" for ordinary people rather than merely serving as a cold tool for professional production [2].
17.4 Billion Yuan: Jensen Huang Backs Another AI Unicorn
36Kr · 2025-11-20 07:38
Core Insights
- Suno, an AI music creation platform, announced a $250 million Series C funding round, achieving a valuation of $2.45 billion, nearly five times its valuation of $500 million from May last year [1][3].
- The company reported an annual revenue of $200 million [1].

Funding and Valuation
- The recent funding round was led by Menlo Ventures, with participation from NVentures, Hallwood Media, Lightspeed Venture Partners, and Matrix [1].
- Suno's valuation increased from $500 million to $2.45 billion within a year, reflecting significant growth in investor confidence and market potential [1][3].

Product Development
- Suno specializes in AI music generation, having developed a model capable of creating complete songs from text descriptions, including vocals, lyrics, arrangement, and mixing [3].
- The latest version, v5, claims to be the best music model globally, addressing issues with AI-generated vocals and achieving a 90% success rate in executing complex instructions [3].
- Suno also launched Suno Studio, a generative audio workstation that integrates professional multi-track editing with generative AI [3].

Team Background
- The founding team has a strong background in AI and physics, with members having previously worked at Kensho, a Cambridge AI finance company [4][5].

Industry Context and Legal Issues
- The AI music platform sector has faced controversies and legal challenges, with major record labels suing Suno and another platform, Udio, for allegedly using copyrighted music to train their models [7].
- However, Universal Music recently reached a settlement with Udio, indicating a potential shift towards collaboration in the industry [7][8].

Market Trends
- AI music generation is lowering barriers for creators and gaining acceptance among the public, with AI-generated music increasingly appearing on Billboard charts [8].
- The collaboration between Universal Music and Udio exemplifies a trend towards resolving copyright issues in the AI music space [8].
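The "nearly five times" claim can be checked with simple arithmetic. The short Python sketch below uses only the dollar figures quoted in the summary; the function and variable names are illustrative.

```python
def growth_multiple(new_value: float, old_value: float) -> float:
    """Return how many times larger new_value is than old_value."""
    if old_value <= 0:
        raise ValueError("old_value must be positive")
    return new_value / old_value


# Figures from the summary, in millions of US dollars.
PREVIOUS_VALUATION_M = 500
CURRENT_VALUATION_M = 2_450
SERIES_C_ROUND_M = 250

multiple = growth_multiple(CURRENT_VALUATION_M, PREVIOUS_VALUATION_M)
round_share = SERIES_C_ROUND_M / CURRENT_VALUATION_M

print(f"{multiple:.1f}x")     # 4.9x
print(f"{round_share:.1%}")   # 10.2%
```

The multiple works out to 4.9, consistent with "nearly five times", and the $250 million round corresponds to roughly 10% of the stated valuation.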
Four Finance Guys Go Into Music and Make Over $100 Million in a Year
Huxiu APP · 2025-11-06 13:17
Core Viewpoint
- Suno is emerging as a revolutionary AI music generation platform, significantly lowering the barriers to music creation and attracting substantial investment, indicating a strong market potential in the AI music sector [3][12][34].

Company Overview
- Suno was founded in 2022 by a team of four with backgrounds in finance and technology, aiming to leverage AI for music creation [23][27].
- The company has raised a total of $125 million in funding since its inception, with a recent round of $100 million expected to increase its valuation to $2 billion, four times its previous valuation [12][33].
- Suno's user base has rapidly grown to over 12 million, achieving an annual recurring revenue (ARR) exceeding $100 million within a year of launching its first product [13][12].

Product Features
- Suno's platform allows users to generate complete songs, including lyrics, vocals, and instrumentation, from simple text prompts, making music creation accessible to non-professionals [6][15].
- The introduction of the Suno Studio desktop application enhances its functionality, allowing users to edit and mix tracks like a professional digital audio workstation [18][19].
- The latest model, V5, has significantly improved sound quality and realism, making AI-generated vocals nearly indistinguishable from real human voices [16][19].

Market Dynamics
- The global digital music industry is expanding, with the music streaming market exceeding $26 billion in 2023, and AI-generated music is rapidly gaining traction, accounting for 28% of daily new releases [32][34].
- The AI music generation sector is still in its early stages, presenting a "blue ocean" market opportunity, although competition is intensifying with new entrants and established tech giants exploring this space [34][35].

Competitive Landscape
- Suno faces competition from other AI music platforms like Udio and Boomy, which also focus on user-generated content but have different operational models [36][35].
- The company is currently involved in legal challenges regarding copyright issues, as major record labels have accused it of using copyrighted material without authorization [37][30].

Future Outlook
- The ongoing negotiations with major record labels for licensing agreements suggest a potential shift towards collaboration between traditional music companies and AI platforms, indicating a new phase in the music industry [37][30].
We Made a Bold Decision: All Conference Background Music Will Be AI-Generated, So That Budget Can Be Saved! | Jinqiu Scan
Jinqiu Ji · 2025-11-03 08:13
Core Viewpoint
- The article discusses the first CEO annual conference organized by Jinqiu Fund, themed "Experience with AI," focusing on the intersection of technology, capital, and creativity in the AI era [1].

Group 1: Event Overview
- The conference aims to explore not just AI itself but how technology, capital, and creativity can interact in the AI age [1].
- The event is designed to be a genuine space for understanding, utilizing, and experiencing AI [1].

Group 2: Music Generation with AI
- Seven representative AI music generation products were evaluated, including Suno, ElevenLabs, and Udio, with Suno being selected for the conference music due to its high success rate [4][5][6].
- The music requirements included creating entrance music for guests based on their company and personal situations, as well as warm-up music suitable for the conference theme [7][8].

Group 3: Music Production Process
- The production process involved using ChatGPT to generate prompts for music creation, which were then used with Suno to produce suitable music [10][12].
- Different styles of warm-up music were created based on the agenda and desired atmosphere, with 10-20 tracks prepared for each segment [20][21].

Group 4: AI Music Generation Insights
- AI can generate melodies and mimic styles but lacks deep semantic understanding, making it challenging to create emotionally resonant music [26].
- The effectiveness of AI music generation heavily relies on the precision of prompts, which can be a challenge for those unfamiliar with music [27][28].

Group 5: Future Directions
- The company plans to explore a more systematic and intelligent approach to music generation in the future, potentially integrating multiple AI models for different styles [30].
- There is an aspiration to create a conference theme song that meets the satisfaction of all team members and to experiment with real-time emotional feedback for music generation [30].
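Since the summary stresses that results depend on prompt precision, one way to make prompts more systematic is to fill in a fixed set of musical fields and render them as text. The Python sketch below is only an illustration of that idea: the `MusicBrief` class and its field names are hypothetical, it does not call ChatGPT or Suno, and it is not the workflow the conference team describes.

```python
from dataclasses import dataclass, field


@dataclass
class MusicBrief:
    """Hypothetical structured description of a track, rendered to a text prompt."""
    purpose: str
    genre: str
    mood: str
    tempo_bpm: int
    instruments: list[str] = field(default_factory=list)
    vocals: bool = False

    def to_prompt(self) -> str:
        # Spell out each musical attribute so nothing is left implicit.
        parts = [
            f"{self.genre} track for {self.purpose}",
            f"mood: {self.mood}",
            f"tempo: about {self.tempo_bpm} BPM",
        ]
        if self.instruments:
            parts.append("instruments: " + ", ".join(self.instruments))
        parts.append("with vocals" if self.vocals else "instrumental, no vocals")
        return "; ".join(parts)


brief = MusicBrief(
    purpose="conference warm-up",
    genre="electronic",
    mood="upbeat and optimistic",
    tempo_bpm=110,
    instruments=["synth pads", "light percussion"],
)
print(brief.to_prompt())
```

Making fields such as tempo and instrumentation explicit means someone unfamiliar with music terminology only has to choose values rather than compose a free-form description.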
OpenAI Moves Into Music Models!
Zhitong Finance · 2025-10-26 03:46
Core Insights
- OpenAI is developing an AI music model in collaboration with students from the Juilliard School, aiming to enhance its AI ecosystem and user engagement [1].
- The music model will allow users to generate music based on text and audio prompts, potentially transforming content creation for platforms like TikTok [1].
- OpenAI's previous music models, MuseNet and Jukebox, have not been integrated into its current offerings due to technical limitations [2].

Group 1: OpenAI's Music Model Development
- OpenAI is actively working on an AI music model, collaborating with Juilliard students for music score annotation [1].
- The model aims to generate music for various applications, including adding guitar accompaniments to existing vocal tracks [1].
- OpenAI currently has over 800 million active users, and the music model is expected to enhance user retention [1].

Group 2: Competitive Landscape in AI Music
- The AI music generation sector is becoming a competitive focus, with advancements in computing power and model architecture [3].
- Google has launched its second-generation music production model, Lyria, which aligns with OpenAI's commercial direction [3].
- Startups like Suno and Udio have successfully commercialized their AI music products, with Suno achieving an annual recurring revenue of $150 million, a nearly fourfold increase from the previous year [3].

Group 3: Emerging Players in AI Music
- Chinese companies are rapidly developing AI music models, with ByteDance's Seed-Music and Alibaba's InspireMusic leading the charge [3].
- Kunlun Wanwei has released the Mureka O1 model, which surpasses Suno V4 in multiple performance metrics [3].
- Tencent AI Lab has introduced the SongGeneration model, focusing on improving sound quality, musicality, and generation speed [3][4].
MiniMax Launches Next-Generation Music Generation Model That Can Produce Full Songs: A Reporter's Hands-On Test
Xin Jing Bao · 2025-09-14 06:41
Core Insights
- MiniMax has launched its new music generation model, Music 1.5, which significantly improves music generation duration, control precision, and arrangement performance [1][4].
- The model can generate songs up to 4 minutes long, producing complete tracks rather than just demo samples [4].

Group 1: Model Capabilities
- Music 1.5 can create songs with a duration of up to 4 minutes, addressing the previous limitation of AI-generated music typically lasting only seconds to a minute [3][4].
- Users can generate high-quality songs by inputting just a few keywords or a natural language description, with advanced options for defining different lyrical sections [4].

Group 2: Performance and Quality
- The model has been tested to produce a complete song titled "September Huangpu," which, despite minor flaws, includes all essential components of a full track [1][3].
- Music 1.5 features deep modeling of vocal techniques, allowing for the generation of diverse vocal tones and styles, resulting in a more natural and emotionally expressive sound [4].
Kunlun Wanwei Officially Launches Mureka V7
Zheng Quan Ri Bao Wang · 2025-07-23 12:40
Core Insights
- Kunlun Wanwei officially launched its latest music model, Mureka V7, and a new audio model, Mureka TTS V1. The launch follows the well-received Mureka O1 and Mureka V6 models, which have brought in nearly 3 million new registered users since their release in March [1][2].

Group 1: Mureka V7 Features
- Mureka V7 lets users generate a song simply by entering lyrics or selecting a style or theme, significantly streamlining the music creation process [1].
- The model improves melodic motifs and arrangement quality, makes vocals and instruments sound more realistic, and increases musical innovation [1].
- Mureka V7 incorporates a significantly optimized version of MusiCoT, a technology designed for music generation that guides the model to create a global music structure plan before generating audio tokens [1].

Group 2: MusiCoT Technology
- MusiCoT builds a musical chain of thought with clear semantic direction by incorporating CLAP. It accepts reference audio as a flexible style prompt while avoiding the risk of direct copying [2].
- The technology has demonstrated superior performance on both subjective and objective metrics, outperforming traditional methods in structural integrity, melodic coherence, and overall musicality, and reaching industry-leading standards [2].
- MusiCoT also enhances the controllability and scalability of music creation, bridging the gap between the text and audio modalities [2].

Group 3: Mureka TTS V1 Introduction
- Mureka TTS V1 introduces VoiceDesign capabilities, enabling users to obtain a voice with the desired vocal characteristics by describing it in text [2].
- The model further aligns the creative reasoning behind music with its structure, enhancing the overall creative and industrial capabilities of AI music generation [2].
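The "plan the global structure first, then generate audio tokens" idea attributed to MusiCoT can be illustrated with a toy two-stage pipeline. The Python sketch below is not Mureka's implementation: the section template, the token counts, and the use of random integers as stand-ins for audio tokens are all assumptions made purely to show the ordering of the two stages.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Section:
    name: str
    bars: int


def plan_structure(with_bridge: bool = False) -> list[Section]:
    """Stage 1 (toy): fix the whole song layout before any tokens exist.

    Assumption: a hard-coded template stands in for a learned planner.
    """
    plan = [Section("intro", 4), Section("verse", 8), Section("chorus", 8)]
    if with_bridge:
        plan.append(Section("bridge", 4))
    plan += [Section("verse", 8), Section("chorus", 8), Section("outro", 4)]
    return plan


def generate_tokens(
    plan: list[Section],
    tokens_per_bar: int = 4,
    vocab_size: int = 1024,
    seed: int = 0,
) -> list[tuple[str, list[int]]]:
    """Stage 2 (toy): emit placeholder tokens section by section, following the plan.

    Random integers stand in for audio tokens; a real system would use a model.
    """
    rng = random.Random(seed)
    return [
        (s.name, [rng.randrange(vocab_size) for _ in range(s.bars * tokens_per_bar)])
        for s in plan
    ]


plan = plan_structure()
song = generate_tokens(plan)
print([name for name, _ in song])
print(sum(len(tokens) for _, tokens in song))
```

Because the plan is fixed before generation starts, the total length and section order are known up front, which is the property the summary credits for better structural integrity.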