视频生成

Search documents
Veo3逼真脱口秀火爆全网,视频生成的GPT时刻到了吗?
Di Yi Cai Jing· 2025-05-26 03:02
Core Insights - The recent release of Google's video model Veo 3 has generated significant discussion, particularly due to its ability to create realistic characters and scenes, but users express that the technology is not as groundbreaking as some claims suggest [3][4][12] - Veo 3 introduces a native audio generation feature, allowing for simultaneous creation of sound effects and dialogue, marking a shift from previous silent video generation models [4][7] - Despite improvements, industry experts highlight that Veo 3 still has many flaws and is not yet suitable for large-scale commercial production [12][15][17] Group 1: Technology and Features - Veo 3's key innovation is its ability to generate audio alongside video, which enhances the overall production quality and efficiency [4][7] - The model allows for a streamlined workflow where text prompts can generate complete animated videos, including music and voice synchronization [7][15] - Users have reported that while the video quality has improved, it does not meet the high expectations set by earlier versions, and there are still issues with consistency and accuracy [12][14] Group 2: Market Reception and Cost - The cost of using Veo 3 is relatively high, requiring a subscription to Google's AI ultra plan at $249.99 per month, which is more expensive than competing services [16] - Users have noted that the points system for video generation can lead to additional costs, making it less feasible for commercial projects without purchasing extra credits [16][17] - Despite the high costs and existing flaws, some industry professionals see potential in Veo 3 and its associated tools like FLOW for future AI-driven video production workflows [17]
鹅厂开源视频生成大杀器!参考图主体精准复刻,还能编辑现有视频
量子位· 2025-05-09 07:03
克雷西 发自 凹非寺 量子位 | 公众号 QbitAI 人物部分,提示词如下: A woman takes a selfie in a busy city. A woman holds a smartphone in one hand and makes a peace sign with the other. The background is a bustling street scene with various signs and pedestrians. 刚刚,鹅厂开源"自定义"视频生成模型 HunyuanCustom 。 "自定义"主打的就是主体一致性,用一张图片就可以确定视频主角, 其一致性评分达到了开源模型SOTA ,且可和闭源媲美。 这样在构思提示词时,就可以不必纠结主体特征描述了。 HunyuanCustom一共支持单主体参考、多主体参考、局部编辑、角色配音四大功能。 其中 单主体参考已上线并开源,其余也将在本月内开源 。 此外混元的技术人员还在直播中透露,团队正在和开源社区合作, 将适配AI创作者常用的ComfyUI 。 期待所有功能完整上线的同时,不妨先来看看demo效果! 主体一致性 ...
昆仑万维:一季度营收大幅增长46% AI算力芯片取得突破性进展
Zheng Quan Shi Bao Wang· 2025-04-29 02:00
Core Viewpoint - Kunlun Wanwei (300418.SZ) reported a significant revenue growth of 46% year-on-year in Q1 2025, driven by advancements in AI computing chips and applications [1] Group 1: Financial Performance - The company achieved an operating revenue of 1.76 billion yuan in Q1 2025, marking a 46% increase compared to the previous year [1] - R&D expenses reached 430 million yuan, reflecting a 23% year-on-year growth [1] - The annual recurring revenue (ARR) for AI music reached approximately 12 million USD, with a monthly revenue of about 1 million USD [1] - The ARR for the short drama platform Dramawave was approximately 120 million USD, with a monthly revenue of around 10 million USD [1] - Overseas business revenue amounted to 1.67 billion yuan, showing a 56% increase year-on-year, and accounted for 94% of total revenue [1] Group 2: Technological Advancements - The company launched several disruptive technologies in multi-modal reasoning, video generation, and audio generation, achieving state-of-the-art (SOTA) status in various models [2] - The Skywork R1V multi-modal reasoning model reached open-source SOTA, while the SkyReels-V1 model and SkyReels-A1 algorithm led the global video generation field [2] - In the AI music sector, the Mureka V6 and Mureka O1 models demonstrated a competitive edge, with Mureka O1 surpassing competitors in performance [2] Group 3: AI Chip Development - The company made significant progress in the R&D of AI computing chips, moving towards the goal of "Chinese chips, Kunlun manufacturing" [3] - Kunlun Wanwei acquired a controlling stake in Beijing Aijietek Technology Co., Ltd., completing a full industry chain layout from computing infrastructure to AI applications [3] - The R&D team for AI chips has expanded to nearly 200 employees, covering various fields such as chip design and algorithm development [3] Group 4: Future Prospects - The company plans to launch the Skywork.ai platform in mid-May 2025, which will feature a system of five expert-level AI agents for optimizing various professional tasks [3] - The Opera business segment, including overseas information distribution and metaverse operations, saw a revenue increase of 41% driven by Opera Ads [4] - The company aims to continue advancing AI computing chip development and innovate its AI application matrix to provide leading AI product experiences globally [4]