Veo3

Search documents
新手实测8款AI文生视频模型:谁能拍广告,谁只是凑热闹
锦秋集· 2025-08-26 12:33
过 去半年,AI 视频模型的迭代速度令人惊叹。变身、特写、运镜、特效轮番上阵,一秒出片、人人导演,似乎已触手可 及。 但回到真实使用场景,问题就出现了: 这些效果,普通用户真能复现吗? 模型越来越多,工具五花八门,到底该怎么选? 毕竟不是谁都在拍科幻大片。 大多数用户真正需要的,可能只是一段叙事清晰、动作合理、画面流畅的视频。 所以这次,我们只问关心一个问题: 这些模型,能不能在实际应用层面解决真正的问题? 为此,我们找来了 年轻的新人内容"创作者" 。他们并不是AI视频的资深用户,也不是视频制作达人,甚至很多人还刚刚接触AI和视频制作。 他们中,有人希望能快速生成一个清晰的画面,用来呈现创意、完成提案;有人希望能直接产出一个能用的成片,省掉拍摄、剪辑的繁琐流程。 这正是许多AI视频产品声称可以解决的部分。 于是我们在完成了 音乐生成工具测评 、 PPT制作工具测评 等测评后,设计了这次测评任务:不炫技、不堆术语,只聚焦日常工作中实际的视频内容场景。 虽然当前应用端,"图生视频"(Image-to-Video)依然是主流使用方式,但为了更全面评估模型在语义理解、动作链组织、镜头语言构建等核心能力上的差异,我们 ...
AI视频生成新品实测:这怎么不算影院级呢?
量子位· 2025-08-25 15:47
不圆 发自 凹非寺 量子位 | 公众号 QbitAI 百度最新视频生成模型 蒸汽机2.0 (MuseSteamer 2.0),好像真的有点东西。 这是在网上热传的一段由它生成的视频,可以说是要声音有声音,要画面有画面,不说的话还以为是某部重生剧的先导片。 AI配音的中文非常自然,和角色口型也对得很好。 我们也试着生成了一个小视频,仅用1张图片和1段提示词,就做出了这样的效果: 仔细听,这只猫甚至会呼噜噜,远处还有虫子叫。 网友评价:这简直像魔法一样! 它要怎么用才会更好玩?又能用来做什么呢? 我们实测了这款模型,一起来看它的具体表现。 模型表现 该说不说,作为全球首个 中文 音视频一体化生成的I2V模型,蒸汽机模型在中文语音的表现上可以说是手拿把攥,但这是蒸汽机1.0模型刚出 的时候就已经介绍的东西。 作为升级版本,蒸汽机2.0更加擅长 复杂运镜 ,用镜头讲故事的能力也更强,画质进一步提升。 让我们看看,作为普通人能用这个模型实现什么想法? 它的表现 和爆火的Veo3相比 ,哪个更好呢? 画画人狂喜:绘画转视频 我们让豆包生成了一张手绘风格的图片,画面上是一只大野兔蹲在草丛里。 就假装它是我们画出来的吧 (手 ...
X @Demis Hassabis
Demis Hassabis· 2025-08-23 21:05
RT Google Gemini App (@GeminiApp)This weekend only, everyone gets 3 free #Veo3 video generations from Gemini. To help you make the most of it, we’ve pulled together a few tips from our team so you can get better outputs for your prompts.Check it out ⬇️ ...
GoogleI/OConnectChina2025:智能体加持,开发效率与全球化双提升
Haitong Securities International· 2025-08-22 06:30
Investment Rating - The report does not explicitly provide an investment rating for the industry or specific companies discussed Core Insights - The Google I/O Connect China 2025 event highlighted advancements in AI model innovation, developer tool upgrades, and the globalization of the ecosystem, particularly focusing on the Gemini 2.5 series and the Gemma open model series [1][16] - Gemini 2.5 architecture enhances multimodal and reasoning capabilities, achieving unified embeddings and cross-modal attention across various modalities, significantly improving understanding and generation accuracy [2][17] - Gemma offers openness and extensibility, allowing developers to fine-tune models for specific domains such as healthcare and education, with derivative models showcasing broad applicability [3][18] - AI-driven development tools have been integrated into core workflows, enhancing productivity through features like task decomposition and code synthesis in Firebase Studio, and semantic code analysis in Chrome DevTools [4][19] - Generative content models, including Lyria, Veo3, and Imagen 4, are designed to strengthen the creative ecosystem, particularly for content-focused teams looking to expand globally [4][20] Summary by Sections AI Model Innovation - The Gemini 2.5 series features enhanced cross-modal processing and faster response times, improving the overall efficiency of AI applications [1][16] - The architecture integrates Chain-of-Thought reasoning and structured reasoning modules, enhancing logical consistency and multi-step reasoning performance [2][17] Developer Tool Upgrades - Firebase Studio's agent mode allows for automatic prototype generation from natural language prompts, while Android Studio introduces BYOM (Bring Your Own Model) for flexible model selection [4][19] - Chrome DevTools now includes a Gemini assistant for semantic code analysis and automatic fixes, significantly improving front-end debugging efficiency [4][19] Global Expansion of AI Ecosystem - The report emphasizes the appeal of Google's generative multimedia models for content creation, particularly in enhancing productivity for short-video production, e-commerce marketing, and game exports [4][20]
X @Demis Hassabis
Demis Hassabis· 2025-08-19 03:12
An incredible 100 million videos (!) have been made by creators using Veo3 in the Flow tool https://t.co/QgTpxTKAOi! Google AI Ultra subscribers, enjoy the 2x credits. Check out the new channel @FlowbyGoogle to keep up with the latest.Google Labs (@GoogleLabs):You have generated over 100M videos in Flow 🤯. We are SO grateful for your continued enthusiasm + support. As a token of our appreciation, here are two updates:1.) AI credits are DOUBLED for all Ultra users2.) We're launching @FlowbyGoogle, your new s ...
X @Demis Hassabis
Demis Hassabis· 2025-08-19 02:49
Product Usage & User Appreciation - Filmmakers have created an incredible 100 million videos using Veo3 in the Flow tool [1] - Google expresses gratitude for users' continued enthusiasm and support for Flow [1] Updates & Incentives for Ultra Users - AI credits are doubled for all Ultra users [1] - Google AI Ultra subscribers enjoy 2x credits [1] New Resources & Communication Channel - Launching @FlowbyGoogle, a new source for Flow tips [1] - A new channel @FlowbyGoogle is available to keep up with the latest [1]
高盛最新人形机器人报告:聚焦2025WRC 产品迭代速度远超数月前!
智通财经网· 2025-08-12 13:36
Core Insights - Goldman Sachs participated in the 2025 World Robot Conference, engaging with nine humanoid robot companies, indicating a strong interest in the sector and potential investment opportunities [1] Industry Trends - The World Robot Conference attracted significantly higher consumer visitor traffic compared to the 2025 Shanghai World Artificial Intelligence Conference, suggesting strong short-term demand potential in education, companionship, and entertainment sectors [2] - The introduction of new products like R1 and SA02, priced at 39,900 yuan and 38,500 yuan respectively, reflects a trend towards more practical and affordable robotic solutions [2] - There is a growing emphasis on the efficiency of entire robotic systems rather than individual robot performance, with companies reporting success rates of 80-99.5% in specific manufacturing tasks [7] Technological Developments - The industry is expected to transition to end-to-end operational models within 1-2 years, contingent on acquiring sufficient high-quality training data [3] - Google's new AI video generation model, Veo3, has sparked discussions about its potential superiority over existing AI frameworks in robotics, highlighting the rapid evolution of software and hardware in the sector [4] - NVIDIA showcased advancements in Physical AI and general robotics, indicating a strong focus on cloud-scale model training and real-time edge AI deployment [5] Market Dynamics - Government subsidies for robot purchases during events like the "Yizhuang Robot Consumption Festival" are likely to stimulate consumer and enterprise demand [8] - The introduction of low-cost robotic products and sales subsidies may create short-term sales opportunities for uncovered humanoid robot component companies [9] Company Insights - Goldman Sachs maintains a "Buy" rating for Sanhua Intelligent Controls, projecting a 19% CAGR in revenue and net profit from 2025 to 2030, driven by its strong positioning in the actuator assembly sector [10] - Best is expected to capture a 10% share of the global high-spec humanoid robot PRS market by 2027, with a projected CAGR of 80% in global high-spec humanoid robot shipments from 2024 to 2035 [12][13] - Linyun Optical is anticipated to see a 73% CAGR in its humanoid robot motion capture business from 2025 to 2030, contributing significantly to overall revenue and profit [17]
X @Demis Hassabis
Demis Hassabis· 2025-08-10 12:07
Model Performance - Veo3 is recognized as the best video model globally [1] - Veo3 ranks 1 in the Video Arena Leaderboard for Text-to-Video models with audio [1] - Veo3 and Veo3-fast also hold the 3 position in the Text-to-Video rankings [1] Competitive Landscape - Other notable Text-to-Video models include Hailuo 02 [Standard], Seedance 1.0 pro, Kling 2.1 Master, and Wan 2.2 [1] - Hailuo 02 [Standard] and Seedance 1.0 pro share the 5 ranking [1] - Kling 2.1 Master is ranked 6 [1] - Wan 2.2 is ranked 9 [1] Community Engagement - The Video Arena Leaderboard is based on over 14,000 community votes [1]
X @Demis Hassabis
Demis Hassabis· 2025-08-09 01:38
Industry Recognition - Video Arena Leaderboard showcases rankings of Text-to-Video and Image-to-Video models based on over 14,000 community votes [1] - Google DeepMind, Hailuo AI, Bytedance, Kling AI, Alibaba Wan, Pika Labs, and Genmo AI are recognized for their achievements in Text-to-Video technology [2] Text-to-Video Model Rankings - Veo3 (with audio) ranks 1 in Text-to-Video [2] - Hailuo 02 [Standard] and Seedance 1.0 pro rank 5 [2] - Kling 2.1 Master ranks 6 [2] - Wan 2.2 A14B ranks 9 [2] - Pika 2.2 and Mochi 1 rank 11 [2]
美国科技“三巨头”,这次赚麻了
3 6 Ke· 2025-08-03 23:17
Group 1: Core Insights - The emergence of ChatGPT has initiated a significant AI competition among major tech companies, leading to substantial profit growth after heavy capital expenditures [1][4] - Major companies like Google, Microsoft, and Meta reported impressive earnings, with Google achieving $96.428 billion in revenue (up 13.8%) and $28.196 billion in net profit (up 19.4%) in Q2 [1][5] - Microsoft reported $76.44 billion in revenue (up 18%) and $27.2 billion in net profit (up 24%) for Q4, with its intelligent cloud business revenue reaching $29.88 billion (up 26%) [1][5] - Meta's Q2 revenue was $47.52 billion (up 22%) with a net profit of $18.34 billion (up 36%) [1][5] Group 2: Capital Expenditure Trends - Companies are significantly increasing their AI investments, with Google planning $85 billion in capital expenditures for 2025, a $10 billion increase from previous estimates [2][3] - Microsoft anticipates over $30 billion in capital expenditures for Q1 of FY2026, a more than 50% increase from prior expectations [2][3] - Meta's capital expenditure plan for the year is between $66 billion and $72 billion, with expectations for significant growth in 2026 [2][3] Group 3: AI Infrastructure and Talent - The increase in capital expenditure is primarily aimed at AI infrastructure, including servers, networks, and data centers, as companies face a shortage of AI computing power [3] - Meta is also focusing on talent acquisition, with CEO Mark Zuckerberg emphasizing the importance of hiring skilled personnel to support AI initiatives [3] Group 4: Monetization of AI - AI is beginning to generate revenue, with Google's Gemini application reaching 450 million monthly active users and a 50% increase in daily usage [4] - Microsoft reported that its Azure and other cloud services generated over $75 billion in revenue (up 34%) for FY2025, with 100 million monthly active users for its Copilot series [5] - Meta's operating profit margin reached 43% in Q2, largely due to AI efficiencies in its advertising system [5] Group 5: Competitive Landscape and Future Outlook - The AI investment landscape is characterized by a "FOMO" (fear of missing out) mentality among tech giants, with companies feeling pressured to invest heavily to maintain competitive positions [7][8] - Analysts note that the AI sector is witnessing a "Matthew effect," where leading companies accumulate advantages that make it increasingly difficult for newcomers to compete [9] - Major tech companies are projected to invest over $350 billion in AI infrastructure this year, with expectations to exceed $400 billion by 2026 [10]