Workflow
Flow
X @Demis Hassabis
Demis Hassabis· 2025-08-19 03:12
An incredible 100 million videos (!) have been made by creators using Veo3 in the Flow tool https://t.co/QgTpxTKAOi! Google AI Ultra subscribers, enjoy the 2x credits. Check out the new channel @FlowbyGoogle to keep up with the latest.
Google Labs (@GoogleLabs): You have generated over 100M videos in Flow 🤯. We are SO grateful for your continued enthusiasm + support. As a token of our appreciation, here are two updates: 1.) AI credits are DOUBLED for all Ultra users 2.) We're launching @FlowbyGoogle, your new s ...
X @Demis Hassabis
Demis Hassabis· 2025-08-19 02:49
An incredible 100 million videos (!) have been created by filmmakers using Veo3 in the Flow tool https://t.co/QgTpxTKAOi! Google AI Ultra subscribers, enjoy the 2x credits. Check out the new channel @FlowbyGoogle to keep up with the latest.
Google Labs (@GoogleLabs): You have generated over 100M videos in Flow 🤯. We are SO grateful for your continued enthusiasm + support. As a token of our appreciation, here are two updates: 1.) AI credits are DOUBLED for all Ultra users 2.) We're launching @FlowbyGoogle, your ...
AI Voice Moves from "Output" to "Input": What Is Capital Betting Tens of Millions of Dollars On?
36Kr· 2025-07-30 03:09
Core Insights
- Recent funding rounds for voice input startups Willow Voice and Wispr Flow indicate a growing interest in automatic speech recognition (ASR) technology, which focuses on voice input rather than voice synthesis [1][2]
- The funding amounts are $4.2 million for Willow Voice and $30 million for Wispr Flow, highlighting a shift in investor focus towards voice input solutions [1]
- The competitive landscape includes established players like ElevenLabs, which raised $250 million in January 2023, emphasizing the potential for innovation in the voice input sector [1]
Group 1: Company Overview
- Willow Voice and Wispr Flow specialize in ASR technology, offering products that function similarly to "voice input methods" for converting speech to text [2]
- Both companies aim to enhance user experience by minimizing the need for manual editing of transcribed text, targeting professional environments where efficiency is crucial [6][24]
- Flow's user base includes venture capitalists, entrepreneurs, and professionals who require efficient text input solutions, particularly in non-office settings [9][11]
Group 2: Product Features and Performance
- Flow and Willow's products incorporate a three-layer text processing approach: formatting text output, understanding context, and recognizing different writing styles based on the input scenario [5][6]
- Initial tests show that while Flow and Willow perform better than OpenAI's Whisper in formatting and context understanding, they still fall short of achieving a "zero-edit" output in professional contexts [19][20]
- User feedback indicates that Flow excels in less formal input scenarios, suggesting a potential for broader application as ASR technology evolves [22][24]
Group 3: Market Trends and Future Potential
- The significant user retention rate of 80% and a 19% paid user rate for Flow suggest a strong market demand for voice input solutions that enhance productivity [20][24]
- As ASR technology continues to improve, there is a possibility that voice input could replace traditional keyboard input, transforming human-computer interaction [24]
- Investors are likely motivated by the dual potential of immediate efficiency gains and the long-term disruption of existing input paradigms [24]
X @Demis Hassabis
Demis Hassabis· 2025-07-25 15:05
Product Innovation
- Google Labs discovered a new trick in Flow: drawing prompts instead of writing them [1]
- Flow can understand drawings and incorporate them into the final video using Frames to Video [1]
Usage Instructions
- Users can draw on an image using any editing app [1]
- Users should briefly describe what needs to happen in the prompt (e.g., "changes happen instantly") [1]
Call to Action
- Google Labs encourages users to try the new feature and share their findings [1]
The Great Voyage
Google DeepMind· 2025-07-16 14:23
Watch a short 3-minute film made with our AI models by our in-house creative team, inspired by the age of Victorian silent cinema. Here's more detail on how it was made: Inspiration & Fine-Tuning: The team found a batch of 1800s photos at a thrift store that was then used to LoRA fine-tune our image generation model Imagen to generate new images in the same vintage style. If you want to try this yourself, you can also use "Style Ingredients" in our filmmaking tool Flow. This allows you to directly fine-tune ...
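AI Models Keep Breaking Through: Guzhanggui Securities Consulting Looks Ahead to Investment Opportunities in the Core Technology Theme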
Core Insights
- The AI sector is experiencing a new wave of innovation, highlighted by the release of Anthropic's Claude Opus 4 and Claude Sonnet 4, with Opus 4 being recognized as the "best programming model in the world" [1]
- Google's launch of the AI film production platform "Flow" at the I/O developer conference integrates multiple models for automated content creation, attracting significant attention from the global film and AI technology industries [1]
- In China, Kunlun Wanwei has launched the Skywork Super Agents App, indicating the acceleration of Chinese AI agent technology towards global users [1]
Industry Trends
- The rapid evolution of the global technology landscape is prompting a new round of value reassessment across the industry chain, with AI large models becoming a core focus in the capital markets over the next few years [1]
- The entire industry chain is experiencing sustained prosperity and accelerated technological penetration, driven by foundational computing infrastructure, model training support, software ecosystem development, and application implementation [1]
- Local companies with core technological capabilities and industry integration advantages are expected to achieve rapid breakthroughs in key scenarios such as multimodal models, AI agents, and smart terminals, especially in the context of deepening US-China tech competition and increased policy support [1]
Investment Strategy
- The company has developed a forward-looking technology allocation map by systematically analyzing AI industry chain-related entities, helping investors identify beneficial segments within the industry chain [2]
- The investment logic is shifting from foundational reasoning to scenario implementation, as AI large models simultaneously advance in "usability" and "creativity" [2]
- Continuous attention to the structural evolution and valuation inflection points of the AI industry chain will be crucial for investors aiming to build a long-term stable portfolio [2]
Google (GOOGL.US) Gemini Unlocks a New Paid Feature: Photo-to-Video Fully Rolled Out
Zhitong Finance (智通财经网)· 2025-07-11 02:36
Core Insights
- Alphabet's Google has launched a "photo to video" feature for paid users, integrating it into the Gemini AI assistant, marking a significant step in AI video technology [1]
- The feature allows users to create 8-second videos from a single photo and text description, with a resolution of 720p [1]
- This update positions Google competitively against rivals like OpenAI and Runway AI, as well as Chinese companies such as Alibaba and Kuaishou, which have also released upgraded video tools [1]
Feature Details
- The new functionality is available to subscribers of Google AI Ultra and Pro plans, with web access starting immediately and mobile updates rolling out within the week [1]
- The feature is powered by Google's latest video generation model, Veo 3, which was previously limited to a standalone paid tool [1]
Compliance and Safety Measures
- Google has implemented significant backend measures to ensure compliance, including restrictions on generating videos using images of public figures and prohibiting content that incites violence or dangerous behavior [1]
Technical Limitations
- Testing revealed that the technology still has flaws, such as altering facial features and ethnic characteristics when generating videos from personal photos [2]
- While the model performs well with simple animations, it struggles with more complex requests, indicating that photo-to-video and facial animation technologies are still in development [2]
- Google acknowledges these limitations and plans to continue improving the functionality in future updates [2]
A whistle stop tour of AI creation with Paige Bailey
Google DeepMind· 2025-07-10 13:06
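Gemini Model Progress and Features
- Google DeepMind has released the upgraded Veo 3 model, which improves markedly on both visuals and audio and can generate more realistic, more immersive video content [1][2]
- Veo 3 introduces prompt rewriting, which refines the user's input prompt so that it is more detailed and closer to what the user has in mind, improving the quality of the generated video [1]
- Clips generated by Veo 3 are typically 8 seconds long, a length chosen to leave ample room for creative control in the public release; longer internal versions also exist [2]
- Gemini models can output text, code, images, and audio, and can edit images and control audio, because multiple modalities are integrated into a single model rather than stitched together from separate models [3]
- Gemini models are trained on multimodal data that combines video, audio, and detailed frame-level descriptions, which lets them generate more natural and realistic sound and responses [3]
Gemini in AI Studio and Flow
- AI Studio is an experimental platform where users can try the latest Gemini models, including text-to-speech that can generate audio with different emotions and in different languages [5][12]
- Flow is a professional filmmaking tool developed by the Google Labs team; it provides a dedicated environment in which users can stitch video clips together, control the camera, and carry out other advanced edits [3][4]
- The Gemini Live feature in AI Studio, combined with Project Astra's real-time visual understanding, can analyze on-screen content in real time and provide relevant information [14][16]
Gemini's Potential in Application Development
- AI Studio offers a new Build feature that lets even users with no programming experience build applications with Gemini models; the generated code is optimized for the latest SDK [28][29]
- Applications created with Build can be deployed directly to Cloud Run, making them easy to share with others [39][40]
- Gemini models help developers focus on building and designing product experiences rather than spending large amounts of time on code maintenance and upgrades [42][44]
Safety and Ethical Considerations
- Veo models include safety filters to prevent the generation of inappropriate content, such as images involving children or specific public figures [20][21]
- Videos generated through the Gemini app carry a dedicated watermark indicating that they are AI-generated, reducing the risk of deepfakes and scams [20][21]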
Meet the Only "Magnificent Seven" Stock That Is Cheaper Than the S&P 500 (According to This Key Metric)
The Motley Fool· 2025-06-27 10:17
Core Viewpoint
- The "Magnificent Seven" companies, including Alphabet, are experiencing a shift in performance, with Alphabet becoming undervalued compared to the S&P 500, raising questions about its growth potential and investment attractiveness [1][3][10].
Group 1: Company Performance
- The Magnificent Seven have historically outperformed the S&P 500, but in 2025, companies like Apple and Alphabet are underperforming [2][3].
- Alphabet's forward P/E ratio is 17.4, significantly lower than the S&P 500's 21.8, indicating a discounted valuation despite its industry leadership [8][10].
- The small difference between Alphabet's forward and current P/E suggests lower near-term earnings growth expectations from investors [11].
Group 2: Revenue and Business Model
- Alphabet generates most of its revenue from Google Services, particularly Google Search, which accounted for over $50 billion in revenue, representing 65.6% of total services revenue [12][13].
- The company's heavy reliance on Google Search raises concerns about its valuation, especially as competition from platforms like ChatGPT and TikTok increases [14][15].
Group 3: Competitive Landscape and Innovation
- Alphabet is facing challenges to its search dominance but has made significant advancements in AI, particularly with the rebranding of its generative AI model to Gemini [15][16].
- The integration of Gemini across Alphabet's ecosystem could enhance growth, although competition has forced the company to innovate more rapidly [17].
Group 4: Investment Outlook
- Despite concerns about its market position, Alphabet's earnings are expected to grow steadily, supporting free cash flow generation and potential buybacks [18].
- The current valuation of Alphabet is considered too cheap to ignore, positioning it as a compelling buy for long-term investors [18].
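To make the valuation gap in the summary above concrete, here is a minimal sketch of the arithmetic behind "cheaper than the S&P 500". It uses only the two P/E figures quoted in the summary (17.4 and 21.8); the function names `pe_discount` and `earnings_yield` are illustrative helpers, not part of any library, and the sketch assumes both figures are measured on the same (forward) basis.

```python
def pe_discount(stock_pe: float, benchmark_pe: float) -> float:
    """Fraction by which stock_pe sits below benchmark_pe (negative means a premium)."""
    return 1.0 - stock_pe / benchmark_pe


def earnings_yield(pe: float) -> float:
    """Earnings yield is the inverse of the P/E ratio."""
    return 1.0 / pe


# Figures quoted in the summary; assumed to be on the same forward basis.
alphabet_pe = 17.4
sp500_pe = 21.8

discount = pe_discount(alphabet_pe, sp500_pe)
print(f"Alphabet P/E discount to the S&P 500: {discount:.1%}")
print(f"Alphabet earnings yield: {earnings_yield(alphabet_pe):.2%}")
print(f"S&P 500 earnings yield:  {earnings_yield(sp500_pe):.2%}")
```

On these inputs the discount works out to roughly 20%, which is the gap the article's headline metric refers to.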
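Google's AI Try-On Tool Is Seriously Impressive: Upload a Photo, Get an OOTD in Seconds, and the Video Looks Just Like a Mirror
QbitAI (量子位)· 2025-06-27 08:09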
Core Viewpoint
- Google has launched a new application called Doppl, which allows users to virtually try on clothes by uploading a photo of themselves, enhancing the online shopping experience [1][8].
Group 1: Features of Doppl
- Users can generate dynamic videos to see how clothes look on them, eliminating the need to wait in long lines at fitting rooms [2][11].
- The app supports both static and dynamic try-ons, with the latter providing a more intuitive visual experience [11].
- Users can upload a full-body photo and select clothing images to see how they would look wearing them [14][9].
Group 2: Usage Guidelines
- For optimal results, users should upload a full-body photo in fitted clothing and choose clothing images that are well-lit and without excessive wrinkles [14][15].
- The app allows users to try on various clothing items, including those seen in thrift stores or on social media, but does not support accessories like shoes or swimwear [21][16].
- After trying on clothes, users can share their results with friends for feedback [25].
Group 3: Development and Testing
- Doppl is currently in the testing phase under Google Labs, which is known for showcasing new products and gathering user feedback [29][30].
- Google Labs has previously introduced other experimental products, such as Portraits and Flow, which utilize AI for interactive experiences [31][37].