Google DeepMind
Search documents
Can AI help to save endangered birds?
Google DeepMind· 2025-08-07 15:04
Conservation Crisis & Biodiversity Loss - Hawaii faces a significant conservation crisis, being known as the extinction capital of the world [2] - Almost 75% (three-fourths) of native species in Hawaii have been lost [2] - The deterioration of native forests is expected without the presence and activity of native birds [3] Threats to Bird Populations - The introduction of mosquitoes carrying avian malaria has led to the extinction of many bird species in Hawaii [2] - Global warming is causing temperatures to rise, increasing the mosquito line and threatening bird populations at higher elevations [3] Conservation Efforts & Technology - Conservation efforts involve deploying recording equipment in forests to estimate bird populations and assess their response to conservation actions [4] - Bioacoustics is being used to monitor bird populations and their response to conservation efforts [4] - AI, specifically the Perch model using Google tools, is being used to analyze soundscapes for timely conservation decisions, aiding in species identification and detection of new sounds [5] - The "Perch Search" AI tool enables rapid scanning of soundscapes to identify specific species and detect changes in bird activity in treated areas [6]
Genie 3: Creating dynamic worlds that you can navigate in real life
Google DeepMind· 2025-08-05 14:37
Technology & Innovation - Genie 3 introduces a new frontier for world models, enabling the generation of interactive environments from text prompts [1] - The technology features real-time interactivity, allowing environments to react to user actions and movements [1] - Genie 3 incorporates world memory, ensuring consistency and persistence of actions within the generated environments [2][3] - Promptable events enable dynamic addition of new elements and scenarios into the world [3] Potential Applications - Genie 3 can be utilized for next-generation gaming and entertainment experiences [4] - The technology holds potential for embodied research and training robotic agents in simulated environments [5] - World models can facilitate disaster preparedness and emergency training through simulated scenarios [5] - The technology could benefit learning, agriculture, manufacturing, and other fields [5]
Solving years-old math problems with Gemini 2.5 Deep Think
Google DeepMind· 2025-08-01 11:06
with my most recent experience with uh Gemini deep think. The answer was spectacular. This is a mathematical conjecture that was made by some people some years ago.They didn't manage to prove it back then. They checked many cases and then they just left it as a conjecture. I asked the statement of the conjecture to Gemini deep think and it seems like it proved it right away with a completely different method.When I was thinking about solving that question, I was thinking about maybe three different things, ...
The Great Voyage
Google DeepMind· 2025-07-16 14:23
AI Model Development & Fine-Tuning - Google's creative team utilized a batch of 1800s photos to LoRA fine-tune the Imagen model for vintage style image generation [1] - The filmmaking tool Flow allows users to directly fine-tune Veo with a single image using "Style Ingredients" [1] AI Tool Utilization in Filmmaking - Veo 2 Image to Video was used to animate still images [1] - Gemini was used for generating prompts and motion ideas to shape the story [1] - Lyria 2 was employed to create a period music soundtrack [1] Post-Production & Editing - Final Cut Pro (video) and Logic Pro (sound) were used to assemble the film [1] - Text cards and vintage effects were selectively added, with Imagen used to create the background for the cards [1]
A whistle stop tour of AI creation with Paige Bailey
Google DeepMind· 2025-07-10 13:06
Gemini模型进展与特点 - Google DeepMind发布了升级版VO3模型,该模型在视觉和听觉效果上都有显著提升,能够生成更逼真、更具沉浸感的视频内容 [1][2] - V3模型引入了prompt rewriting功能,可以优化用户输入的prompt,使其更详细、更符合用户的设想,从而提高生成视频的质量 [1] - V3模型生成的视频片段通常为8秒,这是为了在公开版本中提供充分的创作控制空间,更长的内部版本也存在 [2] - Gemini模型能够输出文本、代码、图像和音频,并且能够编辑图像和控制音频,这得益于其将多种模态信息整合到一个模型中,而不是依赖于拼接不同的模型 [3] - Gemini模型通过整合视频、音频和详细的帧级别描述等多模态数据进行训练,从而能够生成更自然、更逼真的声音和响应 [3] Gemini在AI Studio和Flow中的应用 - AI Studio提供了一个实验平台,用户可以在其中尝试最新的Gemini模型,包括文本转语音功能,可以生成具有不同情感和语言的音频 [5][12] - Flow是由Google Labs团队开发的专业电影制作工具,它提供了一个专门的开发环境,允许用户拼接视频片段、控制摄像头,并进行其他高级编辑 [3][4] - AI Studio中的Gemini Live功能,结合了Project Astra的实时视觉理解能力,可以实时分析屏幕内容并提供相关信息 [14][16] Gemini在应用开发中的潜力 - AI Studio提供了一个新的build功能,即使是没有编程经验的用户也可以使用Gemini模型构建应用程序,生成的代码针对最新的SDK进行了优化 [28][29] - 通过build功能创建的应用程序可以直接部署到Cloud Run,从而方便用户与他人分享和使用 [39][40] - Gemini模型可以帮助开发者专注于构建和构思产品体验,而无需花费大量时间进行代码维护和升级 [42][44] 安全与伦理考量 - VO模型引入了安全过滤器,以防止生成不当内容,例如涉及儿童或特定公众人物的图像 [20][21] - 通过Gemini App生成的视频带有专门的水印,以表明其为AI生成,从而减少deepfake和诈骗的风险 [20][21]
Could digital twins – virtual replicas of humans or organs – accelerate medical research? 🏥
Google DeepMind· 2025-07-03 11:17
Digital Twins in Healthcare - Digital twins are a significant topic, especially for healthcare applications [1] - The concept involves creating digital replicas for individual healthcare purposes, with varied interpretations [1] Pharmaceutical Industry Applications - The pharma industry explores digital twins to enhance clinical trials [2] - Virtual cohorts could potentially reduce the number of participants needed in clinical trials while maintaining knowledge about drug safety and efficacy [2][3] - This approach simulates human systems (e g, organs) to minimize reliance on human subjects in trials [2]
A Quest for a Cure: AI Drug Design with Isomorphic Labs
Google DeepMind· 2025-06-05 16:56
AI在药物设计中的应用 - Isomorphic Labs 旨在创建一个 AI 药物设计引擎,能够针对任何疾病和蛋白质靶点,设计出调节蛋白质和细胞功能的分子,从而改善患者的病情 [1] - 行业普遍认为,五年内,不使用 AI 进行药物设计就像在科学研究中不使用数学一样 [1][43] - AlphaFold 3 可以在几秒钟内预测分子结构,而传统的 X 射线晶体学方法可能需要数月甚至数年 [3] - AI 模型通过分析包含数十万个 3D 结构的蛋白质数据库(Protein Data Bank, PDB),学习并泛化到新的蛋白质和分子 [2] - AI 可以通过生成模型和搜索方法,在 10^60 数量级的巨大分子设计空间中进行智能探索,找到合适的药物分子 [2] 药物研发的挑战与未来 - 解决复杂疾病的难点在于对疾病驱动因素的理解不足,以及癌症等疾病的持续进化 [1][2] - 药物设计不仅要考虑分子与靶蛋白的结合,还要考虑结合强度、对其他蛋白质的潜在副作用、稳定性、溶解性等多种相互制约的因素 [6][7] - 临床试验失败率高达 90%,主要原因是动物模型不能很好地复制人类生理 [27][28] - AI 可以通过预测分子与其他蛋白质的相互作用,在药物设计早期发现潜在的毒性和副作用 [31][32] - 药物研发行业有很高的失败率,平均 20 个药物化学家只有 1 个能成功将药物推向市场 [39] - 预计未来五年内,将会有 AI 设计的药物获得批准上市,AI 将在药物研发的各个阶段发挥更大的作用 [41][42]
Gemini 2.5 Pro Deep Think Demo | Competitive coding problem
Google DeepMind· 2025-05-22 22:26
Model Overview - Google DeepMind introduces Gemini 2.5% Pro Deep Seek, an enhanced reasoning mode built on 2.5% Pro [1] - DeepSeek utilizes parallel reasoning passes to discover sophisticated solutions [1] Technical Capabilities - DeepSeek excels at solving complex problems like "Catch the Mo" from Codeforces, a competitive coding platform [1][2] - It employs a minmax strategy to reduce uncertainty and efficiently track the mole's changing position [3][4] - DeepSeek's solution demonstrates elegant management of uncertainty in dynamic environments [3] Availability - DeepSeek is being made available to trusted testers via the Gemini API [4] - Wider availability of DeepSeek is planned for the future [4]
Veo 3 demo | Sailor and the sea
Google DeepMind· 2025-05-20 23:01
General Observation - The ocean is described as a force, wild and untamed [1]
Veo 3 demo | Owl and badger
Google DeepMind· 2025-05-20 23:01
Observations - The document mentions a ball and its bouncing ability [1] - The bouncing height of the ball exceeds the observer's jumping ability [1] - The observer describes the bouncing as "magic" [1]