Google DeepMind
Search documents
From sketches to prototype: Designing with generative AI
Google DeepMind· 2025-10-01 16:00
AI & Design Collaboration - Google DeepMind 与艺术家合作,探索 AI 的应用,并利用艺术家的创造力来影响 AI 技术的发展 [1] - Google 使用 Ross Lovegrove 的草图训练图像生成模型,以产生基于其独特设计语言的新想法 [2] - AI 生成了数百个椅子的迭代和排列方案 [2] - Gemini 被用于优化设计,从多个角度可视化椅子设计,并通过先进的 3D 打印工艺实现 [3] Project Outcome & Future Implications - 该项目展示了 AI 如何帮助人们以新的维度进行思考 [4] - AI 可以为设计过程带来独特而非凡的东西 [4] - 该项目预示着 AI 在设计领域的未来应用 [4]
Gemini Robotics 1.5: Using agentic capabilities
Google DeepMind· 2025-09-25 15:54
[Music] Hi, Aloha. Can you use my location and sort the objects into the correct compost, recycling, and trash bins. To perform the sorting task based on San Francisco waste guidelines, I will use the green bin for compost, the blue bin for recycling, and the black bin for trash.Here we have an agentic layers. Both control the robot and have access to the internet. To sort the trash, they had to look up the rules and look at the objects on the table, think about where each object should go, matching the rul ...
Gemini Robotics 1.5: Learning across embodiments
Google DeepMind· 2025-09-25 15:54
This is usually how we train our robots is actually someone teleoperates the robot. So they have to physically move the robot. And right now what he's doing is he's training the robot how to do a task.So traditionally people will train a single model per robot. In Geminina Robotics 1.5%, we're actually using a single model across multiple robots. So, here's an example.We're hanging things from that workbench. One of the key things in 1.5% is that now all of our other robots can actually do the same task. Th ...
Gemini Robotics 1.5: Enabling robots to plan, think and use tools to solve complex tasks
Google DeepMind· 2025-09-25 15:54
Earlier this year, we brought Gemini's multimodal understanding to the physical world with Gemini robotics, allowing robots to behave in interactive, dextrous, and general ways. Previously, robots could complete one task per instruction. >> Previous Gemini Robotics version has been tested over and over and over again to put this banana into the bowl.This is a very simple task. >> Today, we've reached a new milestone. We're introducing Gemini Robotics 1.5%, a new family of models to power the next generation ...
Gemini Robotics 1.5: Thinking while acting
Google DeepMind· 2025-09-25 15:53
[Music] Previous Gemini Robotics version has been tested over and over and over again to put this banana into the bowl. This is a very simple task. I'm going to switch the model here and make the task slightly more challenging.[Music] So this is Gemini Robotics 1.5%. Aloha. Clear sort these fruits into color matching plates.>> Sure, I can certainly help you do that. Put the green fruit into the green plate. >> We enable it to think.It can perceive the environment, different colors of the object, different c ...
Google DeepMind researchers react to Nano Banana demos 🍌
Google DeepMind· 2025-09-24 17:26
I think the fact that people surprise us with a model we built is the best idea. So, so this is like a demo with nano banana hooked up into I think it's an studio demo. It's hooked onto a canvas and you can like drag these isometric shapes around.Oh, and you're so cool. I mean, we often thought of like Nano Banana as a single tool, as a single thing, but now actually this becomes more part of a pipeline. Wait, San Francisco. They merged San Francisco, New York halfway.What. Oh, no way. Oh, wow.Is that the B ...
Can AI help to save endangered birds?
Google DeepMind· 2025-08-07 15:04
Conservation Crisis & Biodiversity Loss - Hawaii faces a significant conservation crisis, being known as the extinction capital of the world [2] - Almost 75% (three-fourths) of native species in Hawaii have been lost [2] - The deterioration of native forests is expected without the presence and activity of native birds [3] Threats to Bird Populations - The introduction of mosquitoes carrying avian malaria has led to the extinction of many bird species in Hawaii [2] - Global warming is causing temperatures to rise, increasing the mosquito line and threatening bird populations at higher elevations [3] Conservation Efforts & Technology - Conservation efforts involve deploying recording equipment in forests to estimate bird populations and assess their response to conservation actions [4] - Bioacoustics is being used to monitor bird populations and their response to conservation efforts [4] - AI, specifically the Perch model using Google tools, is being used to analyze soundscapes for timely conservation decisions, aiding in species identification and detection of new sounds [5] - The "Perch Search" AI tool enables rapid scanning of soundscapes to identify specific species and detect changes in bird activity in treated areas [6]
Genie 3: Creating dynamic worlds that you can navigate in real life
Google DeepMind· 2025-08-05 14:37
Technology & Innovation - Genie 3 introduces a new frontier for world models, enabling the generation of interactive environments from text prompts [1] - The technology features real-time interactivity, allowing environments to react to user actions and movements [1] - Genie 3 incorporates world memory, ensuring consistency and persistence of actions within the generated environments [2][3] - Promptable events enable dynamic addition of new elements and scenarios into the world [3] Potential Applications - Genie 3 can be utilized for next-generation gaming and entertainment experiences [4] - The technology holds potential for embodied research and training robotic agents in simulated environments [5] - World models can facilitate disaster preparedness and emergency training through simulated scenarios [5] - The technology could benefit learning, agriculture, manufacturing, and other fields [5]
Solving years-old math problems with Gemini 2.5 Deep Think
Google DeepMind· 2025-08-01 11:06
with my most recent experience with uh Gemini deep think. The answer was spectacular. This is a mathematical conjecture that was made by some people some years ago.They didn't manage to prove it back then. They checked many cases and then they just left it as a conjecture. I asked the statement of the conjecture to Gemini deep think and it seems like it proved it right away with a completely different method.When I was thinking about solving that question, I was thinking about maybe three different things, ...
The Great Voyage
Google DeepMind· 2025-07-16 14:23
AI Model Development & Fine-Tuning - Google's creative team utilized a batch of 1800s photos to LoRA fine-tune the Imagen model for vintage style image generation [1] - The filmmaking tool Flow allows users to directly fine-tune Veo with a single image using "Style Ingredients" [1] AI Tool Utilization in Filmmaking - Veo 2 Image to Video was used to animate still images [1] - Gemini was used for generating prompts and motion ideas to shape the story [1] - Lyria 2 was employed to create a period music soundtrack [1] Post-Production & Editing - Final Cut Pro (video) and Logic Pro (sound) were used to assemble the film [1] - Text cards and vintage effects were selectively added, with Imagen used to create the background for the cards [1]