Google DeepMind
SIMA 2: An agent that plays, reasons, and learns with you in virtual 3D worlds
Google DeepMind· 2025-11-13 14:51
AI Agent Capabilities
- SIMA 2 is a highly capable AI agent for complex and responsive virtual worlds [1]
- SIMA 2 can navigate and complete difficult multi-step tasks, understand multimodal prompts, and explain its actions [1]
- SIMA 2 learns, reasons, and improves through self-play, developing new skills without human input [2]
Learning and Adaptation
- SIMA 2 transfers learned skills from one virtual world to others, even previously unseen ones [2]
- The agent's capabilities are a step towards AI that can assist with tasks in the real world [2]
Waymo: The future of autonomous driving with Vincent Vanhoucke
Google DeepMind· 2025-11-06 18:57
How do you train an AI to drive more safely than a human? Professor Hannah Fry sits down with Waymo Distinguished Engineer Vincent Vanhoucke to break down the complexities of autonomous driving, from the "closed loop" problem of real-world traffic to using generative AI for simulation. Plus, Hannah moves from theory to practice, taking a Waymo for a spin in California to experience it firsthand.
Timecodes
00:00 Intro
01:02 Ride around town
03:48 The driverless car problem
08:43 Sensors
13:00 3D model of the world
...
Part 2: Social engineering, malware, and the future of cybersecurity in AI
Google DeepMind· 2025-10-16 16:08
Cybersecurity Threats & Actors
- Nation-state actors are primarily motivated by geopolitical aims and espionage, often engaging in offensive cyberattacks to support warfare or to pre-position for potential conflicts [5][6]
- Sub-nation-state actors and some nation-state activities are financially motivated, commonly using ransomware attacks to steal and encrypt data and demanding cryptocurrency for its release [9][10]
- A gray market exists for zero-day vulnerabilities, with buyers including companies that equip law enforcement and governments; some vulnerabilities are worth millions of dollars [12][14]
- AI is exacerbating social engineering risks by enabling deepfakes and making phishing attacks more tailored and effective, such as cloning voices for ransom demands or impersonating executives for financial fraud [30][32][33]
Vulnerability Disclosure & Mitigation
- Project Zero introduced a 90-day disclosure timeline for vulnerabilities, compelling companies to prioritize security patches to prevent exploitation by malicious actors [19][20]
- Governments have been known to deliberately withhold vulnerability information for exploitation purposes, as exemplified by the EternalBlue case [24]
- Healthcare and critical infrastructure sectors often struggle with patch management because patching risks disrupting essential services, which leaves vulnerabilities open for long periods [29]
- Multi-factor authentication and passkeys are emerging as strong defenses against phishing and password-related attacks, enhancing both security and user experience [37][39][40]
AI & Agent Security
- Risk-based authentication, enhanced by AI, assesses user behavior to determine trust levels and adjusts security friction accordingly, such as requiring multi-factor authentication when activity is anomalous [43][46]
- The rise of AI agents acting on behalf of humans introduces new security challenges, requiring careful consideration of agent identity, permissions, and potential for misuse [50][51]
- Contextual integrity is crucial for training AI agents to respect privacy norms and avoid disclosing sensitive data inappropriately, which requires mechanisms for agents to seek permission before sharing information [57][58][59]
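The risk-based authentication idea in the summary above can be sketched as a small scoring function. This is a minimal illustration only: the signals, weights, and thresholds are hypothetical, and hand-written rules stand in for the AI-driven behavior model the interview describes. It is not how any Google system works.

```python
from dataclasses import dataclass


@dataclass
class LoginAttempt:
    """Hypothetical signals observed for one login attempt."""
    known_device: bool
    usual_country: bool
    failed_attempts_last_hour: int


def risk_score(attempt: LoginAttempt) -> int:
    """Sum illustrative penalty points; a real system would use a learned model."""
    score = 0
    if not attempt.known_device:
        score += 2
    if not attempt.usual_country:
        score += 2
    # Cap the contribution of repeated failures at 3 points.
    score += min(attempt.failed_attempts_last_hour, 3)
    return score


def required_step(attempt: LoginAttempt) -> str:
    """Map the score to the amount of friction (thresholds are assumptions)."""
    score = risk_score(attempt)
    if score >= 5:
        return "deny"
    if score >= 2:
        return "require_mfa"
    return "allow"
```

A login from a known device in the usual country passes without friction, while an unfamiliar device triggers a multi-factor prompt, which is the "adjust security friction" behavior the summary refers to.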
Veo 3.1 - Add and remove objects to your scene
Google DeepMind· 2025-10-15 15:56
Add new elements to any scene. Introduce anything you can imagine, from realistic details to fantastical creatures. Veo now handles complex details like shadows and scene lighting, making the addition look natural. Remove unwanted objects or characters seamlessly. Soon, you’ll be able to take anything out of a scene, and Veo will reconstruct the background and surroundings, making it look as though the object was never there. Try it today in Flow at flow.google. Learn more: https://blog.google/technology/ai ...
Veo 3.1 - Frames to video
Google DeepMind· 2025-10-15 15:56
Control the shot from start to finish. Provide a starting and ending image, and Veo will generate a seamless video that bridges the two, perfect for artful and epic transitions. Try it today in Flow at flow.google. Learn more: https://blog.google/technology/ai/veo-updates-flow
Veo 3.1 - Ingredients to video
Google DeepMind· 2025-10-15 15:56
With "Ingredients to Video," you can use multiple reference images to control the characters, objects and style. Veo uses your ingredients to create a final scene that looks just as you envisioned. Try it today in Flow at flow.google. Learn more: https://blog.google/technology/ai/veo-updates-flow
Veo 3.1 - Create longer, seamless shots
Google DeepMind· 2025-10-15 15:56
With "Extend," you can create longer videos, even lasting for a minute or more, that connect to and continue the action from your original clip. Each video is generated based on the final second of your previous clip, making it most useful for creating a longer establishing shot. Try it today in Flow at flow.google. Learn more: https://blog.google/technology/ai/veo-updates-flow
Veo 3.1 and more artistic control in Flow
Google DeepMind· 2025-10-15 15:56
Product Updates
- Veo 3.1 introduces richer audio, more narrative control, and enhanced realism [1]
- Veo 3.1 builds on Veo 3, with stronger prompt adherence and improved audiovisual quality for image-to-video conversion [1]
- Audio comes to existing capabilities for the first time [1]
Technology
- Veo 3.1 is state-of-the-art [1]
Veo 3.1 - Designed to empower creatives
Google DeepMind· 2025-10-15 15:56
We're giving creators more artistic control with increased support for audio across all features, bringing audio to existing capabilities like "Ingredients to Video," "Frames to Video" and "Extend." We're also introducing Veo 3.1, which brings richer audio, more narrative control, and enhanced realism that captures true-to-life textures. Veo 3.1 is state-of-the-art and builds on Veo 3, with stronger prompt adherence and improved audiovisual quality when turning images into videos. Try it today at ...
Beyond phishing: Cyber threats in the age of AI with Four Flynn (pt. 1)
Google DeepMind· 2025-10-09 18:27
Social engineering, cyberattacks, and the fog of war - all topics covered in this interview with the VP of Security and Privacy at Google DeepMind. Hannah Fry and Four Flynn take us behind the scenes of Operation Aurora, the monumental 2009 attack on Google that forever changed the landscape of cybersecurity. They discuss the defender's dilemma, the constant battle between attackers and defenders in the digital world, and how AI can potentially help mitigate some of the most complex vulnerabilities. As Hann ...