Workflow
Multimodal understanding
icon
Search documents
X @Demis Hassabis
Demis Hassabis· 2025-12-05 17:02
RT Josh Woodward (@joshwoodward)If you're building something that needs *any* type of multimodal understanding (e.g. doc understanding, video understanding, screen understanding, …), you want to be using Gemini right now. It's very good at this. ...
What’s new in Gemini 3?
Google DeepMind· 2025-12-01 14:21
We just released Gemini 3, our most intelligent model. Here's what you can do with it. It's state-of-the-art for reasoning and multimodal understanding.So, it's much better at figuring out the context and intent behind your request. Give it a video of your pickle ball match, and Gemini will provide expert level analysis, helping improve your game. Or break down a dense research paper by asking it to code you an interactive guide that visualizes the concepts.Building on Gemini 3, Nano Banana Pro adds an impr ...
Gemini 3: Turn a research paper into an interactive website
Google DeepMind· 2025-11-18 16:01
Gemini 3 combines multimodal understanding and coding to help you learn anything. Using Google AI Studio, see how Gemini 3 analyzes a complex research paper on materials science and deep learning to create code for a beautiful, interactive guide with 3D visualizations that make the concepts easy to explore. What will Gemini create if you upload a different paper? Find out more about Gemini 3: https://deepmind.google/models/gemini/ ___ Prompt: I want to learn about ""Scaling deep learning for materials disco ...
X @Demis Hassabis
Demis Hassabis· 2025-06-27 03:08
Model Announcement - Gemma 3n 模型发布,这是一个多模态(文本/音频/图像/视频)理解模型 [1] - 该模型仅需 2GB 内存即可运行 [1] - 首个参数小于 10B(十亿)的模型,在 @lmarena_ai 上的得分超过 1300 [1] Availability - Gemma 3n 模型已在 @huggingface, @kaggle, llama.cpp 等平台上线 [1]