Workflow
Image Generation
Nano Banana Pro | Live from Mountain View
Google· 2025-11-21 18:21
Product Launch & Features
- Nano Banana Pro showcases next-gen image generation and editing capabilities in AI Studio [1]
- Breakthrough features include state-of-the-art (SOTA) text rendering, multi-image editing for character consistency, and search tool calling [1]
- Real-time demos highlight diverse applications, such as 4K wallpaper apps, interactive newspapers with Veo video integration, cultural translation tools, and marketing campaigns [1]

Demo Highlights
- Vibe coding a comic book with branching storylines [1]
- Professional brand design demo focusing on a toothpaste pitch [1]
- Turning video into visual explainers [1]
- Airplane safety card style demo [1]
- Visualizing text-only menus with search grounding [1]
- One-shot studios demo creating pixel art game assets [1]
- Remixing floor plans [1]

Technical Aspects
- Discussion of latency during vibe coding of 4K wallpapers [1]
- Exploration of multilingual capabilities, visualizing menus in Urdu [1]
- Real-time news generator demo called "The Daily Gemini" [1]
How Google’s Nano Banana Achieved Breakthrough Character Consistency
Sequoia Capital· 2025-11-11 10:00
Model Development & Capabilities
- Google's Nano Banana image model, built upon the Gemini model, achieves character consistency from a single image through high-quality data, long multimodal context windows, and disciplined human evaluations [3][4][32][33]
- The model benefits from Gemini's multimodal foundational capabilities, including a long context window that allows for multiple image inputs and iterative conversations [33][34]
- A key technical breakthrough is the model's ability to generalize well, enabling it to maintain character consistency and edit images while preserving untouched elements [32][33][24]
- Craft and attention to detail in data selection and model design are as important as scale in achieving high-quality results [4][38][39]

Applications & Use Cases
- The model facilitates consistent character and scene preservation in video models, enabling smoother video creation with natural scene cuts [6][7][8]
- Users are creatively "hacking" the model for learning and information digestion, such as creating sketch notes from complex topics [9][10]
- The model allows users to see themselves in new ways, enhancing self-expression and identity through 3D figurines and other creative outputs [14]
- The technology has potential for personalized learning, multimodal creation, and specialized UIs that combine fine-grained control with automation [4][69][70]

Business & Product Strategy
- Google aims to build a single, powerful model capable of handling any modality and transforming it into any other, with specialized models like Imagen and Veo serving as stepping stones [47][48][49]
- The company is focusing on making the technology more accessible and easier to use for consumers, while also developing more precise control and robustness for professional workflows [43][66][67][68]
- Google is exploring new visual creation canvases and UIs to enhance user interaction with the models, moving beyond simple chatbot interfaces [72][73][74]
- Startups have opportunities to develop workflow-based tools for various verticals, leveraging the fundamental technology to address specific client needs [111][112]

Safety & Ethical Considerations
- Google is committed to preventing misuse of the technology, particularly in creating deepfakes and misinformation [89][90]
- The company employs visible watermarks and invisible SynthID to indicate AI-generated content and verify its origin [91][92][95]
- Google invests in ongoing testing and mitigation strategies to address new attack vectors and ensure responsible use of the models [93]
X @Elon Musk
Elon Musk· 2025-11-08 09:21
I just used the above prompt on the Grok image below:
Heisenberg (@rovvmut_): Holy moly Grok Imagine's image generation is getting so good 🤯 https://t.co/bcA8xjQlWa ...
AI News: Google's Suncatcher, OpenAI TEAR, Apple $1B Deal for Gemini, Vidu Q2, and more!
Matthew Berman· 2025-11-07 00:47
Google aims to put massive AI data centers in space. This is not science fiction. This is something they are actually working on. This is called Project Suncatcher. And the gist is they want to put data centers in space. They want to connect the data centers with satellites and they want to power the satellites with solar energy. So here are the interesting bits from this announcement. In the right solar orbit, a solar panel can be up to eight times more productive than on Earth. So, as solar panels continue ...
Why It Accidentally Got Called Nano Banana 🍌 | Made by Google Podcast S8E8
Google· 2025-11-03 18:42
So the official name is much more catchy. Gemini 2.5 Flash Image, and that's the official name. Beautiful. And I would love to tell you that a lot of thought and rigor went into the name Nano Banana, but the truth is ... Welcome to the Made by Google Podcast, where we meet the people who work on the Google products you love. Here's your host, Rachid Finge. In just a few short weeks, people created billions and billions of images with Nano Banana. Our guest today is David Sharon, a group product manager on the ...
X @Tesla Owners Silicon Valley
Grok's Imagine Feature Usage
- Grok's Imagine feature allows users to generate custom images [1]
- The process involves describing the desired image, confirming the request, and then receiving the generated image [1]

Image Generation Process
- Users should provide specific descriptions to achieve better image generation results [1]
- Grok requires confirmation before generating the image [1]
- Generated images can be used for visuals, memes, or creative purposes [1]
Which AI Model Makes the Best Images?
Matthew Berman· 2025-10-16 18:49
Image Generation Model Comparison
- The report compares four image generation models, Qwen Image Edit Plus, Nano Banana, GPT Image 1, and Seedream, across various image editing tasks [1][2]
- The models are tested on their ability to composite images, transport objects, match lighting, and perform other complex manipulations [2][4]
- The open-source script developed by the team allows users to automatically run prompts and upload images to all four models for comparison [11]

Model Performance Highlights
- Qwen Image Edit Plus excels in tasks requiring realistic lighting and object integration, often outperforming Nano Banana [4][5]
- GPT Image 1 demonstrates strength in maintaining style and consistency across images, particularly in portrait and complex scene generation [3][4]
- Nano Banana shows proficiency in image consistency and material transformation tasks, such as recoloring and blueprint rendering [31][33]
- Seedream shows good performance in specific tasks like motion dynamics and adding graffiti [10][48][67]

Task-Specific Performance
- In "bleeding edge" tasks pushing model limits, GPT Image 1 often emerges as the winner, particularly in tasks requiring precise anatomical detail and measurement [20][22]
- For object removal and reconstruction, Nano Banana consistently delivers the most realistic and seamless results [54][55]
- In style transfer tasks, Qwen Image Edit Plus and GPT Image 1 often produce the most visually appealing and accurate results [60][61]
- For adding text to images, Nano Banana and GPT Image 1 demonstrate strengths in perspective and transparency [66][68]
- In weather effects, Qwen Image Edit Plus and GPT Image 1 excel in creating realistic snowfall and rain effects [69][71]

Product Placement
- Dell Technologies sponsors the video, highlighting its Dell Pro Max laptops featuring Nvidia RTX Pro Blackwell chips with up to 32 GB of GPU memory, suitable for AI workloads [8][9]
Make the pet of your dreams a reality with Nano Banana
Google· 2025-09-26 19:46
Gemini's state-of-the-art image generation and editing lets you create almost anything. Imagine yourself with a pet dinosaur. Try a new hairstyle. Restore an old photo. Combine two photos into one. It's free for everyone. Go Nano Bananas. Try it now: https://gemini.google.com/ Learn more about Nano Banana: https://gemini.google/overview/image-generation/ Follow Gemini on Instagram: https://www.instagram.com/googlegemini/ Learn about our free pro plan for students: https://gemini.google/students Subscribe to ...
Upgrade your profile pic with Google Gemini
Google· 2025-09-18 18:40
Looking to Nano Banana your profile pic? Here are 4 prompts to try now:
- Graffiti Mural → Turn me into a huge graffiti mural on the side of a building.
- Tarot Card → Create a custom tarot card with a detailed folk-art, vibrant color style, of me.
- Neon Sign → Turn me into a simple neon sign hanging on a wall.
- Ceramic Mug → Preserving my likeness, create a ceramic mug version of my head. Make my head the entire mug.

Try it now: https://gemini.google.com/ Learn more about Nano Banana: https://gemini.google/overvie ...
X @Elon Musk
Elon Musk· 2025-09-05 18:33
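Product Development
- Grok videos can now produce sound [1]
- Image and video generation will receive a major upgrade within a few weeks [1]
- The product is still in early testing [1]

Features
- Grok Imagine videos can now talk; try speech mode [1]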