Workflow
Image Generation
icon
Search documents
Every model that matters
Matthew Berman· 2026-03-17 19:30
Chat GPT writing, coding, image generation, claude, better coding, better writing, plug-in, different tools. We have notion, Figma, Slack, HubSpot, Gemini, incredibly fast, ingest video, image gen, best integrations, best search as well, Grock, search, Twitter, image generation model, open source, Meta's llama model, DeepSeek, Miniaax and Quen, GPT, OSS, Neotron, Gemma, image generation models, Midjourney was an early leader, Dolly, stable diffusion, flux, ideoggram, Many many others. Video generation model ...
美工即将全面失业!谷歌发布Banana2图片生成大模型,已经可以商用!Ai助理Perplexity和三星合作成为S26的标配Ai软件【Vic TALK 第1576期】
Vic TALK· 2026-02-27 05:47
https://x.com/ArmanHezarkhani/status/2026308695399784913 推特:https://x.com/victalk6886 Telegram :victalk2021 #clawdbot #aivideo #ai agent #ai 私人助理 #moltbook #人工智能社交平台 #ai雇佣人类 #seedance #simplclaw #agentwars #GLM-5 #elys #perplexityai #moonlake #banban2 ...
Nano Banana 2 🍌
Matthew Berman· 2026-02-27 02:09
Nano Banana 2 is here image generation model from Google. It is not just an image generator. It actually knows about the world and you can test this by giving it prompts that ask it to output images based on things about the world.It also renders text and infographics incredibly well. And you can generate images up to 4K resolution. Look at some of these examples.This is a flat lay infographic. Beautiful weather. It looks like every element in this image is real.Here's a tptic infographic depicting differen ...
X @Demis Hassabis
Demis Hassabis· 2026-02-26 19:51
top 🍌Arena.ai (@arena):🚨BREAKING: Nano Banana 2 debuts at #1 in Image Arena, and it changes the game again 🍌🍌Officially released as Gemini 3.1 Flash Image Preview, it is powered by real-time information and images from web search.Highlights:- #1 Text-to-Image scoring 1279, surpassing https://t.co/lOS3VJEI8T ...
X @Demis Hassabis
Demis Hassabis· 2026-02-26 16:49
Nano Banana 2 is our new faster and better SOTA image generation & editing model!It uses Gemini’s amazing world understanding + grabs real-time info w/ search to create higher quality outputs.Available in @GeminiApp, @GoogleAIStudio, @FlowbyGoogle, Search & Vertex - enjoy!Google DeepMind (@GoogleDeepMind):We’re launching Nano Banana 2, built on the latest Gemini Flash model. 🍌It’s state-of-the-art for creating and editing images, combining Pro-level capabilities with lightning-fast speed. 🧵 https://t.co/b3s ...
Google launches Nano Banana 2 model with faster image generation
TechCrunch· 2026-02-26 16:00
Core Insights - The company has launched Nano Banana 2, an advanced image generation model that produces more realistic images than its predecessor, Nano Banana [1][2] - Nano Banana 2 will be the default model in the Gemini app and will also be integrated into Google's video editing tool, Flow, and Google Search results [7][8] Group 1: Model Features - Nano Banana 2 can generate images with resolutions ranging from 512px to 4K and supports various aspect ratios [2] - The model maintains character consistency for up to five characters and fidelity for up to 14 objects in a single workflow, enhancing storytelling capabilities [5] - Users can issue complex requests with detailed nuances, resulting in images with vibrant lighting, richer textures, and sharper details [5] Group 2: Availability and Integration - The new model will be the default for image generation across all applications in the Gemini app and will be available in preview through the Gemini API, Gemini CLI, and Vertex API [7][14] - Subscribers to Google's higher-end plans, Google AI Pro and Ultra, can still use Nano Banana Pro for specialized tasks [8] Group 3: Image Verification - All images generated by Nano Banana 2 will feature a SynthID watermark, indicating they are AI-generated [15] - The images will also be compatible with C2PA Content Credentials, a standard developed by an industry consortium including Adobe, Microsoft, Google, OpenAI, and Meta [15]
X @Elon Musk
Elon Musk· 2026-01-30 21:21
Try the new @Grok Imagine image & video generation. Super easy to use.Just go to https://t.co/Ui0vr66BL1 or download the app https://t.co/u2y4RZSsODDéborah (@dvorahfr):Try Grok Imagine, you'll be amazed like a child! https://t.co/dLSbaOlkG9 ...
让扩散模型「可解释」不再降质,开启图片编辑新思路
机器之心· 2025-12-16 02:31
Core Viewpoint - The article discusses the emergence of TIDE (Temporal-Aware Sparse Autoencoders) as a significant advancement in making diffusion models interpretable without sacrificing their generative quality [3][17]. Group 1: Background and Challenges - Over the past three years, diffusion models have dominated the image generation field, with architectures like DiT pushing the limits of image quality [2]. - Despite the growth in explainability research for LLMs, the internal semantics and causal pathways of diffusion models remain largely opaque, making them a "black box" [2]. - Existing attempts at explainability often lead to a noticeable decline in performance, making the pursuit of interpretable diffusion models seem impractical [2]. Group 2: Introduction of TIDE - TIDE is introduced as the first truly temporal-aware framework for diffusion transformers, aiming to reveal the internal mechanisms of these models without compromising their generative capabilities [3][5]. - The framework emphasizes the importance of the temporal aspect of the diffusion process, which unfolds progressively over time [6]. Group 3: Mechanism and Functionality of TIDE - TIDE aligns semantics along the time dimension, allowing for a clearer presentation of the diffusion model's internal processes, such as the emergence of structure from noise and the gradual formation of semantics [7]. - The sparse autoencoder in TIDE enables lossless reconstruction in the feature space, maintaining the stability of the diffusion trajectory while being "observed" [7][10]. Group 4: Performance and Results - TIDE decomposes diffusion features into controllable semantic factors, enhancing image editing capabilities by allowing direct manipulation along clear semantic directions [8][10]. - The impact of TIDE on generative quality is minimal, with FID and sFID changes being less than 0.1%, demonstrating its ability to be interpretable without degrading quality [10][14]. - TIDE shows significant improvements in semantic binding and understanding of spatial relationships, with multiple metrics indicating optimal performance [12]. Group 5: Implications and Future Directions - TIDE represents a new research paradigm, suggesting that diffusion models can be interpretable with the right perspective [19]. - Future developments may include more controllable and robust diffusion editing systems, unified understanding of generative models, and advancements in causal and semantic theory research [21][22].
Disney to Invest $1 Billion in OpenAI, License Characters on Sora
Youtube· 2025-12-11 16:00
Group 1 - Disney has made a significant $1 billion investment in open air, indicating a vote of confidence in the integration of their intellectual property with emerging technologies [1][3] - The company is now the first major content licensing partner on Sora, focusing on image creation and generative AI, which aligns with current trends in productivity and creativity [2][5] - There is a notable concern regarding the impact of artificial intelligence on labor, especially in light of recent Hollywood strikes aimed at addressing these issues [4][6] Group 2 - The use of generative AI in production is becoming more common, with competitors like Netflix and companies such as Runway leading in image generation technology [5][6] - Despite advancements in AI, human creativity remains essential for producing iconic intellectual property, suggesting a need for collaboration between AI and human creators [6][7] - Disney's strategic moves in the AI space will require careful communication to its workforce about the ongoing importance of human roles in the creative process [7]
X @TechCrunch
TechCrunch· 2025-12-11 11:01
Runware raises $50M Series A to help make image, video generation easier for developers https://t.co/ef9JxyUx02 ...