Google's image model nano banana officially launches: highly capable, and priced below OpenAI's comparable models
Founder Park·2025-08-27 03:16

Core Viewpoint
Google has launched its latest image generation and editing model, Gemini 2.5 Flash Image, also known as nano-banana. It is being hailed as the "strongest image model" for its image generation and editing capabilities [2][4].

Group 1: Model Performance
- Nano-banana received over 2.5 million votes in blind tests and leads its closest competitor by 171 Elo points, the largest Elo score advantage in LMArena history [2][3].
- Its four key capabilities are character consistency, prompt-based editing, native world knowledge, and multi-image fusion, which together improve its performance relative to similar models [19][20].

Group 2: Key Features
- Character consistency lets the model generate new visual content while keeping characters, subjects, or objects recognizably the same across different poses, lighting, environments, or styles [8][24].
- The model can apply a specific artistic style, design, or texture from one image to another while preserving the original subject's form and details [11].
- It supports creative composition: given a single prompt, it merges elements from multiple images into one cohesive result [13][35].

Group 3: Pricing and Accessibility
- Gemini 2.5 Flash Image is priced at $30.00 per million output tokens, which works out to approximately $0.039 per image and makes it significantly cheaper than similar models from OpenAI [38][39].
- The model is available to developers through the Gemini API and Google AI Studio, and to enterprises via Vertex AI [4][38].
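
The two pricing figures quoted above ($30.00 per million output tokens and roughly $0.039 per image) can be cross-checked with simple arithmetic: together they imply about 1,300 output tokens per image. The Python sketch below works that out and uses it to estimate a batch cost. It is only an illustration: the function names are hypothetical, it assumes every image consumes the same number of output tokens, and it ignores input-token charges, none of which is stated in the summary.

```python
"""Back-of-the-envelope check of the per-image price quoted in the summary."""

# Figures taken from the summary above.
PRICE_PER_MILLION_OUTPUT_TOKENS_USD = 30.00
QUOTED_PRICE_PER_IMAGE_USD = 0.039


def implied_tokens_per_image(price_per_image_usd: float,
                             price_per_million_usd: float) -> float:
    """Output tokens per image implied by the two quoted prices."""
    return price_per_image_usd / price_per_million_usd * 1_000_000


def cost_usd(n_images: int,
             tokens_per_image: float,
             price_per_million_usd: float = PRICE_PER_MILLION_OUTPUT_TOKENS_USD) -> float:
    """Estimated output-token cost of generating n_images images.

    Assumption (not from the source): every image uses the same
    number of output tokens, and input tokens are not billed here.
    """
    return n_images * tokens_per_image * price_per_million_usd / 1_000_000


if __name__ == "__main__":
    tokens = implied_tokens_per_image(QUOTED_PRICE_PER_IMAGE_USD,
                                      PRICE_PER_MILLION_OUTPUT_TOKENS_USD)
    print(f"Implied output tokens per image: {tokens:.0f}")
    print(f"Estimated cost of 1,000 images: ${cost_usd(1000, tokens):.2f}")
```

Under these assumptions, 1,000 generated images come to roughly $39 in output tokens; actual bills depend on the current price list and on any input-side usage.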