Core Insights - The article discusses the launch of Google's new image generation and editing model, named gemini-2.5-flash-image-preview, which boasts state-of-the-art capabilities and impressive speed [2][3]. Model Features - The model offers SOTA image generation and editing capabilities, with remarkable character consistency and fast processing speed [3]. - Users can access gemini-2.5-flash-image-preview for free through Google AI Studio and Gemini API, supporting a context of up to 32k [5]. - The model currently does not support image generation and editing for Chinese input, providing text responses instead [6]. - Pricing for the model is set at $0.3 for input text, $2.5 for output text, $0.3 for input images, and $30 for output images, with an estimated cost of $0.039 (approximately ¥0.28) per generated image [10][11]. Editing Capabilities - The model emphasizes maintaining character consistency across different images, allowing users to edit photos of themselves or familiar individuals without noticeable discrepancies [16]. - Users can upload a photo and specify modifications, enabling unique personal styles while keeping the essence of the original image [16]. - Various functionalities include changing outfits or scenes, merging multiple photos into a new scene, and applying styles from one image to another [17][21][23]. Performance and Rankings - Upon launch, gemini-2.5-flash-image-preview quickly rose to the top of the Artificial Analysis image editing leaderboard with an ELO score of 1212 [37]. - In the text-to-image and image editing categories, the model has become a champion in the LM Arena rankings, showcasing its competitive edge [40][42]. - The model demonstrates significant advantages in character consistency, creativity, and environmental rendering, while GPT-4o leads in stylization [42].
谷歌nano banana正式上线:单图成本不到3毛钱,比OpenAI便宜95%