Google's "Banana" Model Tops the Charts Overnight! It Beats GPT-4o and FLUX to Claim the AI Image Crown

Core Insights
- Google has launched Gemini 2.5 Flash Image, its most advanced image generation and editing model, with an emphasis on image editing [2][4]
- The model can blend multiple images into a single one while maintaining character consistency, and it supports targeted modifications through natural language [2][4]
- Gemini 2.5 Flash Image ranks first in both the text-to-image and image editing categories, with a score of 1362, leading the second-place model by nearly 15% [7]

Image Editing Capabilities
- The model can modify backgrounds, remove stains, change poses, and colorize black-and-white photos using natural language commands [20][24]
- It maintains high character consistency, so users can place the same character in different environments without altering their appearance [10][18]
- Users can upload a selfie and generate images in various historical styles while preserving their likeness [10][21]

Integration with Other Models
- Gemini 2.5 Flash Image works well with video generation models such as Google Veo 3, enabling the creation of rich video effects [4][34]
- The model has been used in advertising, where it generated promotional images that needed fewer adjustments than those from other platforms [32]

Pricing and Accessibility
- The model is available to developers through the Gemini API, Google AI Studio, and Vertex AI, priced at $30 per million output tokens, with each image costing approximately $0.039 [9]
- Google has updated AI Studio's "Build mode" to make it easier to develop applications with Gemini 2.5 Flash Image [9]

Use Cases and Applications
- The model's ability to merge multiple images is particularly useful in e-commerce, allowing businesses to create promotional images that show different products in a single scene [30]
- It can also generate creative images by combining elements from different photos, enhancing visual impact [30]

Market Position and Future Outlook
- Gemini 2.5 Flash Image's advanced editing capabilities position it as a significant productivity tool in sectors such as e-commerce and entertainment [36]
- Ongoing work on image editing models at various companies points to a competitive landscape worth monitoring [36]
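
The two pricing figures quoted above ($30 per million output tokens and roughly $0.039 per image) can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names and the batch size of 1,000 images are assumptions introduced here, not part of any Google API, and because the per-image price is a rounded figure, the derived token count is approximate.

```python
# Price figures quoted in the summary; the per-image cost is a rounded value.
PRICE_PER_MILLION_OUTPUT_TOKENS_USD = 30.0
APPROX_COST_PER_IMAGE_USD = 0.039


def implied_tokens_per_image(cost_per_image_usd, price_per_million_usd):
    """Output tokens per image implied by the two quoted prices."""
    return cost_per_image_usd / price_per_million_usd * 1_000_000


def estimate_batch_cost_usd(num_images, tokens_per_image, price_per_million_usd):
    """Estimated output-token cost in USD for generating num_images images."""
    return num_images * tokens_per_image * price_per_million_usd / 1_000_000


tokens = implied_tokens_per_image(
    APPROX_COST_PER_IMAGE_USD, PRICE_PER_MILLION_OUTPUT_TOKENS_USD
)
print(f"Implied output tokens per image: about {tokens:.0f}")

# Hypothetical batch size, chosen only for illustration.
batch_cost = estimate_batch_cost_usd(1000, tokens, PRICE_PER_MILLION_OUTPUT_TOKENS_USD)
print(f"Estimated cost for 1,000 images: about ${batch_cost:.2f}")
```

Under these figures, each image corresponds to roughly 1,300 output tokens, and a batch of 1,000 images would cost about $39 in output tokens. Input tokens and any other charges are not covered by this estimate.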