美团开源LongCat-Image模型,在文生图与图像编辑核心能力上逼近更大尺寸的头部模型
Xin Lang Cai Jing·2025-12-08 07:24

Core Viewpoint - Meituan's LongCat team has announced the open-source release of its latest LongCat-Image model, which approaches the capabilities of larger models in text-to-image generation and image editing with a parameter scale of 6 billion [1] Group 1: Model Features - The LongCat-Image model features a high-performance architecture design and systematic training strategies, providing developers and the industry with a "high-performance, low-threshold, fully open" option [1] - The model utilizes a shared architecture for text-to-image generation and image editing, combined with a progressive learning strategy [1] Group 2: Performance Metrics - In objective benchmark tests, LongCat-Image achieved leading scores in image editing and Chinese rendering capabilities compared to other evaluated models [1] - The model demonstrated strong competitiveness in text-to-image tasks, as evidenced by its performance in GenEval and DPG-Bench, outperforming both leading open-source and closed-source models [1]

MEITUAN-美团开源LongCat-Image模型,在文生图与图像编辑核心能力上逼近更大尺寸的头部模型 - Reportify