Workflow
Qwen-Image 模型上线基石智算,快来体验超强文本渲染能力
Sou Hu Cai Jing·2025-08-14 15:48

Core Insights - Qwen-Image, the first text-to-image foundational model from the Qwen series, has been launched by Qiyun Technology's AI computing cloud, CoresHub, featuring 20 billion parameters and developed by Alibaba's Tongyi Qianwen team [1] - The model excels in complex text rendering, precise image editing, multi-line layout, paragraph-level generation, and detail depiction, making it particularly effective in poster design scenarios [1] Model Highlights - Exceptional text rendering capabilities: Qwen-Image demonstrates outstanding performance in complex text generation and rendering, supporting multi-line typesetting, paragraph-level layout, and fine-grained detail presentation in both English and Chinese [2] - Consistency in image editing: Leveraging enhanced multi-task training paradigms, Qwen-Image can accurately modify target areas during image editing while maintaining overall visual consistency and semantic coherence [2] - Industry-leading performance: Multiple public benchmark test results indicate that Qwen-Image has achieved state-of-the-art (SOTA) results in various image generation and editing tasks, validating its comprehensive strength [2] Usage Steps - Users can log into CoresHub, navigate to the model plaza, select the Qwen-Image model, and click on model deployment [3] - The model can be deployed by selecting a single card 4090D resource type, and after successful deployment, users can copy the external link to open in a browser [4] - Once the Comfy UI page loads successfully, users can select the Qwen-Image template and input their prompt [6] Effect Demonstration - Various prompts showcase the capabilities of Qwen-Image, including imaginative scenarios such as a Shiba Inu wearing a cowboy hat at a bar, a cotton candy castle in the clouds, and a retro arcade with a pixel-style game machine [9][11][12][13]