Workflow
智象未来发布全新自回归图像编辑框架 VAREdit ,0.7 秒完成高保真图像编辑
Ge Long Hui·2025-08-25 06:26

Core Insights - The launch of VAREdit marks a significant breakthrough in image editing technology, being the world's first purely autoregressive image editing model [1] - VAREdit enhances editing speed to 0.7 seconds, facilitating real-time interaction and efficient creation [1] Group 1: Technology and Innovation - VAREdit addresses limitations of diffusion models in image editing, such as imprecise modifications and low efficiency in multi-step iterations [1] - The framework introduces a visual autoregressive (VAR) architecture, defining editing as "next-scale prediction" to achieve precise local modifications while maintaining overall structure [1] - The innovative Scale Alignment Reference (SAR) module effectively resolves scale matching issues, further improving editing quality and efficiency [1] Group 2: Performance Metrics - In authoritative benchmarks EMU-Edit and PIE-Bench, VAREdit outperforms competitors across various metrics, including CLIP and GPT [1] - The VAREdit-8.4B model shows a 41.5% and 30.8% improvement in the GPT-Balance metric compared to ICEdit and UltraEdit, respectively [1] - The lightweight VAREdit-2.2B model can achieve high-fidelity editing of 512×512 images within 0.7 seconds, resulting in multiple speed enhancements [1] Group 3: Future Developments - VAREdit is fully open-sourced on GitHub and Hugging Face platforms, indicating a commitment to community engagement and collaboration [2] - The company plans to explore applications in video editing and multimodal generation, aiming to advance AI image editing into a new era of efficiency, control, and real-time capabilities [2]