Adobe Lightroom
Search documents
P图新手福音!智能修图Agent一句话精准调用200+专业工具,腾讯混元&厦大出品
量子位· 2025-12-26 04:24
Core Viewpoint - JarvisEvo, developed by Tencent and Xiamen University, is an advanced image editing AI that simulates human expert designers through iterative editing, visual perception, self-evaluation, and self-reflection, aiming to provide a more controllable and professional editing experience compared to traditional software and AI tools [1][3]. Group 1: Challenges in Image Editing - The article identifies two main challenges in achieving a professional-level editing experience: Instruction Hallucination, where existing models struggle to visualize intermediate results and often make factual errors, and Reward Hacking, where models exploit static reward systems to gain high scores without genuinely improving editing quality [4][5]. Group 2: JarvisEvo's Mechanisms - JarvisEvo introduces the iMCoT (Interleaved Multimodal Chain-of-Thought) mechanism, allowing the model to generate new images after each editing step and use visual feedback for subsequent reasoning, breaking the limitations of traditional blind editing [8][9]. - The SEPO (Synergistic Editor-Evaluator Policy Optimization) framework enables JarvisEvo to learn from mistakes by comparing low and high scoring trajectories, thus developing a strong self-correction ability [11][12]. Group 3: System Architecture - The system operates in a four-step process: visual perception and planning, step-by-step execution, self-evaluation, and self-reflection, ensuring precise execution of each operation [18][16]. - The model utilizes two optimization loops: the Editor Policy Optimization loop focuses on improving tool usage for better image quality, while the Evaluator Policy Optimization loop ensures the model's scoring aligns with human aesthetic standards [17][25]. Group 4: Training Framework - JarvisEvo's training consists of three stages: Cold-Start Supervised Fine-Tuning with 150K labeled samples to teach basic skills, SEPO Reinforcement Learning with 20K standard instruction data for autonomous exploration, and Reflection Fine-Tuning with 5K reflection samples to enhance self-correction capabilities [20][22][31]. Group 5: Experimental Results - In evaluations, JarvisEvo achieved a Spearman Rank Correlation Coefficient (SRCC) of 0.7243 and a Pearson Linear Correlation Coefficient (PLCC) of 0.7116, outperforming other models and demonstrating superior alignment with human preferences [36][38]. - The model showed a 44.96% improvement in L1 and L2 metrics compared to commercial models, maintaining original image details while excelling in style and detail presentation [34][40]. Group 6: Future Prospects - The collaborative evolution paradigm of JarvisEvo is expected to extend beyond image editing to areas such as mathematical reasoning, code generation, and long-term planning, with ongoing efforts to enhance its capabilities for complex tasks [44][45].
拒绝「盲修」:JarvisEvo 如何让 Agent 像人类一样拥有「视觉反思」能力?
机器之心· 2025-12-24 03:41
此外,在传统强化学习中经常依赖于静态的奖励模型。随着模型的不断训练,它很容易学会如何「讨好」这个固定的打分器,导致 Reward Hacking —— 即分数很高,但审美并没有真正提升。 为了打破这一僵局, JarvisEvo 应运而生。它不仅仅是一个连接 Adobe Lightroom 的自动化工具使用者,更是一次大胆的探索:探索 Agent 如何通过 「内省」,真正实现自我进化。 在迈向通用人工智能的道路上,我们一直在思考一个问题: 现有的 Image Editing Agent,真的「懂」修图吗? 大多数基于 LLM/VLM 的智能体,本质上更像是一个「盲目的指挥官」。它们能流利地写出修图代码或调用 API,但在按下回车键之前,它们看不见画布 上的变化,也无法像人类设计师那样,盯着屏幕皱眉说:「这张对比度拉太高了,得往回收到一点。」这种感知与决策的割裂,直接导致了「指令幻觉」, 或者说模型在进行盲目的「脑补」。由于缺乏视觉反馈,模型往往凭空想象下一步操作,导致结果与用户的初衷南辕北辙。 核心范式转移: 论文标题: JarvisEvo: Towards a Self-Evolving Photo Edit ...
Intel Gaining Momentum in AI PC Market: Will the Uptrend Persist?
ZACKS· 2025-06-24 14:25
Core Insights - Intel Corporation (INTC) is actively pursuing initiatives to strengthen its position in the AI sector through collaborations with original equipment manufacturers like HP to develop next-generation AI PCs [1][9] - The global AI market is projected to grow from $757.6 billion in 2025 to $3.68 trillion in 2034, with a compound annual growth rate of 19.2%, positioning Intel favorably to capitalize on this trend [5] Group 1: AI Initiatives and Collaborations - Intel is collaborating with HP to identify AI applications that provide significant benefits to end users, optimizing CPU, GPU, and NPU performance for real-world applications [2] - The partnership has led to the development of AI PCs like the EliteBook series, powered by Intel Core Ultra processors, enhancing enterprise functionality [9] Group 2: Performance Enhancements - Intel's AI-optimized software packages have resulted in significant performance improvements, with Microsoft Power BI running 45% faster and Adobe Lightroom running 32% faster compared to previous systems [3] - AI applications like Canvid and Writeup, previously exclusive to Apple users, are now available on Windows PCs powered by Intel AI chips, enhancing organizational efficiency [4] Group 3: Competitive Landscape - Intel faces strong competition from Qualcomm and AMD, with Qualcomm launching the Snapdragon X chip for mid-range AI desktops and laptops, and AMD's Ryzen AI 300 Series gaining industry adoption [6][7] - The competitive landscape is intensifying as OEMs like Dell, ASUS, and Samsung expand collaborations with Qualcomm to develop AI PCs [6] Group 4: Financial Performance and Valuation - Intel's stock has declined by 31% over the past year, contrasting with the industry's growth of 11.2% [8] - The company's shares currently trade at a price/book ratio of 0.87, significantly lower than the industry's 31.65 [10] - Earnings estimates for 2025 and 2026 have seen a decline of 39.58% and 29.36%, respectively, indicating a downward trend in financial forecasts [11]