Visual Creation
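Popular Science | Unknown AI x Hangzhou Yuhang District Association for Science and Technology: AI Prompt Techniques and New Trends in Visual Creation
Unknown AI Research Institute · 2025-10-18 03:02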
Core Insights
- The article covers two AI popularization lectures given in Yuhang District, Hangzhou, by Liu Xueying, a senior expert at the Unknown AI Research Institute. The lectures drew many technology workers and enthusiasts [1].

Group 1: AI Writing Techniques
- The first lecture, "AI Prompt Techniques and Intelligent Writing," covered the core concepts of generative AI, contrasted it with decision-based AI, and emphasized how AI is transforming content creation [3].
- Liu demonstrated a four-step method for writing directive prompts (role, background, task, requirements) and shared concise techniques for applying inferential prompts, showing how both improve efficiency in scenarios such as team-building planning and public science promotion [3].
- In the interactive session, participants worked through the prompt optimization process themselves, and engagement was high [3].

Group 2: AI Visual Content Creation
- The second lecture, "AI Information Integration and Visual Content Creation," focused on practical applications. It walked through the workflow for generating PowerPoint presentations with AI: clarifying the theme, organizing the framework, extracting the content, and polishing the layout [6].
- Liu introduced tools such as Kimi and AiPPT for quickly generating professional presentations, explained how image-generation prompts are structured, and demonstrated image editing, image expansion, and local redrawing [6].
- Live demonstrations of video creation, including animations such as one of a cat, showed how efficiently AI can produce multimedia content [6].

Group 3: Commitment to AI Education
- Both lectures combined theory with practice and received enthusiastic feedback from participants, reflecting the Unknown AI Research Institute's commitment to AI popularization and education [9].
- The institute aims to promote AI technology through high-quality courses. It plans to continue its science popularization activities to help the public embrace the intelligent era and to foster an innovative educational ecosystem [9].
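
The four-step directive prompt method from the first lecture (role, background, task, requirements) can be illustrated with a small sketch. The article names only the four parts; the function name `build_directive_prompt`, the field labels, the line layout, and the sample wording below are all assumptions made for illustration, not the format shown in the lecture.

```python
def build_directive_prompt(role: str, background: str, task: str,
                           requirements: list[str]) -> str:
    """Assemble a prompt from the four parts: role, background, task, requirements.

    The labels and ordering are a hypothetical layout, not the lecture's own.
    """
    lines = [
        f"Role: {role}",
        f"Background: {background}",
        f"Task: {task}",
        "Requirements:",
    ]
    # Each requirement becomes its own bullet so the model can check them one by one.
    lines.extend(f"- {item}" for item in requirements)
    return "\n".join(lines)


# Sample values are invented; team-building planning is one scenario the article mentions.
prompt = build_directive_prompt(
    role="You are an experienced event planner.",
    background="A 20-person engineering team wants a one-day offsite.",
    task="Draft a team-building plan.",
    requirements=["Keep it under 300 words", "Include a schedule"],
)
print(prompt)
```

Keeping the four parts as separate arguments makes it easy to revise one part at a time, for example tightening the requirements, while leaving the rest of the prompt unchanged.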
Tongyi Wanxiang 2.5 Model Series Released: One-Click Image Editing and Video Generation with Background Music
Sina Tech (Xin Lang Ke Ji) · 2025-09-24 05:20
Core Insights
- Alibaba launched the Tongyi Wanxiang Wan2.5 preview model series at the 2025 Apsara Conference (Yunqi Conference) in Hangzhou. The series comprises four models: text-to-video, image-to-video, text-to-image, and image editing [1].
- The Tongyi Wanxiang 2.5 video generation model can create videos with synchronized audio, lowering the barrier to high-quality video production [1].
- The new model extends the maximum length of a generated video from 5 seconds to 10 seconds and supports 1080P HD output at 24 frames per second [1].

Summary by Category

Product Features
- The Tongyi Wanxiang 2.5 model can generate human voices, sound effects, and background music that match the visuals, making video storytelling more vivid [1].
- Instruction following has improved: the model handles complex, continuous changes in video generation tasks and supports one-click effects in image editing tasks [1].
- The model can render Chinese and English text and produce complex layouts, artistic posters, flowcharts, and architecture diagrams, in addition to its image editing capabilities [1].

Performance Metrics
- The Tongyi Wanxiang model family now supports more than 10 visual creation capabilities, including text-to-image, text-to-video, and action generation, and has generated a total of 390 million images and 70 million videos [2].
- Since February 2025, Tongyi Wanxiang has open-sourced more than 20 models, which have been downloaded more than 30 million times across open-source communities and third-party platforms [2].