可灵3.0 Omni
Tencent Research Institute AI Express 20260210
Tencent Research Institute · 2026-02-09 16:03
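https://mp.weixin.qq.com/s/vPp0aFcc1QJZ2l0D4qFH8A
Generative AI
I. Hands-on with the mystery model Pony Alpha: Opus-level intelligence, an architect's mindset
1. Pony Alpha has gone viral on OpenRouter. It arrived with no launch event and no paper, yet its strong coding ability has sparked heated discussion among developers; one user coded with it for 3 hours straight and produced a playable Pokemon Ruby.
2. Hands-on results are impressive: it can recreate Stardew Valley from scratch, independently handling the full workflow from requirements analysis and architecture design through feature implementation, which demonstrates system-level engineering understanding and long-horizon reasoning.
3. The model's origin remains a mystery. Some guess it is Anthropic Sonnet 5, DeepSeek-V4, or Zhipu GLM-5; if it comes from a Chinese vendor, that would mean Chinese models have entered a new stage in high-end programming.
II. Xiaohongshu is internally testing OpenStoryline, a conversation-driven AI video editing app
1. Xiaohongshu is developing the AI video editing app OpenStoryline, which uses a "non-linear editing + conversation-driven" mode: users upload images and complete the video edit through natural language.
2. Technically, it uses the open-source DeepSeek and Qwen 3 models, combined with Xiaohongshu's own dots.lm text LLM and FireRedASR audio model for ecosystem adaptation.
3. Xiaohongshu recently ...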
Hands-on with 可灵 3.0: a director's era that belongs to everyone.
数字生命卡兹克 · 2026-02-05 02:23
Core Viewpoint
- The article discusses the upgrade of the AI video generation tool 可灵 (Kling) from version 2.0 to 3.0, highlighting its enhanced video production capabilities, particularly in scene segmentation and language processing.

Group 1: Video Generation Capabilities
- 可灵 3.0 lets users create videos with a variety of scene cuts and camera movements from simple prompts [3][7].
- The tool can generate videos from 3 to 15 seconds long, with options for both intelligent and custom scene segmentation [8][16].
- Users can create compelling narratives with minimal input, since the AI autonomously fills in details from basic instructions [19][20].

Group 2: Scene Segmentation
- Intelligent scene segmentation takes a single prompt and returns a series of automatically generated scenes that follow the narrative [8][19].
- Custom scene segmentation gives users detailed control over each shot, enabling complex video sequences [16][17].
- The tool handles various cinematic techniques, including reverse shots, which strengthens the storytelling [19][24].

Group 3: Language Processing
- 可灵 3.0 can generate multilingual content that is integrated into the video narrative [31][39].
- The tool can create educational videos that work language learning into the story in a creative way, making the learning process more engaging [33][36].
- Language capabilities can be combined with scene segmentation to produce videos in which characters speak different languages in context [41].

Group 4: Omni Model
- The 可灵 3.0 Omni model supports video editing and modification, which distinguishes it from the standard version's focus on video generation [42][45].
- Users can replace characters in existing video clips while preserving the original action and context [44][49].
- Both 可灵 3.0 and 3.0 Omni support extracting audio and visual elements from previous works, which makes video production more efficient [45][51].

Group 5: Future Implications
- The upgrade to 可灵 3.0 is a comprehensive enhancement of AI video production that could open video creation to a broader audience [52].
- The combination of scene segmentation and editing capabilities is expected to significantly boost productivity in AI video creation [52].
- The article suggests that AI video production may be heading toward an era in which everyone can act as a director, with a much simpler creative process [52].