Seaweed

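Video Generation Models: Many Contenders, but the Market Stays Lukewarm
Zhong Guo Jing Ying Bao (China Business Journal) · 2025-06-27 08:17
After OpenAI's Sora became a sensation, video generation models broke into the mainstream, and a series of video models such as Tencent Hunyuan and Kuaishou Kling emerged in China, each with its own strengths. But the video generation industry opened high and has drifted lower: a year on, it remains lukewarm. Industry insiders say one important reason is that the short videos users want to watch mostly feature real human creators, and AI cannot generate that kind of video.

On this point, economist Yu Fenghui told a China Business Journal reporter: "As for the three video generation models Kling, Jimeng, and Hunyuan, each has its own distinctive technical strengths and application scenarios. Kling performs well in image recognition and conversion and suits tasks that require high-quality image processing. Jimeng is known for its strong natural language processing capability; it can generate video content from text descriptions and is especially suited to the creative industries. Hunyuan combines the strengths of the other two and adds more customization options, giving it an edge in flexibility and range of application. So one cannot simply say which is better; the choice should depend on the specific use case."

Fierce competition

Internationally, first comes OpenAI's Sora, of course. Sora can generate high-quality videos up to 60 seconds long and is fairly well balanced across image detail, motion smoothness, and command of cinematic language. Sora is currently tightly bundled with ChatGPT Plus, so users can try it with one click inside a conversation, but because the model is so large, its demand for GPU compute is high, and generation latency ...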
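ByteDance Seed Open-Sources Its First Code Model, Takes Multiple SOTAs at Its Scale, and Proposes a Paradigm of Using Small Models to Manage Data
QbitAI (量子位) · 2025-05-11 04:20
By 克雷西 and 明敏, from Aofeisi
QbitAI | WeChat official account QbitAI

ByteDance Seed has open-sourced a code model for the first time! Seed-Coder, at 8B scale, surpasses Qwen3 and takes multiple SOTAs.

It demonstrates that "with very little human involvement, an LLM can manage code training data by itself." By generating and filtering high-quality training data on its own, a model can substantially improve its code generation ability. This can be seen as an extension of the DeepSeek-R1 strategy of self-generating and filtering training data.

There are three versions: Base, Instruct, and Reasoning. Of these, Instruct performs well in programming, taking SOTA on two benchmarks.

| Model | Size | SWE-bench Verified (Agentless) | SWE-bench Verified (OpenHands) | Multi-SWE-bench mini (Agentless) |
| --- | --- | --- | --- | --- |
| ~8B Models | | | | |
| Yi-Coder-9B-Chat | 9B | 0.0 | 1.6 | 0.0 |
| Llama-3.1-8B-Instruct | 8B | 1.0 | 1.2 | 0.5 |
| Q ...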
Why Do AI Video Tools Look More and More Alike?
36Kr · 2025-05-07 07:50
Core Insights
- The AI video sector has seen a shift in focus from OpenAI's Sora to new players like Keke and Jiemeng, with industry players now prioritizing the reduction of the gap between AI video production and consumption [4][5][6]
- The competition among AI video players is intensifying, with frequent updates and new model releases expected in 2025, indicating a rapid evolution in the industry [4][12][26]
- There is a growing concern among mid-tier AIGC entrepreneurs regarding the commercial viability of AI video, as production costs remain high while client budgets are decreasing [4][16][18]

Group 1: Industry Dynamics
- The AI video landscape is becoming increasingly crowded, with numerous players emerging and competing for market share [23][26]
- The focus of competition has shifted from model parameters to three key dimensions: consistency, usability, and playability [6][13][14]
- Many AI video products are becoming homogenized in terms of functionality, leading to increased competition on quality, cost, and interaction forms [5][16]

Group 2: Technological Advancements
- AI video players are enhancing video generation consistency by improving frame transitions and scene realism, which are critical for quality [9][11]
- Major players are iterating their foundational models regularly, with updates occurring at least every six months to maintain competitive advantage [11][12]
- New features such as dynamic editing capabilities and end-to-end production tools are being developed to improve usability for creators [13][14]

Group 3: Market Challenges
- Despite the proliferation of tools and features, many creators express anxiety over rising production costs and decreasing project budgets [16][18][21]
- The pricing strategies in the AI video market are not leading to significant reductions in costs, with many companies maintaining high prices for advanced models [20][21]
- The complexity of video creation demands a multi-platform approach, as no single company currently meets all needs in the market [27]
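ByteDance's AI Restart: An Independent Organization and a Full-Chain Saturation Attack
LatePost (晚点) · 2025-03-31 11:58
When China's largest internet company meets a new game with a high enough ceiling, could it possibly just give it a try and let it go?

By Wang Yutong and Cheng Manqi | Edited by Cheng Manqi and Huang Junjie

Facing AI, ByteDance is still the same ByteDance: once it sees a promising direction, it doubles down and attacks at saturation, across the board.

A recent example: around the time the agent application Manus broke out, ByteDance already had at least five teams developing different agent products, some of them internal tools. Manus is an agent application that the startup Monica only began beta-testing on March 6.

Last November we wrote in an article: "ByteDance is not the only one in China with very strong product capability and traffic resources. WeChat has not made its move yet." Now Tencent, which holds WeChat, has finally made its move, and in an unexpected way: by integrating DeepSeek across the board.

This has had a more substantive effect on ByteDance. On March 19, Tencent president Liu Chiping (Martin Lau) said on the earnings call that from February to March, Yuanbao's daily active users grew 20-fold, ranking it third among AI applications in China. The top two, which he did not name, are DeepSeek and ByteDance's Doubao.

In only one tenth of the time ByteDance took, and with a far smaller marketing budget, Tencent's user base reached about one fifth of Doubao's.

Among all of China's big tech companies, ByteDance was originally the last to start on large language models. Before OpenAI's ChatGPT launched at the end of 2022, Baidu, Huawei, Alibaba, and Tencent (in order of release) had all already ...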