Vidu

Search documents
实测Vidu Q1参考生功能,看到诸葛亮丘吉尔拿破仑在长城拍照留念
机器之心· 2025-07-11 08:27
机器之心报道 看到这里,大概就可以看出 Vidu Q1 参考生功能的不寻常之处了。 编辑:Youli 这次真的不一样,遇到了「想象力的神」! 以前常说「要把自己活成一支队伍」,如今感谢 AI,真的实现了。 最近,生数科技旗下 AI 视频模型 Vidu Q1 推出参考生功能,极大简化传统内容生产流程,真正实现「一个人就是一个剧组」! 首先,我们来看一个视频: 这几个人物形象大家应该都很熟悉。 摇着羽扇、说着「想不到世间还有如此厚颜无耻之人」出现在各大鬼畜视频中的诸葛亮,英国铁血首相丘吉尔,以及战绩可查的拿破仑,如今他们跨越时空,围 坐在会议室中密切交谈,实现「世纪大会晤」! 如果用常规的 AI 图生视频来做的话,一般要经过写脚本、文生图 / P 图 / 融图、图片生成、图生视频、成片等步骤,但实际上,这里只用了三张图片和 Vidu Q1 的 参考生功能! 就像把大象放进冰箱只需要三步一样,这里也只需要三个步骤:找到上传照片、写提示词、成片。 更炫技的操作是,X 网友 Alex,她是一名艺术家兼程序员,在她的操作下,1989 年版本的蝙蝠侠与 1993 年版的侏罗纪公园霸王龙,不仅同框出现,还上演激烈 「对打」, ...
Methode Electronics, Ultragenyx Pharmaceutical And Other Big Stocks Moving Lower In Thursday's Pre-Market Session
Benzinga· 2025-07-10 12:00
U.S. stock futures were lower this morning, with the Dow futures falling around 100 points on Thursday.Shares of Methode Electronics, Inc. MEI fell sharply in today's pre-market trading after the company reported a fourth-quarter adjusted EPS miss.Methode Electronics posted adjusted loss of 77 cents per share, missing market estimates of earnings of 4 cents per share, according to data from Benzinga Pro. The company' sales came in at $257.10 million versus estimates of $232.87 million.Methode Electronics sh ...
多个产品跻身第一阵营,AI出海成中企出海新热点
Bei Jing Ri Bao Ke Hu Duan· 2025-07-10 08:07
互联网、新零售、游戏、教育等企业也在基于大模型升级服务、加速出海。昨天,在线教育平台作业帮 也分享了其面向海外的产品Question.AI 的成绩单:日活跃用户量2个月突破十万、6个月突破百万。 记者注意到,随着中国云基础设施和中国大模型的不断成熟,中国AI得以在海外快速落地。一方面, 一些新兴AI原生企业自创办起就开始为全球用户提供多样化的AI应用;另一方面,传统企业通过接入 通义等大模型升级自身产品,加速开拓海外市场。 以北京为中心,AI出海正成为中国企业出海新趋势。记者从7月9日举行的2025阿里云中企出海峰会北 京场获悉,生数科技Vidu、作业帮Question.AI、奇点星宇LiblibAI等头部AI应用已与阿里云合作,在海 外市场跻身第一阵营。 只需上传两张服装的正面照片、一张环境图,再给出"一个角色面对镜头自然地摆一些pose"这样的一句 话提示词,几秒钟后,真人试穿服装的视频就完成了从无到有的AI生成。这只是基于国产视频生成大 模型Vidu的诸多应用场景之一。生数科技商业化副总裁王川介绍,借助阿里云AI基础设施,旗下视频生 成应用Vidu已覆盖200多个国家与地区,B(企业)端服务客户数量及调 ...
腾讯研究院AI速递 20250710
腾讯研究院· 2025-07-09 14:49
Group 1: Veo 3 Upgrade - The Google Veo 3 upgrade allows audio and video generation from a single image, maintaining high consistency across multiple angles [1] - The new feature is implemented through the Flow platform's "Frames to Video" option, enhancing camera movement capabilities, although the Gemini Veo3 entry is currently unavailable [1] - User tests indicate natural expressions and effective performances, marking a significant breakthrough in AI storytelling applicable in advertising and animation [1] Group 2: Hugging Face 3B Model - Hugging Face has released the open-source 3B parameter model SmolLM3, outperforming Llama-3.2-3B and Qwen2.5-3B, supporting a 128K context window and six languages [2] - The model features a dual-mode system allowing users to switch between deep thinking and non-thinking modes [2] - It employs a three-stage mixed training strategy, trained on 11.2 trillion tokens, with all technical details, including architecture and data mixing methods, made available [2] Group 3: Kunlun Wanwei Skywork-R1V 3.0 - Kunlun Wanwei has open-sourced the Skywork-R1V 3.0 multimodal model, achieving a score of 142 in high school mathematics and 76 in MMMU evaluation, surpassing some closed-source models [3] - The model utilizes a reinforcement learning strategy (GRPO) and key entropy-driven mechanisms, achieving high performance with only 12,000 supervised samples and 13,000 reinforcement learning samples [3] - It excels in physical reasoning, logical reasoning, and mathematical problem-solving, setting a new performance benchmark for open-source models and demonstrating cross-disciplinary generalization capabilities [3] Group 4: Vidu Q1 Video Creation - Vidu Q1's multi-reference video feature allows users to upload up to seven reference images, enabling strong character consistency and zero storyboard video generation [4] - Users can combine multiple subjects with simple prompts, with clarity upgraded to 1080P, and support for character material storage for repeated use [5] - Test results show it is suitable for creating multi-character animation trailers, supporting frame extraction and quality enhancement, reducing video production costs to less than 0.9 yuan per video [5] Group 5: VIVO BlueLM-2.5-3B Model - VIVO has launched the BlueLM-2.5-3B edge multimodal model, which excels in over 20 evaluations and supports GUI interface understanding [6] - The model allows flexible switching between long and short thinking modes, introducing a thinking budget control mechanism to optimize reasoning depth and computational cost [6] - It employs a sophisticated structure (ViT+Adapter+LLM) and a four-stage pre-training strategy, enhancing efficiency and mitigating the text capability forgetting issue in multimodal models [6] Group 6: DeepSeek-R1 System - The X-Masters system, developed by Shanghai Jiao Tong University and DeepMind Technology, has achieved a score of 32.1 in the "Human Last Exam" (HLE), surpassing OpenAI and Google [7] - The system is built on the DeepSeek-R1 model, enabling smooth transitions between internal reasoning and external tool usage, using code as an interactive language [7] - X-Masters employs a decentralized-stacked multi-agent workflow, enhancing reasoning breadth and depth through collaboration among solvers, critics, rewriters, and selectors, with the solution fully open-sourced [7] Group 7: Zhihui Jun's Acquisition - Zhihui Jun's Zhiyuan Robot has acquired control of the listed company Shuangwei New Materials for 2.1 billion yuan, aiming for a 63.62%-66.99% stake [8] - Following the acquisition, Shuangwei New Materials' stock resumed trading with a limit-up, reaching a market value of 3.77 billion yuan, with the actual controller changing to Zhiyuan CEO Deng Taihua and core team members including "Zhihui Jun" Peng Zhihui [8] - This acquisition, conducted through "agreement transfer + active invitation," is seen as a landmark case for new productivity enterprises in A-shares following the implementation of national policies [8] Group 8: AI Model Usage Trends - In the first half of 2025, the Gemini series models captured nearly half of the large model API market, with Google leading at 43.1%, followed by DeepSeek and Anthropic at 19.6% and 18.4% respectively [9] - DeepSeek V3 has maintained a high user retention rate since its launch, ranking among the top five in usage, while OpenAI's model usage has fluctuated significantly [9] - The competitive landscape shows differentiation: Claude-Sonnet-4 leads in programming (44.5%), Gemini-2.0-Flash excels in translation, GPT-4o leads in marketing (32.5%), and role-playing remains highly fragmented [9] Group 9: AI User Trends - A report by Menlo Ventures indicates that there are 1.8 billion AI users globally, with a low paid user rate of only 3%, and a high student usage rate of 85%, while parents are becoming heavy users [10] - AI is primarily used for email writing (19%), researching topics of interest (18%), and managing to-do lists (18%), with no single task dependency exceeding one-fifth [10] - The next 18-24 months are expected to see six major trends in AI: rise of vertical tools, complete process automation, multi-person collaboration, explosion of voice AI, physical AI in households, and diversification of business models [10]
Altimmune Announces Initiation of RESTORE Phase 2 Trial Evaluating the Efficacy and Safety of Pemvidutide in Alcohol-Associated Liver Disease (ALD)
Globenewswire· 2025-07-09 11:30
"Of the 28 million Americans with AUD, over 6 million have progressed to ALD, a condition for which there are no approved treatments and few in development," said Dr. Loomba. "Alcohol-related liver mortality is highest in patients with comorbid obesity, highlighting the urgent need for a liver-directed therapy that can also drive weight loss. The robust reductions in body weight, liver fat and VCTE and the low rates of adverse event discontinuation in the recently completed IMPACT Phase 2b MASH trial suppor ...
生数科技视频模型Vidu Q1推出参考生功能,重构传统视频生产方式
Zheng Quan Shi Bao Wang· 2025-07-08 13:45
Vidu Q1参考生直接跳过中间复杂度较高的分镜制作环节,仅需上传人物、道具、场景等参考图,Vidu Q1基于参考生功能对于人物、场景、道具等元素的深层理解和各元素之间的互动关系,即可直接将多 个参考元素融合为一段视频素材,真正实现零分镜生成。 相较于文生视频的不可控和图生视频对分镜的重度依赖,参考生兼具可控性与灵活性的双重优势。不过 更为重要的创新在于,文生视频与图生视频仍是基于传统视频制作方式,而Vidu Q1参考生不只是对于 原有传统制作效率的显著提升,更是打破了固有的传统内容创作方式,打造了AI原生工作流,从参考 图元素到视频素材生成,中间仅需一步,创作门槛大幅降低。 不仅如此,Vidu Q1参考生功能的推出,也给予创作者更多灵活性。上传的人物、道具、场景等素材分 别是创作者强大的演员库、道具库和场景库,作为永不疲惫的"数字演员",组成了庞大且任意调配 的"虚拟剧组"。 创作者可以利用Vidu Q1参考生功能随时调用其中的任意素材,可以是多个人物同一场景,或者同一场 景,不同人物或道具,或者不同场景,同一人物等,将有无数种排列组合,排列组合不同,生成的视频 内容也不同。这无疑提高了素材的可复用性,只需 ...
视频模型赛道“热闹”起来,变现仍是大难题
Huan Qiu Wang· 2025-07-06 02:16
【环球网财经综合报道】近一个月来,视频模型领域似乎迎来了久违的喧嚣。生数科技将其视频模型Vidu更新至可一键生成32秒视频,并支持音视频合成与 4D生成;MiniMax推出海螺Hailuo-02,实现最高1080P、最长10秒的超清视频端到端生成;百度也发布了首个图生视频大模型MuseSteamer,瞄准广告商等 专业视频内容创作者。 尽管AI领域的Agent(智能体)正备受资本追捧,视频模型的热度相对有限。瑞银研报指出,视频模型训练所需的视频语料内容限制,使得该领域的竞争强 度预计不及大语言模型。尽管如此,以大型互联网/科技企业为主导,辅以爱诗科技、生数科技、MiniMax等明星创业公司组成的"战队",正借着基础模型效 率提升的东风,加速产品迭代与商业化探索。 回顾过去,Sora的热度已催生了一波新品,从2024年初的爱诗科技PixVerse到如今的生数科技Vidu、智谱清影、字节跳动PixelDance等,竞争日趋激烈。据 AGI-Eval评测,部分模型如PixVerse-V3等已超越Sora。但与AI应用层的创业热潮相比,视频模型创业仍显克制,主要因为技术成熟度、高昂成本及商业化路 径不清晰等因素。 M ...
视频模型赛道“热闹”起来了,但变现仍不容易
第一财经· 2025-07-05 11:44
2025.07. 05 本文字数:2033,阅读时长大约4分钟 作者 | 第一财经 吕倩 近一个月,多款视频模型新品发布,包括生数科技视频模型Vidu更新至可一键生成32秒视频,支持 音视频合成与4D生成;MiniMax发布海螺Hailuo-02,支持最高1080P、最长10秒的超清视频端到端 生成;百度(9888.HK)发布首个图生视频大模型MuseSteamer,面向包括广告商在内的专业视频内 容创作者。 但在过去几年,这一赛道并不被市场看好。 对比AI领域目前正被资本追捧的Agent(智能体),视频模型热度并不算太高。瑞银(UBS)研报认 为,视频模型领域的竞争不会像大语言模型领域的竞争那样激烈,主要是受视频模型训练所需的视频 语料内容所限。但同时,目前市面上由大厂与明星创业公司组成的战队,正在基础模型效率提高的背 景下,加快产品更新与商业化落地。 | App | Model | Monthly Standard | Monthly | Credits per | Length per | Cost (US$) | | --- | --- | --- | --- | --- | --- | --- | ...
视频模型赛道“热闹”起来了,但变现仍不容易
Di Yi Cai Jing· 2025-07-05 08:19
Core Viewpoint - The video model industry is unlikely to see a dominant player emerge, with multiple companies competing and innovating in the space [1][9]. Group 1: Industry Overview - Recent months have seen the launch of several new video models, including Vidu, Hailuo-02, and MuseSteamer, indicating a growing interest in video generation technology [1]. - Despite the recent updates, the video model sector has not attracted as much market enthusiasm as other AI fields, such as intelligent agents [1]. - UBS research suggests that competition in the video model space will not be as fierce as in large language models due to limitations in video training data [1]. Group 2: Market Dynamics - The video model market is characterized by a mix of large tech companies and emerging startups, with a focus on improving model efficiency and accelerating product commercialization [1][3]. - The complexity of video generation, including the significant data storage requirements compared to text, presents challenges for development and commercialization [4]. - Investment sentiment in the video model sector is cautious, with investors concerned about the gap between cost pressures and monetization opportunities [4]. Group 3: Business Models - Current monetization strategies for video models include API services, subscriptions, advertising, and customized solutions, with B2B models being clearer than B2C [7]. - Companies like Kuaishou and Shensu are offering tiered subscription services, while others focus on API solutions for various industries [7][8]. - Kuaishou's Keling AI achieved an annual recurring revenue (ARR) of over $100 million within ten months of launch, highlighting the potential for revenue generation in this space [7]. Group 4: Market Growth Projections - The global AI video generator market is projected to grow from $614.8 million in 2024 to $2.5629 billion by 2032, with a compound annual growth rate (CAGR) of 20.0% from 2025 to 2032 [8]. - In contrast, the estimated growth rate for large language models is approximately 35.92%, indicating differing growth trajectories between these two sectors [8].
INVESTOR ALERT: Pomerantz Law Firm Investigates Claims On Behalf of Investors of Altimmune, Inc. - ALT
GlobeNewswire News Room· 2025-07-03 14:00
Core Viewpoint - Pomerantz LLP is investigating potential securities fraud or unlawful business practices involving Altimmune, Inc. and its officers or directors following the release of trial results that led to a significant drop in the company's stock price [1][3]. Group 1: Company Overview - Altimmune, Inc. is a publicly traded company on NASDAQ under the ticker symbol ALT [1]. - The company recently announced "topline results from the IMPACT Phase 2b trial of pemvidutide in metabolic dysfunction-associated steatohepatitis (MASH)" [3]. Group 2: Trial Results - The trial results were described as "positive," but the key efficacy endpoint showed fibrosis improvement rates of 31.8% and 34.5% for pemvidutide doses of 1.2 mg and 1.8 mg, respectively, compared to 25.9% for placebo, with differences not being statistically significant [3]. - Following the announcement of these results, Altimmune's stock price fell by $4.10 per share, representing a decline of 53.18%, closing at $3.61 per share on June 26, 2025 [3]. Group 3: Legal Investigation - Pomerantz LLP is conducting an investigation on behalf of investors regarding potential securities fraud or other unlawful practices by Altimmune and its management [1]. - The firm has a long history in corporate, securities, and antitrust class litigation, having recovered numerous multimillion-dollar damages awards for victims of securities fraud [4].