Workflow
Sora 2 Pro
icon
Search documents
AI初创公司Runway推出影片生成模型Gen 4.5;字节Seed发布GR-RL,首次实现真机强化学习穿鞋带丨AIGC日报
创业邦· 2025-12-03 00:08
Group 1 - Keling AI officially launched its new product "Keling O1," which integrates multi-modal inputs such as text, video, images, and subjects into a comprehensive engine, addressing consistency issues in AI video generation for applications in film, self-media, and e-commerce [2] - OpenAI is reportedly considering embedding advertisements in ChatGPT, with recent Android test versions containing code labeled as "featured ads," indicating a shift towards personalized advertising based on user interactions [2] - ByteDance's Seed team released GR-RL, achieving a significant improvement in the success rate of a shoe-lacing task from 45.7% to 83.3%, marking a notable advancement in reinforcement learning for fine manipulation tasks [2] Group 2 - AI startup Runway introduced its latest film generation model Gen 4.5, which outperformed Google and OpenAI in third-party evaluations, showcasing its ability to generate high-quality videos based on textual instructions [3]
刚刚,霸榜神秘视频模型身份揭晓,原来它就是「David」
机器之心· 2025-12-02 00:17
Core Insights - Runway's Gen-4.5 has emerged as the leading state-of-the-art (SOTA) video generation model, setting new industry standards in motion quality, prompt adherence, and visual realism [1][3][8] Model Performance - Gen-4.5 has achieved an ELO Score of 1247, surpassing competitors like Veo 3/3.1, Kling 2.5, and Sora 2 Pro, showcasing unprecedented visual realism and creative control capabilities [3][6][8] - The model maintains speed and efficiency while delivering significant quality improvements, making advanced video generation accessible to creators of various scales [8][20] Key Features - **Precise Prompt Adherence**: Gen-4.5 demonstrates exceptional physical accuracy and visual detail, accurately portraying object motion, fluid dynamics, and intricate surface details [11][12] - **Expressive Characters**: The model can depict nuanced emotions and lifelike facial details, enhancing character representation [14] - **Stylized Control and Visual Consistency**: It supports a wide range of aesthetic styles, from photorealism to stylized animation, while maintaining a coherent visual language [16][18] Deployment and Limitations - Gen-4.5 is built on NVIDIA architecture, optimizing training efficiency and inference speed through collaboration with NVIDIA [20] - Despite its advancements, Gen-4.5 exhibits common limitations found in video generation models, such as causal reasoning issues and object permanence challenges [21][22]
Runway rolls out new AI video model that beats Google, OpenAI in key benchmark
CNBC· 2025-12-01 14:05
Artificial intelligence startup Runway on Monday announced Gen 4.5, a new video model that outperforms similar models from Google and OpenAI in an independent benchmark.Gen 4.5 allows users to generate high-definition videos based on written prompts that describe the motion and action they want. Runway said the model is good at understanding physics, human motion, camera movements and cause and effect.The model holds the No. 1 spot on the Video Arena leaderboard, which is maintained by the independent AI be ...
刚刚,神秘模型登顶视频生成榜,又是个中国模型?
机器之心· 2025-11-28 08:05
机器之心报道 机器之心编辑部 刚刚,一个名为 Whisper Thunder (aka) David 的神秘模型登上了 Artificial Analysis 视频榜榜首,超越了 Veo 3、Veo 3.1、Kling 2.5 以及 Sora 2 Pro 等目前市面上所有公开的 AI 视频模型。 | Current models | | All models | All Open weights Global Leaderboard | | Personal Leaderboard | | More info 1-> | | --- | --- | --- | --- | --- | --- | --- | --- | | 11 | Creator TJ | | Model ↑↓ | ELO JT | 95% Cl | Appearances TJ | Release Date | | 1 | | | Whisper Thunder (aka) David | 1,247 | -9/+10 | 7,411 | l | | 2 | G Google | | Veo 3 (No Audio) | 1,226 | ...
“杀死每家AI初创、造超级OS”?奥特曼的野望惊现缺口:资深人士曝出三大瓶颈
AI前线· 2025-10-07 04:56
Core Insights - OpenAI's CEO Sam Altman announced significant updates at the OpenAI DevDay 2025, focusing on AgentKit, Codex, Apps SDK preview, and new APIs [2][3] Group 1: AgentKit - AgentKit is a comprehensive toolkit for developers and enterprises to build, deploy, and optimize intelligent workflows, significantly reducing the time required for integration and development [2][5] - The tool allows developers to create multi-agent workflows visually, manage data connections through a central platform, and embed customizable chat-based interactions [5][8] - Companies like Ramp and LY Corporation have reported drastic reductions in development time, with Ramp completing a procurement agent in hours instead of months, and LY Corporation creating a workflow in under two hours [7][8] Group 2: Codex - Codex has seen a tenfold increase in daily usage since early August, becoming integral to OpenAI's development processes, with a 70% increase in weekly pull requests [11][12] - The tool is now fully available for all coding scenarios, enhancing productivity for developers and allowing even children to utilize its capabilities [11][12] Group 3: Apps SDK - The Apps SDK is now in preview, enabling developers to build and test applications that integrate directly with ChatGPT, with support for various applications like Booking.com and Canva [12][13] - This move positions ChatGPT as a potential new operating system, aiming to be the default interface for users interacting with various applications [13] Group 4: New APIs - Three new APIs were launched, including the powerful reasoning model GPT-5 Pro, which allocates more "thinking time" for complex tasks [14][15] - The Video API introduces Sora 2 and Sora 2 Pro, allowing developers to create and edit short videos, enhancing multimedia capabilities [18][19] - New image and voice models have been introduced, offering cost-effective solutions while maintaining high quality [20]
X @Sam Altman
Sam Altman· 2025-09-30 17:09
I did not expect such fun dynamics to emerge from being able to "put yourself and your friends in videos" but I encourage you to check it out!ChatGPT Pro subscribers can generate with Sora 2 Pro.Longer post coming in a minute. ...