Sora 2 Pro
Search documents
Alibaba-backed PixVerse launches real-time AI video tool, as Chinese rivals race past OpenAI on speed and cost
CNBC· 2026-01-13 14:00
Core Insights - PixVerse, an Alibaba-backed startup, has launched an AI tool for real-time, interactive video creation, allowing users to direct video content as it is generated [1][2] - The company aims to innovate business models by enabling users to influence narratives in micro-dramas or video games without predefined storylines [2] - PixVerse has raised over $60 million in funding, with a significant portion coming from international investors, and is nearing another funding round [3] Company Overview - Founded in 2023, PixVerse has quickly gained traction, surpassing 16 million monthly active users as of October [9] - The company aims to double its workforce to nearly 200 employees by the end of the year and targets 200 million registered users in the first half of the year [10] - PixVerse reported an estimated annual recurring revenue of $40 million in October [12] Industry Context - The AI video generation market is predominantly led by Chinese companies, which offer faster generation speeds and lower costs compared to competitors like OpenAI's Sora 2 Pro [5][7] - Chinese firms are focusing on scalable, low-cost production tools, contrasting with the more simplistic offerings from U.S. products [11] - The competitive landscape includes other players like Kuaishou's Kling, which generated nearly $100 million in revenue in the first three quarters of 2025 [13] Technological Development - PixVerse's tool aims to eliminate waiting times in video creation, reshaping user interaction with AI-generated content [9] - The company prioritizes technology development over immediate commercialization, claiming sufficient funding for a decade of operations [13] - Concerns about the quality of AI-generated content are acknowledged, with comparisons made to the early years of computer graphics, suggesting that quality will improve over time [14]
AI初创公司Runway推出影片生成模型Gen 4.5;字节Seed发布GR-RL,首次实现真机强化学习穿鞋带丨AIGC日报
创业邦· 2025-12-03 00:08
Group 1 - Keling AI officially launched its new product "Keling O1," which integrates multi-modal inputs such as text, video, images, and subjects into a comprehensive engine, addressing consistency issues in AI video generation for applications in film, self-media, and e-commerce [2] - OpenAI is reportedly considering embedding advertisements in ChatGPT, with recent Android test versions containing code labeled as "featured ads," indicating a shift towards personalized advertising based on user interactions [2] - ByteDance's Seed team released GR-RL, achieving a significant improvement in the success rate of a shoe-lacing task from 45.7% to 83.3%, marking a notable advancement in reinforcement learning for fine manipulation tasks [2] Group 2 - AI startup Runway introduced its latest film generation model Gen 4.5, which outperformed Google and OpenAI in third-party evaluations, showcasing its ability to generate high-quality videos based on textual instructions [3]
刚刚,霸榜神秘视频模型身份揭晓,原来它就是「David」
机器之心· 2025-12-02 00:17
Core Insights - Runway's Gen-4.5 has emerged as the leading state-of-the-art (SOTA) video generation model, setting new industry standards in motion quality, prompt adherence, and visual realism [1][3][8] Model Performance - Gen-4.5 has achieved an ELO Score of 1247, surpassing competitors like Veo 3/3.1, Kling 2.5, and Sora 2 Pro, showcasing unprecedented visual realism and creative control capabilities [3][6][8] - The model maintains speed and efficiency while delivering significant quality improvements, making advanced video generation accessible to creators of various scales [8][20] Key Features - **Precise Prompt Adherence**: Gen-4.5 demonstrates exceptional physical accuracy and visual detail, accurately portraying object motion, fluid dynamics, and intricate surface details [11][12] - **Expressive Characters**: The model can depict nuanced emotions and lifelike facial details, enhancing character representation [14] - **Stylized Control and Visual Consistency**: It supports a wide range of aesthetic styles, from photorealism to stylized animation, while maintaining a coherent visual language [16][18] Deployment and Limitations - Gen-4.5 is built on NVIDIA architecture, optimizing training efficiency and inference speed through collaboration with NVIDIA [20] - Despite its advancements, Gen-4.5 exhibits common limitations found in video generation models, such as causal reasoning issues and object permanence challenges [21][22]
Runway rolls out new AI video model that beats Google, OpenAI in key benchmark
CNBC· 2025-12-01 14:05
Core Insights - Runway has launched Gen 4.5, a new video model that surpasses similar offerings from Google and OpenAI in independent benchmarks [1][2] - The model excels in generating high-definition videos from written prompts, demonstrating strong understanding of physics, human motion, camera movements, and cause and effect [1] - Runway's Gen 4.5 currently ranks first on the Video Arena leaderboard, outperforming Google's Veo 3 in second place and OpenAI's Sora 2 Pro in seventh place [2] Company Performance - Runway's CEO Cristóbal Valenzuela highlighted the achievement of competing against trillion-dollar companies with a relatively small team of 100 people, emphasizing focus and diligence as key factors for success [3]
刚刚,神秘模型登顶视频生成榜,又是个中国模型?
机器之心· 2025-11-28 08:05
Core Viewpoint - The article discusses the emergence of a new AI video model named Whisper Thunder (aka David), which has surpassed existing models in the Artificial Analysis video leaderboard, indicating a significant advancement in AI video generation technology [1]. Group 1: Model Performance - Whisper Thunder ranks first on the Artificial Analysis leaderboard with an ELO score of 1,247, outperforming Veo 3 (1,226) and Kling 2.5 Turbo (1,225) [2]. - The model's performance is characterized by a fixed duration of 8 seconds for generated videos, with noticeable motion dynamics [3]. - Users have reported a decrease in the model's appearance frequency, suggesting that it may require multiple refreshes to encounter [3]. Group 2: Model Origin and Characteristics - There is speculation among users that Whisper Thunder may originate from China, based on its generation effects and aesthetic tendencies [4]. - The model has demonstrated impressive capabilities, although some users noted minor generation flaws, particularly during high-motion scenes [11][13]. Group 3: Example Prompts - Several prompts illustrate the model's versatility, including scenes of construction, emotional anime performances, and serene landscapes, showcasing its ability to create diverse and engaging visual narratives [5][6][7][8][9][10][12].
“杀死每家AI初创、造超级OS”?奥特曼的野望惊现缺口:资深人士曝出三大瓶颈
AI前线· 2025-10-07 04:56
Core Insights - OpenAI's CEO Sam Altman announced significant updates at the OpenAI DevDay 2025, focusing on AgentKit, Codex, Apps SDK preview, and new APIs [2][3] Group 1: AgentKit - AgentKit is a comprehensive toolkit for developers and enterprises to build, deploy, and optimize intelligent workflows, significantly reducing the time required for integration and development [2][5] - The tool allows developers to create multi-agent workflows visually, manage data connections through a central platform, and embed customizable chat-based interactions [5][8] - Companies like Ramp and LY Corporation have reported drastic reductions in development time, with Ramp completing a procurement agent in hours instead of months, and LY Corporation creating a workflow in under two hours [7][8] Group 2: Codex - Codex has seen a tenfold increase in daily usage since early August, becoming integral to OpenAI's development processes, with a 70% increase in weekly pull requests [11][12] - The tool is now fully available for all coding scenarios, enhancing productivity for developers and allowing even children to utilize its capabilities [11][12] Group 3: Apps SDK - The Apps SDK is now in preview, enabling developers to build and test applications that integrate directly with ChatGPT, with support for various applications like Booking.com and Canva [12][13] - This move positions ChatGPT as a potential new operating system, aiming to be the default interface for users interacting with various applications [13] Group 4: New APIs - Three new APIs were launched, including the powerful reasoning model GPT-5 Pro, which allocates more "thinking time" for complex tasks [14][15] - The Video API introduces Sora 2 and Sora 2 Pro, allowing developers to create and edit short videos, enhancing multimedia capabilities [18][19] - New image and voice models have been introduced, offering cost-effective solutions while maintaining high quality [20]
X @Sam Altman
Sam Altman· 2025-09-30 17:09
I did not expect such fun dynamics to emerge from being able to "put yourself and your friends in videos" but I encourage you to check it out!ChatGPT Pro subscribers can generate with Sora 2 Pro.Longer post coming in a minute. ...