Workflow
Veo 3.1
icon
Search documents
Unbox your imagination with Veo 3.1 🎁
Google· 2025-12-22 17:29
[“Deck the Halls“ remix plays] [neighs] [neighs]. ...
日耗50万亿Token,火山引擎的AI消费品战事
3 6 Ke· 2025-12-19 10:55
Core Insights - The AI market in China is rapidly evolving, with Huoshan Engine emerging as a leading player, particularly in the model-as-a-service (MaaS) sector, where it holds the largest market share domestically and ranks third globally [2][3] - The daily token usage of the Doubao model has surged to over 50 trillion, marking a tenfold increase compared to the previous year [1][4] - The focus for 2025 in the AI market will be on multimodal capabilities and agents, with Huoshan Engine launching several new products centered around these themes [3][6] Market Position and Growth - Huoshan Engine has established itself as a significant force in the AI sector, with projected revenues exceeding 200 billion in 2024, reflecting a growth rate of over 60% [6][23] - The company aims to simplify model usage by integrating multiple capabilities into a single API, contrasting with competitors who offer separate models for different functions [26][27] Product Innovations - The newly launched Seedance 1.5 pro video generation model emphasizes immediate usability, capable of producing synchronized audio-visual content without extensive post-production [8][15] - The model's advancements include improved lip-sync accuracy and enhanced immersion, making it particularly suitable for diverse content creation [13][21] Competitive Landscape - The AI video model market is characterized by rapid iteration, with companies focusing on producing fully publishable works rather than just raw video segments [7][9] - Huoshan Engine's approach to model training and optimization has led to a tenfold increase in inference speed, significantly reducing costs and enhancing performance [31][30] Future Directions - The company is exploring innovative billing models, such as the "AI Savings Plan," which offers tiered discounts to help businesses reduce costs by up to 47% [32][33] - Huoshan Engine is committed to building a comprehensive AI infrastructure that enables businesses to easily adopt advanced AI capabilities, aiming to make AI assistants as ubiquitous as websites and apps [38][39]
日耗50万亿Token,火山引擎的AI消费品战事
36氪· 2025-12-19 10:31
Core Viewpoint - The AI market is rapidly evolving, with major players like Volcano Engine leading the way in model consumption and innovation, particularly in the areas of multi-modal capabilities and AI agents [3][5][51]. Group 1: Market Growth and Trends - As of December, the daily token usage of Doubao model has surpassed 50 trillion, representing a growth of over 10 times compared to the same period last year [3]. - By 2025, the token usage is projected to reach 16.4 trillion, indicating significant growth potential in the AI market [4]. - The competition among cloud vendors for "AI cloud supremacy" is intensifying, with major updates from companies like Google and OpenAI [4]. Group 2: Product Innovations - Volcano Engine has released key products focusing on multi-modal capabilities and AI agents, including the Doubao flagship model 1.8 and the video generation model Seedance 1.5 pro [5][6]. - The Seedance 1.5 pro model emphasizes the ability to produce "publishable complete works," showcasing advancements in video generation technology [10][11]. - The model's improvements in voice and image synchronization have made it a standout in the market, achieving high levels of usability with minimal input [11][18]. Group 3: Business Model and Strategy - Volcano Engine aims to simplify model usage by integrating multiple capabilities into a single API, reducing complexity for clients [38][39]. - The company is focusing on enhancing the efficiency of model training and deployment, with the Seedance 1.5 pro achieving over a 10-fold increase in inference speed [46]. - A new billing model, "AI Savings Plan," has been introduced to help enterprises save up to 47% on costs, reflecting a shift towards value-based pricing [47][48]. Group 4: System Engineering and Infrastructure - The competition in AI infrastructure has shifted from merely comparing model capabilities to a broader system engineering challenge [51]. - Volcano Engine is developing a comprehensive AI infrastructure that includes both the core model (Doubao) and operational tools (AgentKit) to facilitate easier deployment for enterprises [53]. - The goal is to enable every enterprise to have its own AI assistant, akin to having a website or app, supported by a complete ecosystem [54].
2025年人工智能核心产业规模有望破万亿元!科创人工智能ETF华夏(589010) 震荡回调,逢低配置窗口开启
Mei Ri Jing Ji Xin Wen· 2025-12-15 06:29
Group 1 - The core viewpoint of the news highlights the recent performance of the Sci-Tech Innovation Artificial Intelligence ETF (589010), which experienced a 2.14% intraday pullback, reaching around 1.326 yuan, despite structural strengths in key holdings like Xinghuan Technology, which surged by 19.99% [1][2] - The liquidity in the market is robust, with a trading volume exceeding 47 million yuan, indicating active trading conditions [1] - The Chinese artificial intelligence industry is accelerating, with projections indicating that the core industry scale could exceed 1 trillion yuan by 2025, driven by significant growth in large model applications in manufacturing [1][2] Group 2 - OpenAI's upcoming Sora model, set to launch in February 2024, is anticipated to revolutionize the video sector, akin to the GPT-1 moment, with further advancements expected in Sora 2 by September 2025 [2] - Google's recent updates to the Gemini API with Veo 3.1 and Veo 3.1 Fast are expected to enhance audio support and narrative control, contributing to a more realistic user experience [2] - The Sci-Tech Innovation Artificial Intelligence ETF closely tracks the Shanghai Stock Exchange's AI index, covering high-quality enterprises across the entire industry chain, benefiting from high R&D investment and policy support [2]
刚刚,神秘模型登顶视频生成榜,又是个中国模型?
机器之心· 2025-11-28 08:05
机器之心报道 机器之心编辑部 刚刚,一个名为 Whisper Thunder (aka) David 的神秘模型登上了 Artificial Analysis 视频榜榜首,超越了 Veo 3、Veo 3.1、Kling 2.5 以及 Sora 2 Pro 等目前市面上所有公开的 AI 视频模型。 | Current models | | All models | All Open weights Global Leaderboard | | Personal Leaderboard | | More info 1-> | | --- | --- | --- | --- | --- | --- | --- | --- | | 11 | Creator TJ | | Model ↑↓ | ELO JT | 95% Cl | Appearances TJ | Release Date | | 1 | | | Whisper Thunder (aka) David | 1,247 | -9/+10 | 7,411 | l | | 2 | G Google | | Veo 3 (No Audio) | 1,226 | ...
测完Nano Banana Pro的时空重现,我人傻了……
机器之心· 2025-11-26 01:36
Core Viewpoint - The article discusses the capabilities of the Nano Banana Pro, particularly its ability to recreate historical events and scenes based on provided coordinates and optional time, showcasing its potential as a "time machine" [1][9]. Group 1: Capabilities of Nano Banana Pro - Nano Banana Pro can generate realistic images of historical events by using coordinates and time, transforming from a tool that deduces locations from images to one that creates scenes from given data [7][9]. - The AI has demonstrated impressive results, such as accurately depicting the atmosphere of the 2008 Beijing Olympics, although it made notable errors regarding the location of the opening ceremony [9][10]. - In recreating the scene of Emperor Chongzhen's suicide, the AI displayed significant inaccuracies, including anachronistic elements like the Qing dynasty's "dragon flag" [21]. Group 2: User Experience and Limitations - Users have found that while Nano Banana Pro can generate visually appealing images, it often oscillates between impressive and absurd results, indicating instability in its performance [9][19]. - The AI shows confidence in its outputs, failing to correct errors even when prompted by users, which raises questions about its reliability [17][19]. - Despite its limitations, the AI successfully generated a black-and-white image of the Normandy landing, demonstrating an understanding of historical photographic styles [24]. Group 3: Potential Applications - The article suggests various innovative uses for Nano Banana Pro, such as estimating ages, mapping anime characters to real-life personas, and creating unique video content when combined with other technologies [29][34].
非客观人工智能使用指南
3 6 Ke· 2025-11-18 23:15
Core Insights - The article discusses how to maximize the value of AI tools, emphasizing the importance of understanding user patterns and selecting the right AI model based on specific needs [1][3]. Group 1: AI Model Selection - Users have approximately nine choices for advanced AI systems, including Claude by Anthropic, Gemini by Google, ChatGPT by OpenAI, and Grok by xAI, with several free usage options available [3][4]. - For those considering paid accounts, starting with free versions of Anthropic, Google, or OpenAI is recommended before upgrading [4][6]. - The article highlights the differences in capabilities among AI models, such as web search efficiency, image creation, and handling complex tasks, which should guide user selection [4][7]. Group 2: Advanced AI Features - Advanced AI systems require monthly fees ranging from $20 to $200, depending on user needs, with the $20 tier suitable for most users [6][7]. - The article outlines the distinctions between chat models, agent models, and wizard models, recommending agent models for complex tasks due to their stability and performance [9][10]. - Users can choose specific models within systems like ChatGPT, Gemini, and Claude, with options for deeper thinking and extended capabilities [11][13][14]. Group 3: Enhancing AI Output - The article emphasizes the importance of "deep research" mode, which allows AI to conduct extensive web research before answering, significantly improving output quality [16][18]. - Connecting AI to personal data sources, such as emails and calendars, enhances its utility, particularly noted in Claude's capabilities [18]. - Multi-modal input options, including voice and image uploads, are available across various AI platforms, enhancing user interaction [19][20]. Group 4: Future Trends and User Engagement - The article predicts an increase in AI usage, with 10% of the global population currently using AI weekly, suggesting that user familiarity will evolve alongside model improvements [24]. - Users are encouraged to experiment with AI capabilities to develop an intuitive understanding of what these systems can achieve [24]. - The article warns against over-reliance on AI outputs, as even advanced models can produce errors, highlighting the need for critical engagement with AI responses [26].
微软豪赌超级智能?科创人工智能ETF华夏(589010) 早盘承压下行,短线回踩至分时均线下方
Mei Ri Jing Ji Xin Wen· 2025-11-07 02:18
Group 1 - The core viewpoint is that the AI sector is experiencing short-term adjustments, with the Sci-Tech Innovation Artificial Intelligence ETF (589010) down by 1.91% and only 2 out of 30 component stocks showing gains [1] - Microsoft has formed a super-intelligent team within its Microsoft AI division, focusing on affordable AI companions, expert-level medical diagnostics, and clean energy management [1] - The trading volume for the AI sector is approximately 23.94 million yuan, indicating moderate market turnover [1] Group 2 - OpenAI is set to launch its first Sora model in February 2024, marking a significant breakthrough in the video domain, akin to the GPT-1 moment [2] - The Sora 2 model, expected in September 2025, will enhance physical simulation, realism, and controllability, offering a unified audiovisual experience [2] - The Gemini API by Google has released a paid preview of Veo 3.1, which includes significant upgrades for richer audio support and improved narrative control [2]
互联网行业2025年11月投资策略:AI驱动海外巨头三季报亮眼,关注巨额资本开支下ROI表现
Guoxin Securities· 2025-11-04 12:10
Market Review - The Hang Seng Tech Index decreased by 8.6% in October, while the Nasdaq Internet Index remained flat with a monthly increase of 0.6% [11] - The valuation of the Hang Seng Tech Index remained stable with a PE-TTM of 22.85x, at the 29.2% percentile since its inception [16] - The Nasdaq Index also held steady with a PE-TTM of 42.30x, at the 74.09% percentile over the past decade [18] AI Developments - Google released the Veo 3.1 video generation model, enhancing features for content creators [22] - OpenAI's ChatGPT ecosystem reached 800 million weekly active users, marking a significant milestone in AI adoption [27] - Microsoft launched its first self-developed image generation model, MAI-Image-1, entering the top ten in global rankings [30] Industry Dynamics - The domestic gaming revenue in Q3 2025 declined by 4% year-on-year, while the number of approved domestic game licenses remained high [47] - Payment institutions' reserve funds grew by 5% year-on-year in September, indicating a stable financial technology sector [49] - E-commerce platforms reported significant growth during the Double Eleven sales event, with Douyin e-commerce seeing a 500% increase in live sales [53] Investment Strategy - The report suggests focusing on AI-driven companies, recommending Tencent, Alibaba, Kuaishou, Baidu, Meitu, and Tencent Music, which are expected to benefit from improved operational efficiency [3] - The report highlights that domestic companies face less capital expenditure pressure compared to overseas giants, with AI positively impacting their business [3] Key Company Earnings Forecasts - Tencent Holdings is rated "Outperform" with a projected EPS of 23.69 for 2025 and a PE of 24.78 [4] - Alibaba is also rated "Outperform" with a projected EPS of 0.00 for 2025 and a PE of 22.12 [4] - Kuaishou is rated "Outperform" with a projected EPS of 4.07 for 2025 and a PE of 16.88 [4]
Google partners with Ambani’s Reliance to offer free AI Pro access to millions of Jio users in India
Yahoo Finance· 2025-10-30 14:06
Core Insights - Google has partnered with Reliance Industries to offer its AI Pro subscription bundled with Jio 5G plans at no extra cost, aiming to expand its AI presence in emerging markets [1][2] - The partnership will provide eligible Jio users with free access to the AI Pro subscription for 18 months, reflecting a strategic move by U.S. tech firms to tap into India's vast internet market [2][3] Group 1: Partnership Details - The collaboration will initially target users aged 18 to 25, eventually expanding to all Jio subscribers, and includes access to Google's Gemini 2.5 Pro model and other AI tools [4][5] - The total value of the 18-month offer is estimated at ₹35,100 (approximately $396), while the standard monthly cost of Google's AI Pro plan in India is ₹1,950 (around $22) [5] Group 2: Broader AI Strategy - Reliance has also partnered with Google Cloud to enhance access to Tensor Processing Units (TPUs) in India, with Reliance Intelligence acting as a strategic partner for Google Cloud [6] - This partnership aims to develop pre-built AI agents for the Gemini Enterprise platform, further solidifying the collaboration between the two companies [6][7] Group 3: Market Context - India, being the world's most populous nation and the second-largest internet market, is viewed as a critical area for global tech firms to gather data and test AI applications [3] - Reliance's recent initiatives, including a joint venture with Meta, aim to strengthen AI infrastructure in India, showcasing the growing importance of AI in the region [8]