AI云原生
Search documents
字节跳动发布新一代AI大模型 降价逾六成
智通财经网· 2025-06-11 08:40
Core Insights - ByteDance's cloud service platform Volcano Engine launched the Doubao Model 1.6 with a unified pricing model, significantly reducing costs by 63% compared to previous models [1][2] - The Doubao Model family now includes various models such as Doubao Video Generation Model Seedance 1.0 Pro and Doubao Real-time Voice Model, showcasing a comprehensive and cost-effective solution [1] - The daily token usage of Doubao Model has surged to over 16.4 trillion, marking a 137-fold increase since its launch last year [1] Pricing Strategy - Doubao Model 1.6 introduces a unified pricing model based on input length, with costs set at 0.8 yuan per million tokens for input and 8 yuan per million tokens for output within the 0-32K range [2] - The Seedance 1.0 Pro model offers the lowest industry cost at 0.015 yuan per thousand tokens, equating to just 3.67 yuan for generating a 5-second 1080P video [2] Technological Advancements - The conference highlighted the emergence of autonomous agents capable of reasoning, planning, and task completion, necessitating strong reasoning capabilities, multi-modality, and low costs [2] - ByteDance's executives announced 12 new tools aimed at agent development and application, emphasizing the importance of an "AI Cloud Native" technology stack for enterprise innovation [2]
让AI听懂行业,火山引擎如何拆掉大模型落地的「墙」?
36氪· 2025-06-10 13:34
Core Viewpoint - The article emphasizes that the industrialization of large models is becoming a reality, significantly impacting various sectors and driving the digital transformation of industries [3][4][6]. Group 1: Industrialization of Large Models - The large model trend is accelerating, with significant integration into industries such as finance, automotive, technology, and education [3][5][12]. - By 2024, the usage of large models in China's public cloud reached 114.2 trillion tokens, indicating a shift from early exploration to large-scale implementation [5]. - Major cloud service providers collectively acted in early 2024 to lower the barriers for enterprises to deploy large models, enhancing accessibility [5][10]. Group 2: Trends in Large Model Implementation - Three key trends in the implementation of large models have emerged: 1. Deepening scenarios where value is released from office efficiency to core industry processes [6]. 2. Companies transitioning from passive innovation to actively seeking deployment points based on clear business pain points [7]. 3. Strengthening ecosystem collaboration, with cloud providers becoming crucial enablers for the deployment of large models [9][10]. Group 3: Sector-Specific Applications - In finance, large models are enabling ordinary investors to make more informed investment decisions through tools like the GuoXin Stock Assistant, which utilizes large model capabilities for market analysis [13][15]. - The automotive industry is diversifying its applications of large models, with companies like SAIC Volkswagen and BMW implementing AI-driven solutions for enhanced user interaction and marketing [16][19][20]. - In education, institutions like Nankai University and Zhejiang University are leveraging large models to improve teaching efficiency and research capabilities [21][22][24]. Group 4: Challenges and Future Outlook - The large model landscape faces challenges such as balancing model capability with security and efficiency, high operational costs, and integration difficulties into existing business systems [33][34][35]. - The article predicts that the B-end AI Agent market in China could grow to 171.8 billion yuan by 2025, indicating a long-term trend towards the integration of AI in business operations [41]. - The future of large models is expected to evolve into a fundamental infrastructure for enterprises, with cloud providers playing a key role in facilitating this transition [42].
豆包概念震荡拉升 润欣科技涨超15%
news flash· 2025-06-09 02:42
Core Viewpoint - The article highlights the significant stock price movements in the "Doubao" concept sector, with notable gains in several companies, driven by the upcoming 2025 Volcano Engine Original Power Conference focusing on advanced topics in AI and cloud-native technologies [1] Group 1: Stock Performance - Runxin Technology saw a stock increase of over 15% [1] - Other companies such as Guangyun Technology, Haitai Ruisheng, Yili Media, and Hanyi Co. also experienced stock price increases exceeding 5% [1] Group 2: Event Details - The 2025 Volcano Engine Original Power Conference is scheduled to take place from June 11 to June 12 [1] - The conference will focus on cutting-edge topics including agent development, multimodal understanding, deep thinking, and AI cloud-native technologies [1]
传统云还在「卖铁」,下一代云已在「炼钢」:火山引擎xLLM如何一张卡榨出两张的性能!
机器之心· 2025-05-27 04:11
机器之心报道 编辑:Panda 大模型越来越聪明,企业却似乎越来越焦虑了。 模型性能突飞猛进,从写文案到搭智能体(Agent),AI 掌握的技能也越来越多。但一到真正上线部署,问题就来了:为什么推理成本越来越 高?算力投入越来越多?效果却不成正比? 现如今,推理大模型已经具备服务复杂业务场景的实力。但是,要想让它们在工作时有足够快的速度,企业往往不得不大力堆卡(GPU),从 而满足 T PO T (平均输出一个 Token 的时间)和 TPS (每秒 Token 数)等指标。也就是说,在迈过了模型性能的门槛之后,企业却发现大模 型落地还有另一个高耸的门槛: 推理效率 。 为了响应这一需求,云厂商不约而同地把目光投向了「卖铁」,也就是上更多、更新但也更贵的卡。但它们的客户面临的问题真的是「卡不够 多不够强」吗? 火山引擎给出的答案是:不是卡不够多,也不是卡不够强,而是没「炼」好。 这家已经高举「 AI 云原生 」旗帜的云服务平台已经在「炼钢」这个方向上走出了自己的道路,其推出的 xLLM 大语言模型推理框架具有堪称 极致的性能,能低时延、高吞吐地支持大规模部署: 用同样的 GPU 卡,计算成本仅为开源框架的二分 ...
火山引擎的野心,不止是一个“更聪明的模型”
Sou Hu Cai Jing· 2025-04-24 11:19
但问题也随之而来: 推理能力和多模态能力,真的从实验室走向了可落地的规模化吗? 2025年春天,AI行业的一系列动作释放出一种不同以往的信号。GPT-4o以更强的多模态处理能力强化人机交互;DeepSeek R2持续推进开源攻势,刷新国 产模型的技术期待;而字节跳动旗下的火山引擎,在杭州举行了一场没有太多华丽词藻但含金量颇高的发布会,核心关键词只有三个:深度思考、多模态 推理、全栈Agent。 AI模型从"语言输出者"走向"任务执行者",从生成文字、图像,到开始操作浏览器、编辑视频、理解图表乃至"看图做决策"。这并非简单的模型功能更 新,而是AI能力边界的一次实质性拓展。在这场变化中,字节推出的豆包1.5thinking模型、Seedream3.0文生图引擎、OS Agent平台化方案,构成了一个系 统性的技术组合,也预示着其未来在AI生态中的角色将不再只是"提供一个大模型"。 Agent的门槛是否已经抬升?开发者与企业会为这种能力买单吗? 在国产模型陷入"开源焦虑"时,字节为何依旧坚持平台化和自研路线? 火山引擎强调的"AI云原生"到底是Buzzword,还是产业基础设施的重构? 这些问题不仅关乎一场发布会 ...