火山引擎
Search documents
传统云还在「卖铁」,下一代云已在「炼钢」:火山引擎xLLM如何一张卡榨出两张的性能!
机器之心· 2025-05-27 04:11
机器之心报道 编辑:Panda 大模型越来越聪明,企业却似乎越来越焦虑了。 模型性能突飞猛进,从写文案到搭智能体(Agent),AI 掌握的技能也越来越多。但一到真正上线部署,问题就来了:为什么推理成本越来越 高?算力投入越来越多?效果却不成正比? 现如今,推理大模型已经具备服务复杂业务场景的实力。但是,要想让它们在工作时有足够快的速度,企业往往不得不大力堆卡(GPU),从 而满足 T PO T (平均输出一个 Token 的时间)和 TPS (每秒 Token 数)等指标。也就是说,在迈过了模型性能的门槛之后,企业却发现大模 型落地还有另一个高耸的门槛: 推理效率 。 为了响应这一需求,云厂商不约而同地把目光投向了「卖铁」,也就是上更多、更新但也更贵的卡。但它们的客户面临的问题真的是「卡不够 多不够强」吗? 火山引擎给出的答案是:不是卡不够多,也不是卡不够强,而是没「炼」好。 这家已经高举「 AI 云原生 」旗帜的云服务平台已经在「炼钢」这个方向上走出了自己的道路,其推出的 xLLM 大语言模型推理框架具有堪称 极致的性能,能低时延、高吞吐地支持大规模部署: 用同样的 GPU 卡,计算成本仅为开源框架的二分 ...
【大涨解读】玩具:又一玩具巨头拟赴港上市,字节、OpenAI也在加速部署,三类企业有望受益
Xuan Gu Bao· 2025-05-26 06:37
Group 1: Market Performance - On May 26, the AI toy concept gained momentum, with leading companies such as Shifeng Culture hitting the daily limit, and AoFei Entertainment and BoTong Integration also reaching their daily limits. Runxin Technology surged over 17%, while Huali Technology rose more than 7% [1] - Shifeng Culture's stock price increased by 9.99% to 20.37, with a trading volume of 15.94% and a market capitalization of 25.77 billion [2] - BoTong Integration's stock price rose by 8.97% to 34.01, with a market capitalization of 51.2 billion [2] - AoFei Entertainment's stock price increased by 8.07% to 10.18, with a market capitalization of 103.67 billion [2] - Huali Technology's stock price rose by 7.37% to 29.86, with a market capitalization of 41.3 billion [2] Group 2: Industry Developments - On May 23, the IP toy brand "52TOYS" submitted its prospectus to the Hong Kong Stock Exchange [3] - In the Chinese IP toy market, 52TOYS ranked third in GMV last year, approximately 1/9 of the GMV of Pop Mart [4] - OpenAI's CEO demonstrated an AI "companion" device, with plans to ship 100 million units by the end of 2026 [4] - An AI toy innovation seminar was held in Shantou, focusing on product and technology innovation, industry ecosystem building, and commercialization paths [5] Group 3: Market Potential and Trends - The Chinese toy market has reached a scale of hundreds of billions and is expected to maintain steady growth over the next five years, with IP toys and character-based toys likely to outperform other categories [6] - AI toys are projected to capture a larger market share, with estimates suggesting that by 2028, the domestic AI toy market could reach 30-40 billion [6] - Three types of companies are expected to dominate the AI toy sector: technology-driven companies, IP resource integrators, and traditional toy manufacturers [6][7] - Successful local licensing and brand partnerships are emerging as effective strategies for overseas expansion of IP toys [7]
AI攻击变异率每24小时达93% 全球AI安全损失逼近235亿美元:攻防博弈如何破局?
Mei Ri Jing Ji Xin Wen· 2025-05-25 08:25
Core Insights - The C3 Security Conference highlighted the escalating security risks associated with AI, with global losses from large model security incidents projected to rise from $8.5 billion in 2023 to over $23.5 billion by 2025, indicating a significant increase in AI-related security threats [4][6][10] - AI is exacerbating the asymmetry in cybersecurity, with attackers leveraging open-source tools and AI to enhance their attack methods, while defenders struggle with outdated techniques and resource constraints [5][9] Group 1: AI Security Challenges - The rapid mutation rate of AI-assisted attacks is alarming, with a reported 93% mutation rate every 24 hours in 2024 [4] - The attack efficiency of AI surpasses defensive responses, creating a technological imbalance where attackers can exploit publicly available data for training, while defenders face compliance issues [5][10] - The average time for attackers to breach systems is now 48 minutes, with the fastest recorded at 51 seconds, highlighting the urgency of the threat landscape [9] Group 2: Talent Shortage and Industry Response - There is a significant talent gap in the AI security field, with a shortage of over 300,000 professionals globally, particularly those with expertise in both AI and cybersecurity [10] - Companies are adopting a T-shaped talent development model to address the shortage, emphasizing the need for professionals who possess deep security knowledge and broad AI understanding [10] - The industry consensus is shifting towards a systematic and integrated defense approach, moving away from isolated defenses to a collaborative security matrix [11][12] Group 3: Future of Cybersecurity - The next decade is seen as a golden period for cybersecurity, with a predicted evolution towards cognitive security paradigms and self-supervised security systems by 2030 [13] - The integration of AI into security frameworks is expected to blur the lines between security and intelligence, leading to a new paradigm of "security intelligence" [13]
从“在中国制造” 到“为中国设计” 再到“由中国定义” 合资车企转型开启“加速度”(经济聚焦)
Ren Min Ri Bao· 2025-05-22 21:47
Core Insights - The automotive industry is undergoing a transformation towards electrification and intelligence, prompting joint ventures to clarify their direction and accelerate their transition [1][2] - The shift from "manufacturing in China" to "designing for China" and "defining by China" marks the emergence of the "Automotive Joint Venture 2.0" era, emphasizing deep collaboration and ecosystem integration [1] Market Dynamics - The market environment for joint venture car manufacturers has changed significantly, with their market share in China's passenger car market dropping from 61.6% in 2014 to an estimated 31.5% in 2024 [2] - The number of joint venture brand 4S networks is projected to decline, with a total of 7,744 joint venture brand outlets in 2024, a year-on-year decrease of 13.5% [2] Pricing Strategies - Joint venture car manufacturers are breaking away from traditional pricing models, with companies like SAIC Volkswagen adopting a "one-price" marketing strategy to enhance price transparency and convenience for consumers [3] - The "one-price" model has shown positive market performance, indicating a recovery in sales [3] R&D Innovations - Joint ventures are restructuring their R&D models, moving from unilateral input to collaborative output, with increased investment in local R&D centers [4][5] - Toyota has established a dedicated electric vehicle and battery R&D center in Shanghai, emphasizing local market needs and integrating Chinese engineers into the development process [4] - Nissan plans to invest 10 billion yuan in electric vehicle R&D over the next two years, aiming to accelerate technology iteration and product launch [4] Local Ecosystem Integration - The development of a robust smart electric vehicle supply chain in China is facilitating the transition of joint ventures towards electrification and intelligence [6] - Joint ventures are increasingly collaborating with local suppliers to enhance product offerings and meet consumer demands, particularly in smart technology and user experience [6][7] Strategic Partnerships - Many joint ventures are expanding their partnerships with local suppliers to leverage their technological strengths, which helps in quickly adapting to market changes and improving product competitiveness [7] - Executives from major automotive companies express a commitment to showcasing the competitive advantages of China's electric vehicle supply chain on a global scale [7]
火山引擎发布豆包·语音播客模型,秒级生成“真人对话”播客
Cai Fu Zai Xian· 2025-05-21 05:08
Core Insights - The launch of Doubao Voice Podcast Model represents a significant upgrade in voice language technology, enabling quick transformation from text to podcast format, enhancing user experience with low cost, high efficiency, and strong interactivity [1] Group 1: Product Features - The model allows for natural and fluent dialogue, overcoming previous AI-generated speech limitations, achieving a professional podcast recording level [1] - It streamlines the podcast creation process, allowing users to complete the entire workflow efficiently without extensive time and effort [1] - The model includes a deep search function to generate timely podcast audio based on current hot topics, with a quick turnaround of 5 seconds for new content [1] Group 2: User Capabilities - Users can input a theme to generate in-depth podcast viewpoints, providing rich ideas and content for creators [2] - The model supports converting long texts into podcast format, allowing users to create high-quality podcasts from documents or URLs [2] Group 3: Future Developments - The Doubao Voice Podcast Model will be available on Doubao APP, PC, and other products, with more podcast creation features to be revealed at the upcoming 2025 Huoshan Engine Force Conference on June 11 [3]
知乎AI大会,火山引擎创业大赛...5月不可错过的AI活动都在这里了
Founder Park· 2025-05-20 11:42
Group 1 - Major tech companies are hosting developer conferences in May, including Microsoft, Google, and Anthropic [1] - Apple's WWDC event is scheduled for June, along with several significant domestic events [2] - The Zhihu New Knowledge Youth Conference will feature a sub-forum titled "AI Variable Research Institute," focusing on large models, embodied intelligence, and chips [3] Group 2 - The "AI Variable Research Institute" forum will take place on May 24 in Beijing, providing a platform for deep technical exchange and industry connection [3] - The WaytoAGI Global AI Conference in Tokyo on June 7-8 aims to promote international AI technology exchange and cooperation [5] - An AI programming creative challenge organized by Zeabur and Tencent Cloud is open for participants interested in AI programming [6][7] Group 3 - The 2025 Volcano Engine FORCE Original Power Conference will be held in Beijing on June 11-12, focusing on AI entrepreneurship and featuring a Demo Day [7] - The event encourages innovative companies to participate and showcase their projects [7] - Participants in the AI programming challenge will receive one month of Tencent Cloud server resources and have the chance for official exposure and special prizes [9]
AI智能体应用加速落地
Jing Ji Ri Bao· 2025-05-14 21:59
Core Viewpoint - The development of AI agents is rapidly advancing, driven by technological breakthroughs and market demand, with significant investments and policy support from both central and local governments [1][4][10] Group 1: AI Agent Definition and Applications - AI agents are advanced AI systems capable of autonomous perception, reasoning, and action in specific environments, applicable in various scenarios such as content creation, knowledge assistance, and intelligent search [1][2] - Current applications of AI agents in enterprise settings include enhancing efficiency and data-driven decision-making, particularly in logistics and human resources management [2][3] - In consumer applications, AI agents focus on providing personalized experiences and services, such as smart marketing engines and home automation systems [3][5] Group 2: Market Growth and Policy Support - The global AI agent market is projected to grow at a compound annual growth rate (CAGR) exceeding 40% over the next five years [4] - Policies from cities like Beijing and Shanghai are fostering the development of general AI agents, providing support for innovation and operational cost coverage [4][5] Group 3: Commercialization and Industry Integration - The shift from technical concepts to commercial applications is being accelerated by startups and major tech companies developing AI agent products and platforms [4][7] - AI agents are becoming integral to enterprise operations, addressing issues like fragmentation and low ROI in traditional AI applications [7] Group 4: Challenges and Considerations - The rise of "pseudo AI agents" poses a risk, as some companies misrepresent traditional technologies as AI agents [8] - Technical challenges such as cognitive reliability and decision-making transparency need to be addressed to ensure trustworthy AI agent applications [8][9] - Cost management is critical, as the use of AI agents for complex tasks can lead to significant token consumption and increased computational resource demands [9] - The establishment of standardized protocols and high-quality data sets is essential for the scalable deployment of AI agents across various scenarios [9][10]
火山引擎在沪发布系列新模型 豆包大模型产业落地加速
Xin Hua Cai Jing· 2025-05-14 08:31
Core Insights - Volcano Engine held an AI innovation exhibition in Shanghai, launching several models including Seedance 1.0 lite for video generation and the upgraded Doubao 1.5 visual deep thinking model, aiming to enhance the application chain from business to intelligent agents [1][2] - The Seedance 1.0 lite model supports text-to-video and image-to-video generation, achieving significant improvements in video quality and generation speed, making it suitable for various applications such as e-commerce advertising and entertainment [1] - The Doubao 1.5 model demonstrates strong multi-modal understanding and reasoning capabilities, ranking in the top tier across 38 out of 60 public evaluation benchmarks [1][2] Model Upgrades and Applications - The Doubao music model was upgraded to support English song creation and can automatically adapt background music based on video understanding, now fully launched [2] - Data Agent, a new enterprise data intelligent agent, can analyze and generate professional research reports by integrating structured and unstructured data [2] - Doubao models have been widely adopted across industries including automotive, finance, education, and retail, covering nearly 400 million devices and major companies [2] Industry Collaborations - Giant Network announced a collaboration with Volcano Engine to enhance AI gameplay in their social deduction game "Space Kill" using Doubao models [2][3] - Eli Lilly has developed a dedicated AI application platform in partnership with Volcano Engine, facilitating innovations in drug development and disease diagnosis [3] - Volcano Engine emphasizes the importance of a three-stage journey for AI implementation, focusing on investment returns, model infrastructure, and the lifecycle of intelligent agents [3][4] Model Service Matrix - Volcano Engine has established a comprehensive model service matrix covering various fields such as language, deep thinking, vision, and speech, continuously optimizing model capabilities to meet specific business needs [4]
早报|苹果今年或实现脑机接口操控 iPhone/京东美团饿了么被约谈/小米车主喊话雷军:保持真诚
Sou Hu Cai Jing· 2025-05-14 01:55
Group 1 - Samsung officially launched the Galaxy S25 Edge, featuring a thickness of only 5.8mm and a weight of 163g, made with titanium metal for durability [4] - The Galaxy S25 Edge includes a 200MP main camera and a 12MP ultra-wide camera, optimized with a new visual engine and AI editing features [4] - The pricing for the Galaxy S25 Edge starts at 7999 yuan for the 12GB+256GB version and 8999 yuan for the 12GB+512GB version [6] Group 2 - OpenAI introduced a new AI health benchmark called "HealthBench," developed in collaboration with 262 doctors from 60 countries, which includes 5000 real medical dialogues [13] - The best-performing model in the HealthBench tests was OpenAI's o3 model, which improved by 28% in recent months [13][14] - OpenAI predicts that 2025 will be the year of AI agents, particularly in programming, where they will significantly enhance efficiency and create substantial business value [52][53] Group 3 - Xiaomi's SU7 Ultra model faced backlash from customers over misleading advertising regarding its carbon fiber hood, leading to over 300 customers seeking refunds [26][27] - The Chinese market regulator has held discussions with major food delivery platforms like JD, Meituan, and Ele.me to address competitive issues and ensure compliance with relevant laws [28] Group 4 - Nezha Auto's parent company, Hezhong New Energy Vehicle Co., has been filed for bankruptcy, amid ongoing financial difficulties and reports of stock freezes [31][32] - Perplexity, an AI startup, is in talks for a new funding round that could value the company at $14 billion, although this is lower than its initial target of $18 billion [36][37][38] Group 5 - iQIYI responded to a report of violating personal information collection regulations, stating it is actively rectifying the issues identified [39][41] - Huawei announced a product launch event scheduled for May 19, where it will unveil the HarmonyOS computer and nova 14 series smartphones [57]
AI早报 | 软银对OpenAI的投资或降至200亿美元;月之暗面回应涉足AI医疗
Sou Hu Cai Jing· 2025-05-14 00:21
Group 1 - SoftBank's investment in OpenAI may be reduced to $20 billion, down from an initial commitment of $40 billion, contingent on OpenAI's transition to a Public Benefit Corporation (PBC) by 2025 [2] - OpenAI's CEO announced the cancellation of plans to transition from a non-profit to a for-profit entity, which could further impact SoftBank's investment scale [2] Group 2 - Moonlight's recent focus on AI medical products aims to enhance the search quality of its Kimi product in specialized fields such as finance, law, and medicine [3] - Google launched the "AI Futures Fund" to invest in startups and provide access to Google DeepMind's latest AI models, resources, and technical expertise [4] Group 3 - Tencent's Mixuan announced the open-sourcing of the UnifiedReward-Think model, which enhances reasoning capabilities across visual tasks [5] - Saudi AI company HUMAIN is partnering with NVIDIA to establish an AI factory, aiming to position Saudi Arabia as a global leader in AI and digital transformation [6] Group 4 - A new intelligent technology company, Sichuan Zhixiang Qiyuan, has been established, focusing on AI software development and application [7] - Kunlun Wanwei has officially open-sourced the Matrix-Game model, which is designed for interactive world generation in gaming environments [8]