科大讯飞星火

Search documents
【行业前瞻】2025-2030年全球及中国多模态大模型行业发展分析
Sou Hu Cai Jing· 2025-05-07 03:45
Core Insights - The multi-modal large model industry focuses on deep learning models capable of processing, understanding, and generating various types of data, including text, images, audio, and video, enabling complex and intelligent tasks [1] - The industry has a wide application potential across various sectors such as natural language processing, image recognition, speech recognition, intelligent driving, and medical imaging diagnosis [1] Industry Overview - The multi-modal large model industry chain is complex, encompassing hardware facilities, software development, and various model types, including CLIP, BLIP, and LLaMA, among others [1] - The industry is divided into three layers: the foundational layer (hardware and basic software), the model layer (various types of multi-modal large models), and the application layer (industry-specific applications) [1] Cost Structure - The training costs for mainstream domestic large models range from tens of millions to hundreds of millions of dollars, with major companies like Baidu, Alibaba, and Tencent investing over $200 million [3][5] - Startups like Kimi and DeepSeek have managed to reduce training costs to between $30 million and $60 million through technological optimizations [3] - Cloud hosting costs are significantly influenced by model scale, with major companies leveraging their own cloud platforms to reduce costs [3] Development History - The global large model industry has evolved through several phases: early exploration (1956-2005), rapid growth (2006-2019), the rise of large models (2020-2022), and the current phase of widespread application starting in 2023 [6] Computational Demand - The demand for computational power in AI is increasing, with larger models requiring exponentially more computational resources; for instance, the GPT-3 model requires 3640 PF-days of computation and at least 10,000 GPUs [9] - As model parameters increase, the computational investment needed grows significantly, influenced by model architecture, optimization efficiency, and hardware capabilities [9]
AI大模型正融入日常生活
Ke Ji Ri Bao· 2025-04-29 23:49
Group 1 - The core viewpoint of the articles highlights the significant advancements in AI and multi-modal models showcased at the 8th Digital China Construction Summit, emphasizing their integration into various sectors such as healthcare, governance, and cultural tourism [1][2][3] - China Unicom's Yuanjing model is redefining aesthetic expressions by merging culture and technology through multi-modal generation techniques, enhancing user experiences in cultural contexts [1] - China Telecom introduced the "Starry Sky Model," the first fully domestic, all-size, and all-modal AI model platform, which is facilitating the entry of AI devices into households, enhancing personalized and intelligent living experiences [2] Group 2 - The healthcare sector is a focal point, with Ant Group launching the "AI Doctor Assistant" tools and the "Hundred AI Famous Doctors" initiative, aimed at improving access to quality medical resources through AI-driven solutions [2] - In the realm of public services, digital technologies are transforming user experiences, exemplified by the "one-touch" feature for accessing local government services, making processes more efficient [2] - The summit also introduced 46 digital application experience points in key local areas, allowing citizens and tourists to access various digital services seamlessly through mobile interactions [2]
紫金网络传播创新大会在宁举行
Jiang Nan Shi Bao· 2025-04-28 00:55
Group 1 - The 2025 Zijin Network Communication Innovation Conference was held in Nanjing, focusing on the theme "Riding the Tide: Digital Media Integration for a Future" to promote innovation in network media and enhance unity in cyberspace [1] - The conference highlighted the achievements of Jiangsu's network communication over the past year, with two works recognized as top ten positive energy online masterpieces and eleven as positive energy online masterpieces [1] - A total of 72 projects will be implemented in five major chapters for Jiangsu's online major theme publicity and major topic setting in 2025, with 44 projects included in the "project pool" [1] Group 2 - The conference featured a showcase of 2024 Jiangsu-related boutique projects and a smart media technology roadshow, with experts discussing economic publicity, online external publicity, and technological empowerment [2] - Keynote speakers included leaders from various media organizations and academic institutions, providing insights into media integration and the impact of algorithm recommendations on communication [2] - The event was marked by the launch of ten "dual" projects aimed at telling compelling stories about China, including themes like "The Encounter of the Yangtze River Delta and the Greater Bay Area" [3] Group 3 - A special area for smart media technology roadshows was set up, showcasing innovative technologies from mainstream media and domestic tech companies, including humanoid robots and AI applications [4] - The conference illustrated how artificial intelligence is reshaping the communication landscape, marking a new phase of diverse coexistence in the media industry [4]