存算协同
Search documents
存力中国行北京站释放信号:AI推理进入存算协同深水区
Sou Hu Cai Jing· 2025-11-11 12:38
Core Insights - The event "Storage Power China Tour" in Beijing focused on the challenges and innovative paths of storage power in the AI inference era, highlighting the importance of advanced storage as a core support for AI technology implementation [1] - The AI industry has transitioned from model creation to practical application, with inference costs becoming a bottleneck for large-scale deployment, driven by the exponential growth of token usage in various sectors [3] - Technical innovation is essential for overcoming industry pain points, with storage architecture evolving from passive storage to intelligent collaboration, exemplified by Huawei's Unified Cache Management (UCM) technology [4] Industry Challenges - The AI industry's shift to practical applications has led to three main challenges: the explosion of multimodal data creating storage capacity pressures, the high performance demands on storage systems, and the high costs of advanced storage media [3] - Traditional storage architectures struggle to meet the requirements for high throughput, low latency, and heterogeneous data integration, hindering AI application development [3] Technological Innovations - The UCM technology developed by Huawei represents a significant advancement, enabling a three-tier cache architecture that dramatically reduces token latency by up to 90% and increases system throughput by 22 times [4] - UCM's open-source initiative aims to lower barriers for small and medium enterprises to access advanced inference acceleration capabilities and promote unified technical standards [4] Ecosystem Development - A collaborative effort involving Huawei, China Mobile, and Inspur has led to the establishment of the "Advanced Storage AI Inference Working Group," focusing on technology research, standard formulation, and ecosystem building [5] - The Chinese storage industry has a solid foundation, with total storage capacity reaching 1680 EB by June 2025, and advanced storage accounting for 28% of this capacity, nearing the targets set in national development plans [5][6] Future Outlook - Advanced storage is evolving into a central component of the AI intelligent computing system, addressing performance, cost, and efficiency bottlenecks, thus making AI technology more accessible to small and medium enterprises [7] - The ongoing technological advancements and ecosystem improvements are expected to transform AI from a luxury for large enterprises into a necessity for smaller businesses, enhancing its practical value in real-world applications [7]
存力中国行暨先进存力AI推理工作研讨会在京顺利召开
Zheng Quan Ri Bao Wang· 2025-11-07 07:29
Core Insights - The conference focused on the role of advanced storage in empowering AI model development in the AI era [1][2] - Key experts from various organizations discussed the challenges and solutions related to AI inference and storage technology [2][3][4] Group 1: Advanced Storage and AI Inference - The chief expert from the China Academy of Information and Communications Technology emphasized that advanced storage is crucial for improving AI inference efficiency and controlling costs [2] - The national policies highlight the importance of advancing storage technology and enhancing the storage industry's capabilities [2] - A working group was established to promote collaboration and innovation in storage technology within the AI inference sector [2] Group 2: Technical Challenges and Solutions - Current challenges in AI inference include the need for upgraded KV Cache storage, multi-modal data collaboration, and bandwidth limitations [3] - China Mobile is implementing layered caching, high-speed data interconnects, and proprietary high-density servers to enhance storage efficiency and reduce costs [3] - Huawei's UCM inference memory data management technology addresses the challenges of data management, computational power supply, and cost reduction in AI applications [4] Group 3: Industry Collaboration and Future Directions - The conference facilitated discussions among industry experts from various companies, contributing to the consensus on the future direction of the storage industry [5] - The focus is on enhancing computational resource utilization and addressing issues related to high concurrency and low latency in AI inference [4][5] - The successful hosting of the conference is seen as a step towards fostering innovation and collaboration in the storage industry [5]
存力中国行北京站暨先进存力AI推理工作研讨会顺利召开
Guan Cha Zhe Wang· 2025-11-06 04:14
Core Insights - The article emphasizes the rapid integration of AI large models across various industries, highlighting the significance of data as a fundamental strategic resource for national development [1][3] - The event organized by the China Academy of Information and Communications Technology focused on the role of advanced storage technologies in enhancing AI model performance and addressing challenges in inference costs and efficiency [1][3] Group 1: Industry Challenges and Developments - The current AI application landscape faces significant challenges in inference costs, efficiency, and quality, making advanced storage a key factor in improving AI inference performance and controlling costs [3] - The Chinese government is prioritizing the development of advanced storage technologies, as outlined in policies like the "Action Plan for High-Quality Development of Computing Power Infrastructure," which aims to accelerate research and application of storage technologies [3] - The meeting resulted in the establishment of a working group focused on advanced storage for AI inference, with recommendations to encourage innovative storage technology development and promote deep integration of storage and computing [3][6] Group 2: Technological Innovations and Solutions - China Mobile shared insights on storage technology trends, addressing challenges such as the need for KV Cache storage upgrades and bandwidth limitations, proposing solutions like hierarchical caching and high-speed data interconnects [4] - Huawei highlighted three major challenges in IT infrastructure for the AI era: managing data effectively, ensuring sufficient computing power, and reducing costs, while introducing their UCM inference memory data management technology [5] - Silicon-based Flow discussed solutions to the slow and costly inference issues of large models, focusing on enhancing computing resource utilization and optimizing performance through intelligent gateways and KV Cache solutions [5]
先进存力赋能AI大模型发展
Zhong Guo Xin Wen Wang· 2025-11-06 02:29
Core Insights - The "Storage Power China Tour" event in Beijing focused on the role of advanced storage in empowering AI model development in the AI era [1] - The need for seamless integration of model capabilities into various business scenarios is emphasized, highlighting the importance of storage for AI training and inference [1] Group 1: Industry Challenges and Developments - AI applications are facing significant challenges in inference costs, efficiency, and quality, making advanced storage crucial for enhancing AI inference performance and controlling costs [1] - The Ministry of Industry and Information Technology and other departments released an action plan in October 2023, emphasizing the acceleration of storage technology development and the enhancement of storage industry capabilities [1] Group 2: Technological Innovations and Solutions - Huawei's UCM inference memory data management technology aims to address challenges in data management, computational power, and cost reduction for AI inference [2] - Recommendations from industry experts include adapting core inference frameworks to multi-modal models and optimizing existing models to improve cost-effectiveness [2] - Future trends indicate a shift in storage from passive to intelligent computing collaboration, with a focus on high-density all-flash storage and integrated storage-computing technologies [2]
宏杉科技闪耀东博会!荣登“中国AI出海未来独角兽企业TOP100榜单”,开启全球化新篇章
Sou Hu Cai Jing· 2025-09-23 08:45
Group 1 - The 22nd China-ASEAN Expo was held in Nanning, Guangxi, showcasing cooperation achievements between China and ASEAN countries, with 3,200 enterprises from 60 countries participating [1][3] - Hongshan Technology, a leader in AI storage, was recognized in the "Top 100 Future Unicorn Enterprises in AI Going Global" list, highlighting its strength in the AI storage sector [1][4] - The event served as a platform for enhancing collaboration in the AI industry between China and ASEAN, with officials emphasizing the region as a "golden harbor" for AI expansion [3][6] Group 2 - Hongshan Technology's entry into the unicorn list reflects its significant potential in the AI export sector, driven by its innovative solutions tailored for AI computing scenarios [4][6] - The company has launched customized solutions like the DeepSeek integrated machine and AI storage for various sectors, including healthcare and education, establishing a strong foundation for international expansion [6][7] - The introduction of the new AI storage product, MS5520G3, at the expo demonstrates Hongshan Technology's commitment to high-performance and reliable storage solutions, addressing the growing data demands in AI applications [9][11]
押注“国产英伟达”!东芯股份2.11亿元再投亏损GPU公司
Xin Lang Cai Jing· 2025-09-03 21:12
Core Viewpoint - Dongxin Co., Ltd. (688110.SH) has shown a positive market response following its announcement of additional investment in Shanghai Lishuan Technology, indicating confidence in the GPU sector and a pursuit of industry chain synergy [1][2]. Group 1: Investment Details - Dongxin Co. plans to invest approximately 211 million yuan in Shanghai Lishuan Technology, following a previous investment of 200 million yuan in August 2024 [1]. - The total investment round for Shanghai Lishuan, including contributions from other investors, amounts to around 500 million yuan [1]. - Shanghai Lishuan, established in April 2022, focuses on the research and design of scalable GPU chips, with products aimed at mainstream graphics rendering and AI acceleration [1][2]. Group 2: Financial Performance - Shanghai Lishuan has not generated revenue and has reported continuous losses, with net losses of 210 million yuan and 155 million yuan for 2024 and the first seven months of 2025, respectively [1][3]. - Dongxin Co. has faced declining net profits since its listing, with losses of 306 million yuan and 167 million yuan in 2023 and 2024, respectively [3]. - The investment loss from Shanghai Lishuan accounted for nearly half of Dongxin's total net loss in the first half of the year, amounting to 52.31 million yuan [3]. Group 3: Market Dynamics - Dongxin's stock price has experienced significant volatility, with a cumulative increase of 207.85% from July 29 to August 28, significantly outperforming major indices [3]. - The investment in Shanghai Lishuan reflects a complex valuation logic in the semiconductor sector, balancing the potential of emerging technologies against current financial pressures [2]. - Experts emphasize the importance of a clear technological roadmap and financial discipline in evaluating the success of cross-industry investments in semiconductors [3].
押注“国产英伟达”!东芯股份2.11亿元再投亏损GPU公司,股价狂飙难掩主业连亏
Hua Xia Shi Bao· 2025-09-03 12:24
Core Viewpoint - Dongxin Co., Ltd. (688110.SH) has resumed trading and experienced a stock price increase of over 14% at one point, closing with a 1.17% rise at 119.38 yuan per share, resulting in a market capitalization of 52.8 billion yuan. The company announced an additional investment of approximately 211 million yuan in Shanghai Lishuan Technology Co., Ltd. (Shanghai Lishuan) [2][3][4]. Investment and Financial Performance - Dongxin Co. plans to invest approximately 500 million yuan in Shanghai Lishuan, with its own contribution being about 211 million yuan, acquiring around 35.87% of the company's shares post-investment [3][4]. - Shanghai Lishuan, established in April 2022, focuses on the development of scalable GPU chips and has yet to generate revenue, reporting continuous losses of 210 million yuan and 155 million yuan for 2024 and the first seven months of 2025, respectively [4][5]. - Dongxin Co. has previously invested 200 million yuan in Shanghai Lishuan in August 2024, with the company's pre-investment valuation rising from approximately 200 million yuan to 3.5 billion yuan within a year [6]. Business Strategy and Market Position - Dongxin Co. aims to enhance its core competitiveness through this investment, aligning with its integrated strategy of "storage, computing, and networking" [4][8]. - The company has faced increasing pressure on profitability, with net profits declining in 2023 and 2024, reporting losses of 306 million yuan and 167 million yuan, respectively. The first half of 2025 saw a revenue increase of 28.81% to 343 million yuan, but a net loss of 111 million yuan, a decline of 21.78% year-on-year [7][8]. Market Dynamics and Future Outlook - The investment in Shanghai Lishuan reflects a complex duality in the semiconductor industry, where investments in loss-making tech companies can signify forward-looking strategies but also carry significant risks [5][10]. - The success of Shanghai Lishuan's core product, the 7G100 GPU, is critical for future revenue and profitability, with its market acceptance and competitive positioning being key factors [5][9]. - Experts suggest that achieving true business synergy between storage and GPU technologies requires deep technical integration and effective communication between R&D teams, which poses substantial challenges [5][10].
押注“国产英伟达”!东芯股份2.11亿元再投亏损GPU公司 股价狂飙难掩主业连亏
Hua Xia Shi Bao· 2025-09-03 12:20
Core Viewpoint - Dongxin Co., Ltd. (688110.SH) has resumed trading with its stock price rising over 14% at one point, closing at 119.38 yuan per share, with a market capitalization of 52.8 billion yuan, following its announcement of an additional investment in Shanghai Lishuan Technology Co., Ltd. of approximately 211 million yuan [2][3] Investment and Financial Performance - Dongxin Co. plans to invest a total of about 500 million yuan in Shanghai Lishuan, with its own contribution being approximately 211 million yuan, resulting in a 35.87% stake in Lishuan after the investment [3] - Shanghai Lishuan, established in April 2022, focuses on the development of scalable GPU chips and has not yet generated revenue, reporting continuous losses of 210 million yuan and 155 million yuan for 2024 and the first seven months of 2025, respectively [3][6] - Dongxin's net profit has declined since its second year post-IPO, with losses of 306 million yuan and 167 million yuan in 2023 and 2024, respectively, and a net loss of 111 million yuan in the first half of 2025 [6][7] Strategic Outlook - The investment in Shanghai Lishuan reflects Dongxin's long-term optimism in the GPU sector and aims to enhance its integrated strategy of "storage, computing, and networking" [2][3] - The core business of Shanghai Lishuan is heavily reliant on its self-developed 7G100 GPU product, which poses risks related to market acceptance and competition [5] - Dongxin's ongoing high-level R&D investments are aimed at upgrading storage chip processes and reliability, while also expanding sales channels and brand promotion [7] Market Dynamics - The launch of Shanghai Lishuan's 7G100 GPU series has positioned it as a competitor to NVIDIA, leading to a significant stock price increase for Dongxin, which saw a cumulative rise of 207.85% from July 29 to August 28 [8][10] - The semiconductor industry is characterized by complex valuation logic, where investments in loss-making tech companies can reflect both potential future value and inherent risks [4][9]
中国电信股价下跌1.17% 参与成立AI推理工作组
Jin Rong Jie· 2025-08-26 16:26
Group 1 - As of August 26, 2025, China Telecom's stock price is 7.58 yuan, down 0.09 yuan or 1.17% from the previous trading day [1] - The trading volume on that day was 1.5598 million hands, with a transaction amount of 1.182 billion yuan [1] - China Telecom is a major state-owned telecom operator in the communication services industry, providing fixed and mobile communication services, internet access, and information services [1] Group 2 - Recently, China Telecom participated in the establishment of the "Advanced Storage AI Inference Working Group," which aims to promote the development of "storage-computing collaboration and ecological co-construction" in the AI inference field [1] - The company showcased its "Wide-area Intelligent Computing Lossless Networking Technology" at the 2025 China Computing Power Conference, which enables efficient collaboration between distant data centers [1] Group 3 - On August 26, 2025, the net outflow of main funds for China Telecom was 206 million yuan, accounting for 0.04% of its circulating market value [1] - Over the past five trading days, the cumulative net inflow of main funds was 2.00496 million yuan [1]
工信部:有序引导智能算力基础设施适度超前动态平衡
Zhong Guo Zheng Quan Bao· 2025-08-22 20:09
Group 1 - The 2025 China Computing Power Conference was held in Datong, Shanxi from August 22 to 24, focusing on optimizing national computing power layout and promoting the development of intelligent computing infrastructure [1][2] - As of March this year, China's total computing power ranks second globally, with 10.43 million standard racks in use and an intelligent computing power scale of 748 EFLOPS, supporting AI development and high-quality economic growth [1] - A report released on August 22 highlighted that the rapid growth of data production is not matched by the expansion of storage resources, with a projected national data annual output of 41.06 ZB in 2024 but only 2.09 ZB of total storage capacity [1] Group 2 - Current challenges in computing power include inefficient resource scheduling and underutilization of GPU performance, necessitating technological innovations to meet new demands for ultra-low latency and high throughput in emerging business models [2] - Shanxi province is leveraging the "East Data West Computing" strategy to transform its economic growth model from coal and electricity export to computing power and services, aiming to become a key hub for computing power demand and industry [2] - The province currently has 514,000 standard racks in use and an intelligent computing power scale of 32 EFLOPS, with an average Power Usage Effectiveness (PUE) of 1.2, ranking among the top in the country [2]