昇腾AI云服务

Search documents
华为云CloudMatrix 384超节点再出圈,昇腾AI云服务解锁超级算力
Guan Cha Zhe Wang· 2025-07-28 07:15
Core Insights - Huawei's 384 Super Node received the "Treasure of the Museum" award at the World Artificial Intelligence Conference (WAIC 2025), highlighting its significance in the AI landscape [1] - The CloudMatrix 384 Super Node enables flexible and on-demand access to powerful computing resources, making advanced AI infrastructure more accessible for enterprises [2][3] Group 1: Technology Features - The CloudMatrix 384 Super Node integrates 384 Ascend NPUs and 192 Kunpeng CPUs through a new high-speed network, achieving a computing power scale of 300 PFlops and overcoming bandwidth performance bottlenecks [3][4] - It features four key technological advantages: strong throughput performance with 2300 Tokens decoding per card, coverage of over 160 mainstream models for efficient model migration, a pioneering large-scale expert parallel scheme for system-level optimization, and flexible scalability with low initial investment [4] Group 2: Industry Applications - The CloudMatrix 384 Super Node has been widely adopted across various industries, enhancing delivery efficiency by over 50% for Sina's "Smart Xiao Lang," supporting 6 million daily users for Silicon-based Flow, and accelerating AI model training for the Chinese Academy of Sciences [4] - Huawei Cloud's solutions are being utilized in diverse sectors, such as developing a railway model and smart inspection robots in collaboration with North Railway Institute, and enhancing renewable energy generation through AI and meteorology partnerships [10] Group 3: Future Outlook - Huawei Cloud aims to continue leveraging its advanced technologies and industry-specific solutions to address real-world challenges, fostering a new intelligent world in collaboration with clients and partners [10]
“中国云谷”再添新翼 和林格尔新区迎来华为北方最大AI智算中心
Sou Hu Cai Jing· 2025-07-15 13:37
Core Insights - The official launch of Huawei Cloud's data center in Hohhot (Helingeer) further solidifies the area's position as the "core" of China's "Cloud Valley" and marks a significant step towards becoming a "World Computing Valley" [1][4] - The Helingeer New Area serves as a key hub for the national "East Data West Computing" project, boasting a total computing power scale of 101,000 P with 46 data center projects [3] - The new Huawei Cloud data center is the largest cloud infrastructure in Northern China, with server installation capacity exceeding 3 million units, and it will utilize advanced cooling technologies to enhance energy efficiency [3][4] Industry Impact - The launch of the Huawei Cloud data center is a milestone in attracting top technology talent and building a computing power industry ecosystem in the Helingeer New Area [4] - The center will leverage the region's natural cooling advantages and is expected to have an annual PUE of less than 1.1 in its liquid cooling areas, contributing to the green computing landscape [3] - The new data center will support the deployment of the next-generation Ascend AI cloud services based on the CloudMatrix 384 architecture, facilitating intelligent upgrades across various sectors including government, industry, automotive, and finance [3][4]
青城聚智算力兴 绿算新篇启未来 ——“中国云谷”迈向新高度
Zhong Guo Chan Ye Jing Ji Xin Xi Wang· 2025-07-14 05:46
Core Insights - The 2025 Green Computing (AI) Conference marks a significant step in the integration of green computing and artificial intelligence in China, showcasing the development of the "China Cloud Valley" in Hohhot [1] - The conference gathered over 500 participants, including government officials, experts, and leading companies, to discuss the deep integration and innovative development of green computing and AI [1] Group 1: Conference Highlights - Five major computing center projects were launched, increasing Hohhot's computing capacity to 101,000 P, positioning it as a leading hub in the national "East Data West Computing" initiative [2] - The first national green computing and electricity collaborative base was inaugurated, featuring a 120MW/480MWh shared energy storage facility and two data center clusters [2] - The national integrated computing network's electricity collaboration pilot project commenced operation, achieving an annual power supply of 790 million kWh and reducing carbon emissions by 556,000 tons [3] Group 2: Technological Innovations - The Hohhot multi-cloud computing resource monitoring and scheduling platform became the first provincial-level platform to connect with the national backbone network, facilitating efficient resource flow [4] - The launch of four innovation laboratories, including the Quantum Information Innovation Center, aims to drive technological advancements in Hohhot's computing industry [5] - The release of the "Green Computing Development Research Report (2025)" provides a theoretical framework and path reference for industry development [6] Group 3: Strategic Partnerships and Investments - Ten key projects were signed at the conference, with a total investment of approximately 20 billion yuan, covering areas such as computing infrastructure and AI applications [8] - The collaboration agreement between the Inner Mongolia government and Baidu signifies a deepening partnership aimed at enhancing the regional data industry ecosystem [8] Group 4: Future Outlook - The conference emphasized the importance of integrating green energy with computing power, highlighting Hohhot's ambition to become a leading area for green computing development in China [9] - The discussions led by industry experts provided valuable insights into the future directions for the integration of green computing and AI [9]
2025绿色算力(人工智能)大会召开 呼和浩特展示绿色算力“硬实力”
Nei Meng Gu Ri Bao· 2025-07-13 08:55
Core Points - The 2025 Green Computing (Artificial Intelligence) Conference was held in Hohhot, Inner Mongolia, focusing on building a green computing ecosystem and promoting the national "East Data West Computing" strategy [4][11] - Hohhot has established itself as a key node in the national computing network, with the Hohhot data center cluster becoming the first provincial-level computing network connected to the national network [4][6] - The conference announced the launch of the first national green computing and electricity collaborative industrial base, which aims to drive technological advancement and industrial upgrades [4][6] Group 1: Conference Highlights - The conference showcased significant achievements, including the integration of the Hohhot cluster into the national computing network and the launch of key projects in the financial and cutting-edge research sectors [4][5] - A report by the China Academy of Information and Communications Technology highlighted that Inner Mongolia's green computing development index ranked first nationally for two consecutive years [6][7] - The conference also featured the release of several innovative solutions aimed at enhancing the green computing industry and addressing risks associated with it [7][9] Group 2: Industrial Development - Inner Mongolia has established a digital equipment manufacturing industrial park, attracting major companies and achieving the lowest electricity costs in the country [8] - The Hohhot data center cluster has surpassed a computing capacity of 101,000 PetaFLOPS, with over 95% being intelligent computing [6][8] - The region has been recognized with several titles, including "China Cloud Valley" and "Pioneer of Green Computing Development," solidifying its status as a significant green computing base [8][9] Group 3: Strategic Collaborations - The conference saw the signing of ten cooperation agreements, with a total investment of 20 billion yuan, covering areas such as AI research, regional data collaboration, and green data center construction [9][10] - The agreements aim to strengthen Hohhot's digital economy and enhance the computing data industry ecosystem [9]
中银晨会聚焦-20250703
Bank of China Securities· 2025-07-03 02:41
Core Insights - The report highlights the sustained high demand for domestic computing power driven by ongoing U.S. restrictions on advanced chip imports, accelerating the domestic substitution process [3][7] - Domestic cloud service providers are increasing capital expenditures, gradually releasing industrial demand, while the iteration of domestic AI large models and applications is further boosting computing power needs [3][7] Industry Performance - The report provides a snapshot of market indices, with the Shanghai Composite Index closing at 3454.79, down 0.09%, and the Shenzhen Component Index at 10412.63, down 0.61% [4] - The performance of various sectors is noted, with steel up 3.37% and electronics down 2.01% [5] Key Focus Areas - The domestic computing power market is experiencing a boom, with Huawei's Ascend 910C servers being deployed in significant quantities, indicating a new phase in domestic computing commercialization [7] - The Ascend 910C chip boasts a single-chip computing power of 320 TFLOPS (FP16), designed for efficiency and low power consumption, suitable for AI tasks [7] - Major domestic internet companies are ramping up investments in AI infrastructure, with Alibaba planning to invest 380 billion RMB over three years, and Tencent's capital expenditure reaching 275 billion RMB in Q1 2025, up 91% year-on-year [8] Demand Drivers - The report notes that application-side inference is expected to drive demand growth, with significant increases in token usage reported by major companies like Alphabet and ByteDance [9] - The domestic supply side, including chips and supernode deployments, has achieved technological breakthroughs, which will lead to increased demand for computing power as industry applications evolve [9]
计算机行业周报:谷歌发布全新多模态大模型Gemma3n,阿里达摩院发布医疗AI模型DAMOGRAPE-20250630
Huaxin Securities· 2025-06-30 12:43
Investment Rating - The report maintains a "Buy" rating for the computer industry, indicating a positive outlook for investment opportunities in this sector [2][54]. Core Insights - The report highlights significant advancements in AI technology, particularly with the release of Google's multimodal model Gemma 3n, which is optimized for edge devices, marking a shift from cloud-based models [16][17]. - The introduction of the AI model DAMO GRAPE by Alibaba's DAMO Academy represents a breakthrough in early gastric cancer detection using standard CT scans, showcasing the potential of AI in medical applications [28][32]. - The report emphasizes the growing trend of AI financing, with Harvey completing a $300 million Series E funding round, significantly increasing its valuation to $5 billion [39][41]. Summary by Sections Computing Power Dynamics - The report notes stable pricing in computing power rentals, with specific configurations such as Tencent Cloud's A100-40G priced at 28.64 CNY/hour and Alibaba Cloud's A800-80G at 6.03 CNY/hour, reflecting a 12.77% decrease from the previous week [15][19]. - Google's Gemma 3n model is designed for efficient operation on devices like smartphones and laptops, supporting various input types including audio and video [16][17]. AI Application Dynamics - Kimi's average weekly stay duration increased by 58.70%, indicating growing user engagement [27]. - The DAMO GRAPE model has shown promising results in clinical trials, significantly improving the detection rates of gastric cancer compared to traditional methods [28][30]. AI Financing Trends - Harvey's recent funding round has positioned it as a leading player in legal AI, with a reported annual recurring revenue of $75 million, up from $50 million earlier this year [39][40]. Investment Recommendations - The report suggests focusing on domestic computing power opportunities, particularly with Huawei's new AI cloud service, which enhances computational efficiency and supports a wide range of applications [51]. - Long-term investment is recommended in companies like 嘉和美康 (Jiahe Meikang), 科大讯飞 (iFlytek), and 寒武纪 (Cambricon), which are positioned to benefit from advancements in AI and computing technologies [52].
华为首个!重磅发布!
新华网财经· 2025-06-30 07:48
6月30日,华为宣布开源盘古70亿参数的稠密模型和720亿参数的混合专家模型(盘古Pro MoE 72B)。 此外,基于昇腾的模型推理技术也同步开源。华为表示,此举是华为践行昇腾生态战略的又一关键举 措,将推动大模型技术的研究与创新发展,加速推进人工智能在千行百业的应用与价值创造。 这一系列突破,更为关键的意义在于,华为盘古大模型是基于昇腾云的全栈软硬件训练而成的,这标志 着基于昇腾架构可以打造出世界一流大模型。华为不仅完成了国产算力+国产模型的全流程自主可控的 训练实践,同时在集群训练系统的性能上也实现了业界领先,这意味着实现了从硬件到软件、从训练到 优化、从基础研究到工程落地的"全栈国产化"和"全流程自主可控"的闭环,国产AI基础设施的自主创新 能力得到了进一步验证。 据了解,华为最新开源的Pro MoE 72B大模型,在参数量仅为720亿,激活160亿参数量的情况下,通过 动态激活专家网络的创新设计,实现了以小打大的优异性能,甚至可以媲美千亿级模型的性能表现。在 业界权威大模型榜单Super CLUE最新公布的2025年5月排行榜上,位居千亿参数量以内大模型排行并列 国内第一。 最近一段时间以来,华为公 ...
华为首个!重磅发布!
Zheng Quan Shi Bao· 2025-06-30 04:37
Core Insights - Huawei has announced the open-sourcing of the Pangu 70 billion parameter dense model and the 720 billion parameter mixture of experts model (Pangu Pro MoE 72B), marking a significant step in its Ascend ecosystem strategy to promote AI research and innovation across various industries [1][5] - The Pro MoE 72B model, with 720 billion parameters and 160 billion activated parameters, demonstrates exceptional performance that can rival models with trillion parameters, ranking first among domestic models under the 1 trillion parameter category in the latest Super CLUE rankings [3][4] - Huawei's Pangu models have been successfully implemented in over 30 industries and 500 scenarios, showcasing their value in sectors such as government, finance, manufacturing, healthcare, and more [5] Summary by Sections Open-Sourcing and Model Performance - Huawei's open-sourcing of the Pangu models aims to enhance the development of AI technologies on domestic computing platforms, expanding the Ascend ecosystem [5] - The Pro MoE 72B model's innovative design allows for dynamic activation of expert networks, achieving high performance with fewer activated parameters [3] Technological Advancements - The recent release of the Pangu Ultra MoE model, with a parameter scale of 718 billion, highlights Huawei's advancements in training large-scale models on the Ascend AI computing platform [4] - The Pangu models are built on a fully integrated software and hardware training system, demonstrating Huawei's capability in achieving a self-controlled training process from hardware to software [4] Industry Impact and Strategic Focus - Huawei emphasizes practical applications of its models, focusing on solving real-world problems across various industries rather than merely theoretical advancements [4] - The launch of the Pangu 5.5 model includes five foundational models targeting NLP, multimodal, prediction, scientific computing, and computer vision, positioning them as core drivers for digital transformation in industries [3]
华为首个!重磅发布!
证券时报· 2025-06-30 04:12
Core Viewpoint - Huawei's announcement to open source the Pangu 70 billion parameter dense model and the 720 billion parameter mixture of experts model (Pangu Pro MoE 72B) is a significant step in promoting the development and application of large model technology across various industries, aligning with its Ascend ecosystem strategy [1][7]. Group 1: Model Specifications and Performance - The newly open-sourced Pro MoE 72B model, with 720 billion parameters and 160 billion active parameters, demonstrates exceptional performance that can rival models with over a trillion parameters, according to the latest Super CLUE rankings [3][4]. - Huawei's Pangu Ultra MoE model, launched on May 30, features a parameter scale of 718 billion, showcasing advancements in training performance on the Ascend AI computing platform [4][5]. Group 2: Strategic Implications - The release of these models signifies Huawei's capability to create world-class large models based on its Ascend architecture, achieving a fully controllable training process from hardware to software [5]. - Huawei's unique approach in the large model strategy emphasizes practical applications across various industries, aiming to solve real-world problems and accelerate the intelligent upgrade of numerous sectors [5][7]. Group 3: Industry Impact - The Pangu large models have been implemented in over 30 industries and 500 scenarios, providing significant value in sectors such as government, finance, manufacturing, healthcare, and autonomous driving [5]. - The open-sourcing initiative is expected to attract more developers and vertical industries to create intelligent solutions based on the Pangu models, further enhancing the integration of AI across different fields [7].
华为开源盘古7B稠密和72B混合专家模型
Guan Cha Zhe Wang· 2025-06-30 02:38
Core Viewpoint - Huawei has officially announced the open-sourcing of its Pangu models, including the 70 billion parameter dense model and the 720 billion parameter mixture of experts (MoE) model, as part of its Ascend ecosystem strategy to promote AI technology across various industries [1][2]. Group 1: Model Development and Open-Sourcing - Huawei has launched the Pangu Pro MoE 72B model weights and basic inference code on an open-source platform, with plans to release the Pangu 7B model weights and inference code soon [1]. - The Pangu Pro MoE model, with 720 billion parameters and 160 billion active parameters, has demonstrated performance comparable to larger models, ranking first among domestic models with fewer than 1 trillion parameters on the SuperCLUE leaderboard [1]. - The company plans to open-source the Pangu 72B MoE model first, followed by smaller models potentially for academic institutions [2]. Group 2: Technical Advancements - Huawei has introduced a new model, the Pangu Ultra MoE, with a parameter scale of 718 billion, trained entirely on the Ascend AI computing platform [2]. - The model training efficiency has been highlighted, with a model compute utilization (MFU) of 41% achieved in pre-training and over 50% in specific configurations [3]. - The architecture of the Ascend super nodes has been optimized for extreme parallelism, enhancing training efficiency and inference performance [3]. Group 3: Ecosystem and Future Plans - Huawei is committed to improving its ecosystem and ensuring compatibility with mainstream industry ecosystems to support customer development [2]. - The company has also announced upgrades to its Pangu models for natural language processing, computer vision, multimodal applications, prediction, and scientific computing at the Huawei Developer Conference [3].