Workflow
算力调度
icon
Search documents
一次算力政策研讨实录:算力调度的七个问题
3 6 Ke· 2026-02-12 11:08
Core Insights - The next five years (2026-2030) are considered a critical phase for the development of China's computing power industry, driven by the rapid growth of the AI sector which demands higher computing capabilities [1] - Key challenges facing the industry include limited capital expenditure and restricted supply of advanced AI chips, prompting companies to explore more efficient and cost-effective systems that integrate chips, algorithms, energy, and networks [1] Group 1: Policy and Industry Discussions - The National Information Center has organized discussions focusing on the value of "computing power scheduling" for industry development, involving experts from telecom operators, research institutions, and consulting firms [2] - Seven key issues were identified during these discussions, emphasizing the importance of efficient resource allocation in the computing power sector [2] Group 2: Understanding Computing Power Scheduling - "Computing power scheduling" is recognized as a method to optimize existing computing resources, reduce idle capacity, and achieve reasonable allocation, which is more complex than traditional resource scheduling like water or electricity [3][4] - The construction of a national integrated computing power network requires a comprehensive approach that combines government initiatives, standard-setting, and market operations to ensure efficient functioning [3] Group 3: Current State and Future Directions - As of the end of 2024, China's computing power centers are expected to exceed 9 million standard racks, with a computing power scale of 280 EFLOPS, ranking among the top globally [3] - The average energy utilization efficiency (PUE) of computing centers has improved to 1.46, indicating progress in resource efficiency [3] Group 4: Challenges and Considerations - There are concerns regarding the network costs associated with cross-regional scheduling, which may offset the benefits of lower electricity prices in the west [20] - Data security remains a significant issue, as clients worry about the privacy and sovereignty of their data when it is scheduled outside local environments [20] Group 5: Importance of Scheduling in the Integrated Network - Computing power scheduling is viewed as the "nervous system" of the national integrated computing power network, enhancing the flexibility of data centers and improving overall resource utilization [21][23] - Effective scheduling can lead to the integration of fragmented computing resources, reducing waste from redundant infrastructure [21] Group 6: Construction of a Unified Scheduling Network - Building a national computing power scheduling network involves more than just infrastructure; it requires standardization, platform development, and innovative mechanisms [24] - A successful network will necessitate a unified standard for computing power, a multi-level scheduling platform, and a sustainable operational service mechanism to facilitate rapid matching and flexible transactions [24][25]
A股晚间热点 | 国常会重磅!研究促进有效投资政策措施
智通财经网· 2026-02-06 16:15
Group 1 - The State Council, led by Premier Li Qiang, emphasizes the importance of promoting effective investment to stabilize economic growth and enhance development momentum, focusing on infrastructure, urban renewal, public services, and emerging industries [1][2] - The meeting discusses the need for innovative policy measures and effective use of central budget investments, long-term special bonds, and local government bonds to support major projects [1] - The Ministry of Industry and Information Technology (MIIT) announces the construction of a national computing power interconnection node system to enhance the efficiency of computing resources across regions and industries [3] Group 2 - The People's Bank of China and eight other departments issue a notice to continue the crackdown on virtual currency mining activities, emphasizing that virtual currencies do not have the same legal status as fiat currencies [6] - The National Development and Reform Commission is tasked with strictly controlling virtual currency mining activities and shutting down existing projects [6] - The Ministry of Commerce plans to introduce new policies to expand inbound consumption and support sectors like home services and the automotive aftermarket [9][10] Group 3 - Amazon's stock drops over 9% following the announcement of a $200 billion capital expenditure plan, despite reporting a 14% year-over-year revenue increase to $213.4 billion for Q4 [8] - The China Securities Regulatory Commission intensifies regulatory actions, with 19 companies facing investigations or penalties since the beginning of the year, indicating a "zero tolerance" approach to market violations [12] Group 4 - The National Health Commission seeks public opinion on national standards for prepared dishes, which could benefit leading companies in the sector by enhancing industry order and compliance [14] - Eight companies are projected to see a doubling in net profit for the year, with "Good Idea" leading with an expected profit increase of 1281.25% [16]
思特奇(300608) - 300608思特奇投资者关系管理信息20260204
2026-02-04 10:52
Group 1: Company Overview - The company has established a solid foundation in the telecommunications industry, focusing on empowering 30 operators and various enterprises to facilitate their digital transformation [1][2]. - The company aims for comprehensive breakthroughs and value growth through its development in the telecommunications sector [2]. Group 2: Competitive Strategy - The company's competitive strategy varies by industry; it adopts a direct sales model in the operator sector while emphasizing ecosystem partnerships in urban and digital economy sectors [2][3]. - In the face of competition, the company prioritizes collaboration to expand market opportunities rather than solely focusing on direct sales [3]. Group 3: International Expansion - The company has initiated international business expansion, starting from Shenzhen and targeting operators in Hong Kong as a launch point for further growth [3]. - The focus in international markets is on standardized products in AI and computing power [3]. Group 4: AI and Computing Power - The company integrates demand in the computing power sector, participating in national development rather than just focusing on supply-side solutions [3]. - AI applications are utilized to enhance operational efficiency and reduce labor costs through automation in code verification and digitalization of processes [3]. Group 5: Revenue and Orders - The company’s revenue is significantly dependent on operator orders, with expectations for growth in the second and third lines of business in the coming years [3]. - Specific order-related information will be available in the company's annual report [3].
大模型API的大众点评来了:7×24小时实测,毫秒级延迟智能路由,选API必备
量子位· 2026-02-02 03:39
Core Viewpoint - The article discusses the challenges faced by developers in selecting reliable and cost-effective API services for AI applications, highlighting the need for a comprehensive evaluation tool to streamline the process [1][3][4]. Group 1: API Selection Challenges - Developers often experience frustration when choosing APIs due to significant variations in pricing, latency, stability, and throughput across different vendors [2]. - The current API selection process relies heavily on trial and error, leading to inefficiencies and repeated efforts among teams [3][4]. - There is a lack of a centralized tool that provides clear comparisons of API performance, forcing developers to act as procurement agents [5][10]. Group 2: Introduction of AI Ping - AI Ping, developed by Tsinghua University-affiliated company Qingcheng Jizhi, aims to address these challenges by providing a platform that evaluates and compares API performance continuously [7][8]. - The platform operates like a review system for large model APIs, offering developers a clear overview of performance metrics [9][11]. Group 3: Core Features of AI Ping - AI Ping features a 24/7 performance evaluation system that provides objective rankings based on real-time data, addressing the issues of information asymmetry and blind selection [19][21]. - The platform includes a dynamic routing feature that selects the best-performing API based on real-time assessments, ensuring continuous service availability [27][29]. - AI Ping standardizes API metrics across different vendors, simplifying the integration process for developers and reducing maintenance costs [33][35][39]. Group 4: Industry Impact and Future Prospects - AI Ping fills a significant gap in real-time performance monitoring for large model services, promoting transparency in API selection [67][70]. - The platform encourages competition among API providers, leading to improved service quality and reduced costs for developers [72][73]. - As more companies adopt AI Ping, the industry is expected to shift from experience-driven to data-driven decision-making in API selection [71].
北京佳杰云星数据科技有限公司:算力调度平台赋能东莞大模型中心,构建三方共赢数字生态
Jing Ji Guan Cha Wang· 2026-01-29 05:49
Core Insights - Beijing Jiajie Yunxing Data Technology Co., Ltd. focuses on three core businesses: intelligent agent development, computing power scheduling and management, and multi-cloud management, establishing itself as a key player in digital infrastructure construction through continuous R&D investment and technological innovation [2] Group 1: Challenges in AI Infrastructure - Dongguan, as a manufacturing hub and a front-runner in digital economy development, needs to build a city-level AI public service platform to drive regional industrial upgrades, facing two main challenges: uneven distribution of AI computing resources leading to urgent demand gaps among enterprises, and the need for a sustainable operational profit model for the large model center [4] - The lack of a standardized and complete operational platform to support the full lifecycle of billing and management is a significant challenge for the platform construction [4] Group 2: Solutions and Implementation - Jiajie Yunxing is deeply involved in the construction and operation of Dongguan's AI large model center, leveraging its technological accumulation to provide targeted solutions through its self-developed computing power scheduling and operational platform product system [4] - The platform, set to launch in 2025, has already achieved significant results since its implementation began in 2024, aiming to fully release its value upon official launch [6] Group 3: Key Features of the Platform - The platform supports unified management of computing resources through automated and manual registration modes, compatible with various mainstream chips and AI development platforms, breaking down barriers of decentralized resource management [9] - A one-stop service portal is established for enterprise users, presenting over 50 types of computing-related products, simplifying the usage process and significantly lowering application thresholds [9] - The platform encourages ecosystem partners to participate by providing a self-listing sales channel for computing and algorithm suppliers, creating a virtuous cycle of platform construction, product provision, and service enjoyment for enterprises [9] Group 4: Value Creation - Technically, the platform achieves centralized management and efficient scheduling of different types of AI computing resources, significantly improving resource utilization compared to traditional models [9] - Service-wise, the centralized service entry enhances the efficiency of enterprises obtaining computing services, covering diverse needs across manufacturing, finance, technology, and more [9] - Operationally, the online billing settlement system and multi-dimensional analysis functions support various promotional marketing methods, reducing enterprise usage costs while ensuring sustainable profitability for the platform [9] - Ecologically, the platform successfully connects operators, computing suppliers, and enterprise users, creating a win-win ecosystem that injects lasting momentum into Dongguan's digital economy [9]
恒为科技20260114
2026-01-15 01:06
Summary of the Conference Call for Hengwei Technology Company Overview - Hengwei Technology has acquired Shuheng Technology to enhance its AI application capabilities, addressing the revenue gap in AI applications compared to computational power in China [2][3] - Shuheng Technology focuses on marketing scenarios, providing marketing services through an AI platform, leveraging the founding team's extensive experience in big data and traffic investment [2] Core Business and Development Direction - Hengwei Technology's main business is divided into two segments: network visualization and intelligent system platforms, with a focus on AI infrastructure and computational products [3] - The company plans to seek acquisition targets in the AI application sector by late 2024 to early 2025, aiming to bridge the revenue gap in AI applications [3] Acquisition Rationale - Shuheng Technology was chosen for its ability to convert deep understanding of marketing scenarios into tangible revenue and profit [4] - The company operates on a results-driven business model, charging based on agreed KPIs, ensuring steady profit growth [4][15] Technological Development - Shuheng Technology has developed the SGPT model since 2020, emphasizing small parameter models to solve practical problems, with the local deployment of SGPT 1.0 completed in 2023 [2][6] - The company has built a complete technology stack from computational power to application layers, adapting various GPU types and creating a flexible computational scheduling platform [7] Marketing Services and AI Platform - The Zhixin AI platform integrates advanced AI technologies to enhance marketing efficiency, offering a comprehensive solution that includes a business process center and general Q&A functions [9] - The AI agents can interpret client needs and guide planners in creating precise marketing strategies, improving communication efficiency between sales and planning teams [10] Market Analysis and Data Utilization - Zhixin's market analysis combines official sources and online search capabilities, ensuring comprehensive and reliable data collection [11] - The company has accumulated significant digital assets, which provide a solid foundation for its marketing solutions [12] Client Structure and Industry Focus - Shuheng Technology is focusing on fast-moving consumer goods, beverages, and travel industries, with plans to expand into automotive, education, and telecom sectors [13] - The company aims to develop new AI products for various industries, expecting significant growth in 2026 [13] Capacity and Human Resource Management - The company is experiencing a capacity explosion starting in 2025, with AI empowering teams to improve efficiency without explosive personnel growth [14] Profitability and Future Trends - Despite ongoing audits, net profit is expected to remain stable due to technology reuse and efficiency improvements, with a strategy focused on rapid expansion into new industries [20] Synergies with Hengwei Technology - Collaboration between Hengwei and Shuheng includes integrating AI models with hardware products and jointly developing computational scheduling platforms, enhancing market penetration [21]
让算力像水和电一样方便取用(创新故事)
Ren Min Wang· 2026-01-11 22:43
Core Insights - The article highlights the successful implementation of a dual-rate mixed network trial with 400G and 800G speeds in key computing power regions of China, particularly in the Guizhou-Guangzhou corridor, enhancing efficient interconnection and collaborative development of computing resources [1] - The "East Data West Computing" initiative has positioned Guizhou as a national hub for integrated computing networks, transitioning from a "data warehouse" to a "computing factory" [1][4] - Guizhou's digital economy is projected to exceed 100 billion yuan in software and information technology services revenue by 2024, with continuous growth rates among the highest in the country [4] Group 1: Computing Power Infrastructure - Guizhou has established a direct connection to 24 cities through a low-loss optical cable network, significantly reducing data transmission time by 33% and lowering costs by over 30% for businesses in the Guangdong-Hong Kong-Macau Greater Bay Area [2] - The computing power capacity in Guizhou has reached 150 billion billion calculations per second, with over 90% of intelligent computing being domestically sourced and regionally concentrated [4] Group 2: Efficient Computing Power Scheduling - The "Xirang" computing power scheduling platform has been launched in Guizhou, enabling nationwide coordination and scheduling of computing resources, akin to utilities like water and electricity [3] - Collaborations with local meteorological agencies have led to improved weather prediction accuracy and efficiency through the application of lightweight meteorological diffusion models [3] Group 3: Future Development and Industry Focus - Guizhou aims to further develop its computing power, data, applications, and industries, with a focus on intelligent computing, high-quality data aggregation, and artificial intelligence [5] - The establishment of an artificial intelligence laboratory in Guizhou is expected to attract more enterprises along the computing power industry chain, contributing to high-quality digital economic development [3][5]
下一个“AI卖铲人”:算力调度是推理盈利关键,向量数据库成刚需
Hua Er Jie Jian Wen· 2025-12-24 04:17
Core Insights - The report highlights the emergence of AI infrastructure software (AI Infra) as a critical enabler for the deployment of generative AI applications, marking a golden development period for infrastructure software [1] - Unlike the model training phase dominated by tech giants, the inference and application deployment stages present new commercial opportunities for independent software vendors [1] - Key products in this space include computing scheduling software and data-related software, with computing scheduling capabilities directly impacting the profitability of model inference services [1][2] Computing Scheduling - AI Infra is designed to efficiently manage and optimize AI workloads, focusing on large-scale training and inference tasks [2] - Cost control is crucial in the context of a price war among domestic models, with Deepseek V3 pricing significantly lower than overseas counterparts [5] - Major companies like Huawei and Alibaba have developed advanced computing scheduling platforms that enhance resource utilization and reduce GPU requirements significantly [5][6] - For instance, Huawei's Flex:ai improves utilization by 30%, while Alibaba's Aegaeon reduces GPU usage by 82% through token-level dynamic scheduling [5][6] Profitability Analysis - The report indicates that optimizing computing scheduling can serve as a hidden lever for improving gross margins, with a potential increase from 52% to 80% in gross margin by enhancing single-card throughput [6] - The sensitivity analysis shows that a 10% improvement in throughput can lead to a gross margin increase of 2-7 percentage points [6] Vector Databases - The rise of RAG (Retrieval-Augmented Generation) technology has made vector databases a necessity for enterprises, with Gartner predicting a 68% adoption rate by 2025 [10] - Vector databases are essential for supporting high-speed retrieval of massive datasets, which is critical for RAG applications [10] - The demand for vector databases is expected to surge, driven by a tenfold increase in token consumption from API integrations with large models [11] Database Landscape - The data architecture is shifting from "analysis-first" to "real-time operations + analysis collaboration," emphasizing the need for low-latency processing [12][15] - MongoDB is positioned well in the market due to its low entry barriers and adaptability to unstructured data, with significant revenue growth projected [16] - Snowflake and Databricks are expanding their offerings to include full-stack tools, with both companies reporting substantial revenue growth and customer retention rates [17] Storage Architecture - The transition to real-time AI inference is reshaping storage architecture, with a focus on reducing IO latency [18] - NVIDIA's SCADA solution demonstrates significant improvements in IO scheduling efficiency, highlighting the importance of storage performance in AI applications [18][19]
我省构建异构智算调度技术破解电力行业“算力调度难”
Xin Hua Ri Bao· 2025-12-23 21:48
Core Viewpoint - The "Electric Power Heterogeneous Intelligent Scheduling Technology" developed by Nanjing Nari Ruijun Technology Co., Ltd., a subsidiary of State Grid NARI Group, has achieved international leading standards, effectively addressing the power industry's computing resource supply-demand contradiction [1][2]. Group 1: Technology Development - The technology enables efficient collaboration of heterogeneous computing resources from different brands and models, overcoming the "computing island" phenomenon where high-end resources are over-utilized while mid-to-low-end resources remain idle [1]. - The team has innovated a series of technologies to achieve "interconnectivity" of heterogeneous computing resources, including a unified interface for management and a "network + computing" collaborative mechanism [1]. Group 2: Application and Performance - The "Ruiteng Intelligent Computing Scheduling Platform" has demonstrated outstanding performance, with an average work order response time of only 7.241 seconds and an increase in concurrent processing capacity from 40 to 800, effectively doubling the efficiency of grassroots business processing [2]. - The platform is currently operational in 11 provincial power companies, including those in Jiangsu and Shandong, and is gradually being promoted nationwide, with plans to expand into military, telecommunications, and public security sectors [2]. Group 3: Achievements and Future Plans - The project has secured 19 patent authorizations, published 19 high-level papers, and led the formulation of 3 national standards, with core technologies being industry-first innovations [2]. - The team aims to continuously optimize the technology system to create a self-controllable intelligent computing foundation platform, empowering more industries in their digital transformation and contributing to high-quality development of the digital economy [2].
未来网络试验设施正式投入运行,完成120项重大创新试验
Huan Qiu Wang Zi Xun· 2025-12-06 01:50
Core Insights - The Future Network Experimental Facility, China's first major national technology infrastructure in the information and communication sector, has officially commenced operations [1] Group 1: Facility Overview - The facility is located in Nanjing, Jiangsu, and was completed in August 2024 [1] - It covers 40 cities nationwide, featuring 88 backbone network nodes and 133 edge network nodes, with a total optical transmission length exceeding 55,000 kilometers [1] - The facility can support 4,096 heterogeneous services for parallel testing and is capable of interconnecting with existing domestic and international networks [1] Group 2: Performance Metrics - The facility enables efficient, high-speed, low-latency, and low-jitter data transmission, with a packet loss rate of only one in a million [1] Group 3: Service and Collaboration - To date, the facility has served major national research institutions such as the National Astronomical Observatory and the Institute of High Energy Physics, as well as telecom operators like China Telecom, China Mobile, China Unicom, and China Broadcasting Network [1] - It has collaborated with universities including Peking University, Nanjing University, Zhejiang University, and the Chinese University of Hong Kong, along with leading companies like Huawei, H3C, and Baidu, completing 120 significant innovation experiments [1] - The experiments cover critical dimensions such as core chips, network operating systems, routing control, security and trust, large-scale networking, and new AI services [1]