Sunrise Launches Inference GPU Chip Qiwang S3, Advancing Joint Development of an Inference Cloud Ecosystem
Zheng Quan Ri Bao Wang·2026-01-28 12:53

Core Insights
- Sunrise launched its new inference GPU chip, the "Qiwang S3", at the first Sunrise GPU Summit. This was the company's first public appearance since raising approximately 3 billion yuan in strategic financing over the past year [1]
- The company emphasizes an "All-in inference" approach focused on long-term delivery capability, unit cost, and system stability, as inference becomes the primary consumer of computing power in the AI industry [1][3]
- The Qiwang S3 is designed for large model inference and achieves a more than 10-fold improvement in overall cost-effectiveness over its predecessor in typical inference scenarios [1][2]

Product Features
- The Qiwang S3 supports precision switching from FP16 to FP4, significantly improving low-precision inference efficiency while maintaining model performance [2]
- It is the first domestic GPU product to adopt LPDDR6 memory, increasing memory capacity by four times compared to the previous generation and addressing the memory bottlenecks common in large model inference [2]
- The per-token inference cost in mainstream large model scenarios has fallen by approximately 90% compared to the previous generation, enabling scalable deployment of the "one cent per million tokens" concept [2]

Ecosystem Development
- Sunrise aims to build a comprehensive "chip + system + ecosystem" layout around inference scenarios, positioning itself as more than a chip manufacturer [4]
- The company is developing a collaborative inference cloud that integrates dispersed computing resources into a unified pool of inference computing power, giving enterprises on-demand access to large model inference services [3]
- The inference cloud is based on the Qiwang S3 and uses GPU pooling and elastic scheduling, allowing businesses to scale computing power flexibly according to their workload [3]

Strategic Vision
- The company believes the AI industry is shifting from a "training-driven" model to an "inference-driven" model, which prioritizes long-term delivery capability and system stability over one-time training investment [3][4]
- Sunrise's chairman stated that whoever can continuously reduce inference costs will control the cost curve of the AI industry, and highlighted systematic innovation in the inference computing power system as essential to sustainable growth in AI applications [4]
