国芯科技(688262.SH):CNN300预计将支持INT8/FP8/FP16等常规AI应用所需要的数据类型

Core Insights - Guoxin Technology (688262.SH) has introduced the CNN200, which utilizes a GCU+NN network architecture, achieving a maximum single-core computing power of 10 TOPS@INT8, suitable for various edge computing AI SoC chips and applications including robotic dogs [1] Group 1: Product Features - The CNN200 features a pulsed array computing unit that optimizes dynamic power consumption and memory area, effectively reducing energy consumption and latency, achieving industry-leading energy efficiency [1] - It integrates on-chip caching and inter-layer data sharing technology, significantly reducing DDR access [1] - The hardware acceleration unit supports over 90 types of neural network operators and has a quick expansion interface design to adapt to the development of neural network models [1] Group 2: Compatibility and Ecosystem - The CNN200 supports post-training quantization (PTQ) with various quantization strategies, including symmetric, asymmetric, layer-wise, and channel-wise, compatible with mainstream neural network structures like CNN and RNN, and supports INT8 and FP16 data precision [1] - It is compatible with major deep learning frameworks such as PyTorch, TensorFlow, ONNX, and PaddlePaddle, demonstrating broad ecological adaptability [1] - The accompanying NPU toolchain includes tools for model format conversion, preprocessing, quantization, compilation, and simulation, providing software ecosystem support for NPU inference implementation and application deployment [1] Group 3: Future Developments - The company is developing the CNN300, aimed at AIPC applications, which is expected to support data types such as INT8, FP8, and FP16, and will cater to both traditional CNN and RNN applications as well as the latest popular LLM (large language model) applications [1] - The CNN300 will facilitate the offloading of commonly used large models like Deepseek, Qwen, and LLaMa, meeting the needs of conventional applications such as speech, image, and video recognition, as well as high-quality speech and video display for AIPC applications, generative AI, multimodal interaction, and knowledge management [1]