GPGPU
Search documents
龙芯中科:9A1000是龙芯首款GPGPU芯片,在9月底交付流片
Bei Jing Shang Bao· 2025-11-17 14:04
北京商报讯(记者 陶凤 王天逸)11月17日,龙芯中科披露投资者关系活动记录表称,9A1000是龙芯首 款GPGPU芯片,融合了图形和AI的算力,可以用作AIPC。总体上,9A1000图形性能高于CPU中集成的 集显性能,是入门级独显。争取开发9A1000的Windows驱动,也可以与Windows电脑配套。9A1000在9 月底交付流片,仍需要一定时间。 ...
清华大学 集成电路学院在 MICRO 2025 成功举办“Ventus:基于 RISC-V 的高性能开源 GPGPU”学术教程
半导体行业观察· 2025-10-26 03:16
Core Insights - The article discusses the successful organization of a tutorial on "Ventus: A High-performance Open-source GPGPU Based on RISC-V and Its Vector Extension" by Tsinghua University at the IEEE/ACM International Symposium on Microarchitecture (MICRO 2025) [1][15] - The tutorial included eight presentations and a hands-on demonstration, showcasing Tsinghua University's comprehensive research achievements in the open-source GPGPU project "Ventus" [3][15] Group 1: Project Overview - Professor He Hu introduced the Ventus GPGPU project, covering its inception, key technologies, team development, future research goals, and plans for open-source community building [3][15] - The project encompasses a complete layout in instruction set architecture (ISA), hardware architecture, compilers, simulators, and verification tools [3][15] Group 2: GPGPU Design Philosophy and Architecture - PhD student Ma Mingyuan elaborated on the essence of GPGPU as a hardware multithreaded SIMD processor, discussing core issues in instruction design and how Ventus builds a complete GPGPU base on RISC-V Vector extensions [5][16] - Key microarchitecture components such as CTA scheduler, core pipeline, and warp scheduler were introduced [5][16] Group 3: Cache Subsystem and MMU Design - PhD student Sun Haonan presented the cache subsystem and memory management unit (MMU) design under the RISC-V RVWMO memory model, utilizing a release consistency-guided cache coherence mechanism (RCC) [6][16] - The design achieved over 95% L1 DTLB hit rate and over 85% L2 TLB hit rate while controlling MMU overhead between 15% and 25% [6][16] Group 4: Multi-Precision Tensor Core Design - PhD student Liu Wei introduced a new generation of multi-precision reusable tensor cores optimized for AI workloads, supporting various data precisions from FP16 to INT4 [7][16] - Benchmark tests showed significant optimizations of 69.1% in instruction count and 68.4% in execution cycles after integrating the tensor core [7][16] Group 5: Differential Verification Framework - Master's student Xie Wenxuan presented the GVM (GPU Verification Model) framework, which addresses verification challenges posed by out-of-order execution in GPGPU [8][17] - The framework effectively identifies bugs and shortens debugging cycles by integrating with the Ventus software stack [9][17] Group 6: Compiler Design - Dr. Wu Hualin from Zhaosong Technology discussed the design considerations for the OpenCL compiler and Triton AI operator library compiler for Ventus GPGPU [10][17] - Ventus GPGPU supports OpenCL 2.0 profile and has passed over 85% of OpenCL conformance tests [10][17] Group 7: Toolchain Design - Engineer Kong Li introduced the design of the Ventus GPGPU toolchain, which includes core modules such as Compiler, Runtime, Driver, and Simulator [11][17] - The toolchain has achieved stable functionality through OpenCL-CTS and Rodinia benchmark tests [11][17] Group 8: Hands-on Demonstration - The hands-on demonstration provided an entry-level guide for developers to deploy the Ventus environment and run OpenCL programs [12][17] - The team showcased a two-tier FPGA verification platform, successfully running key tests such as vector addition and MNIST inference [13][17] - The tutorial highlighted Tsinghua University's systematic research capabilities in the intersection of RISC-V and GPGPU, marking significant progress in open-source high-performance computing architecture [14][17]
英伟达:从显卡巨头到AI霸主
Tai Mei Ti A P P· 2025-07-14 05:29
Core Insights - Nvidia has undergone a significant strategic transformation from a gaming-focused GPU manufacturer to a core supplier of computing infrastructure driving the global AI wave, achieving a market capitalization that once surpassed $3 trillion [1] - The company's financial performance reflects its market dominance, with Q4 2025 revenue reaching $39.3 billion, a 78% year-over-year increase, and data center revenue soaring to $35.6 billion, up 93% [2][3] Group 1: Market Position and Financial Performance - Nvidia holds a dominant market position in the AI-driven computing landscape, particularly in the data center sector, where its high-performance GPUs are in high demand [2] - The company's data center business has shown exponential revenue growth, with total revenue for fiscal year 2025 reaching $130.5 billion, doubling from the previous year [2] - Nvidia's stock price has surged, making it one of the highest-valued tech companies globally, reflecting investor confidence in its core value and future growth potential in the AI era [2] Group 2: Product and Ecosystem Development - Nvidia's high-end GPUs, such as the H100/H200 and the newly released Blackwell series, are essential for training and inference of large AI models, with significant orders from major cloud service providers [3] - The company has established a strong software ecosystem with platforms like CUDA, cuDNN, and TensorRT, which have become industry standards for AI development, creating a high barrier for competitors [4][11] - Nvidia's vertical integration, from chips to systems and software, has created a robust ecosystem that makes it difficult for competitors to challenge its comprehensive leadership [9][12] Group 3: Strategic Vision and Historical Context - Nvidia's success is attributed to its long-term strategic planning and timely execution, having recognized the potential of GPUs for general-purpose computing early in the 21st century [6] - The introduction of the CUDA platform in 2006 significantly lowered the barrier for GPU parallel computing, laying the groundwork for Nvidia's dominance in AI computing [6][8] - The company's proactive investments in AI-related R&D and its development of integrated solutions, such as the DGX series supercomputers, further enhance its competitive edge [8][12] Group 4: Competitive Landscape and Challenges - Despite its strong position, Nvidia faces challenges from new entrants and existing competitors who are increasing their investments to capture market share [5][13] - The complex global supply chain and geopolitical factors pose potential risks to Nvidia's production capacity and market expansion [5] - Competitors must not only match Nvidia's hardware performance but also invest heavily in software ecosystems and community building to effectively challenge its market dominance [13]