Workflow
谷歌 Ironwood TPU:在推理模型训练与推理服务领域实现一流性能、性能成本比及性能功耗比
2025-09-04 14:38

Summary of Ironwood Conference Call Company and Industry - Company: Ironwood - Industry: Machine Learning and Data Center Technology Key Points and Arguments 1. Performance Metrics: Ironwood's 9216 chips utilize optical circuit switches (OCS) to share memory, achieving a directly addressable shared HBM memory capacity of 1.77 PB and 42.5 Exaflops of ML compute using FP8 precision, which sets a new record for shared-memory multiprocessors [7][73] 2. Efficiency Improvements: The company emphasizes industry-leading compute power efficiency, reporting a 2x performance per watt (perf/W) improvement over the previous generation [7][73] 3. Cooling Infrastructure: Ironwood has developed a 3rd generation of liquid cooling infrastructure, which is crucial for maintaining performance in high-density environments [26][75] 4. SparseCore Technology: The 4th generation SparseCore technology is designed to accelerate embeddings and offload collective operations, providing a 2.4x increase in FLOPS compared to the 3rd generation [30][75] 5. Deployment at Hyperscale: The deployment of Ironwood technology at hyperscale is currently underway, indicating strong market demand and operational scaling capabilities [35][73] 6. Reliability and Serviceability: The emphasis on RAS (Reliability, Availability, and Serviceability) is highlighted as a key feature that enables productive scaling to extreme sizes [20][74] Additional Important Content 1. Power Management: Ironwood supports a full-stack approach to proactive power shaping, which is essential for managing unprecedented load swings during large-scale pretraining [34][67] 2. Security Features: The integrated root-of-trust (iROT) controller provides hardware support for secure boot and secure test/debug, enhancing the security of the computing environment [60] 3. Market Position: Ironwood continues to lead in both scale-up and scale-out capabilities, with a focus on maximizing ML throughput under dynamically varying power budgets [73][72] 4. Future Outlook: The company aims to target a 30% additional throughput per data center within the same power budget, showcasing its commitment to innovation and efficiency [72] This summary encapsulates the critical insights from the Ironwood conference call, focusing on performance, efficiency, technology advancements, and strategic positioning within the industry.