Core Insights
- Nebius will start offering NVIDIA's Vera Rubin NVL72 systems in the US and Europe in the second half of 2026, making it one of the first AI cloud providers to deploy this computing platform [1]
- Integrating Vera Rubin NVL72 into Nebius's full-stack infrastructure will let customers build next-generation AI applications with regional availability and control [1]
- Nebius Token Factory is described as a post-training platform for enterprises, intended to support the development of AI systems [1]

Product and Technology Overview
- The Rubin computing platform will complement Nebius's existing NVIDIA GB200 NVL72 and Grace Blackwell Ultra NVL72 capacity, expanding customer options [2]
- NVIDIA CEO Jensen Huang announced that Vera Rubin has entered full-scale production, positioning it as the successor to Grace Blackwell [2]
- Vera Rubin is characterized as an AI supercomputer composed of six core components, including the Vera CPU and the Rubin GPU, designed for next-generation AI workloads in cloud and large data centers [2]

Performance and Cost Efficiency
- The Rubin GPU features a third-generation Transformer Engine and delivers 50 PFLOPS of NVFP4 inference performance, five times that of the previous-generation Blackwell GPU [3]
- The Vera Rubin platform can train large-scale Mixture of Experts (MoE) models in the same training time with only a quarter of the GPUs, reducing the training cost per token to one-seventh of the previous generation's [3]
- Vera Rubin will support third-generation confidential computing, making it the industry's first rack-level trusted computing platform and targeting AI scenarios with high demands for security isolation, data privacy, and multi-tenancy [3]
Among the first cloud providers for NVIDIA Rubin: Nebius (NBIS.US) to launch Vera Rubin NVL72 compute clusters in the second half of 2026