Core Viewpoint - Nvidia's DGX Cloud has shifted from being a competitive AI cloud service for enterprises to primarily serving as internal infrastructure, with most of its computing power now dedicated to Nvidia's own research rather than customer-facing services [4][6]. Summary by Sections DGX Cloud's Rise and Fall - DGX Cloud was launched in 2023 with a pricing model of $36,999 per month for each H100 instance. Initially, this pricing was justified due to GPU shortages, but as supply improved, the value of this "scarcity alternative" has significantly decreased. AWS has reduced rental prices for H100 and A100 GPUs by up to 45%, making DGX Cloud less attractive to customers [6]. Strategic Shift to Lepton - Nvidia has redirected its focus to the Lepton GPU rental market, which operates differently from DGX Cloud. Lepton acts as a "traffic coordinator," routing computing demands to partners like AWS and Azure, rather than directly renting GPUs. This strategy positions Nvidia not as a direct competitor in cloud computing but as an aggregator within the AI cloud economy, allowing it to maintain influence over the GPU market without owning cloud infrastructure [8]. Impact on Developers and the Industry - For developers, transitioning from DGX Cloud to Lepton means accessing GPU computing power at more competitive prices through existing cloud services like AWS or Azure. For Nvidia, this shift reduces conflicts with channel partners and enhances its control over global GPU workload distribution [10].
英伟达一项业务,退居二线