SLURM
Search documents
Building Hyperscaler Engineered for AI with AI Workload Diversity
DDN· 2025-12-22 23:03
Company Overview - Nscale is a vertically integrated AI stack provider, offering end-to-end solutions from infrastructure to cloud [1] - The company customizes data centers for customers, optimizing for specific workloads, similar to a hyperscaler approach for private clouds [2][3] - Nscale is building the largest supercomputer cluster with Microsoft in Europe, comprising approximately 23,000 nodes [4] Technology and Services - Nscale supports diverse AI workloads including model training, fine-tuning, and inference, accommodating various parameters [5] - The company embraces Kubernetes and SLURM for orchestration, providing managed services and bare metal as a service [9][10] - Nscale offers an open AI API compatible interface, enabling scaling and deployment of open source or proprietary models, along with fine-tuning services [12] - The platform supports both Nvidia and AMD GPUs, catering to different customer requirements [13] Future Directions - Nscale aims to provide a global fleet management solution, integrating on-premise and public/private cloud solutions for a consistent customer experience [14] - The company plans to further diversify its AI services, focusing on open source systems and enterprise features like fine-grained access controls [15] - Nscale supports the open-source community through Hugging Face, acting as an inference provider [16]