Workflow
Data Intelligence Platform
icon
Search documents
Building Scalable Foundations for Large Language Models
DDN· 2025-05-27 22:00
AI Infrastructure & Market Trends - Modern AI applications are expanding across various sectors like finance, energy, healthcare, and research [3] - The industry is evolving from initial LLM training to Retrieval Augmented Generation (RAG) pipelines and agentic AI [3] - Vulture is positioned as an alternative hyperscaler, offering cloud infrastructure with 50-90% cost savings compared to traditional providers [4] - A new 10-year cycle requires rethinking infrastructure to support global AI model deployment, necessitating AI-native architectures [4] Vulture & DDN Partnership - Vulture and DDN share a vision for radically rethinking the infrastructure landscape to support global AI deployment [4] - The partnership aims to build a data pipeline to bring data to GPU clusters for training, tuning, and deploying models [4] - Vulture provides the compute infrastructure pipeline, while DDN offers the data intelligence platform to move data [4] Scalability & Flexibility - Enterprises need composable infrastructure for cost-efficient AI model delivery at scale, including automated provisioning of GPUs, models, networking, and storage [2] - Elasticity is crucial to scale GPU and storage resources up and down based on demand, avoiding over-provisioning [3] - Vulture's worldwide serverless inference infrastructure scales GPU resources to meet peak demand in different regions, optimizing costs [3] Performance & Customer Experience - Improving customer experience requires lightning-fast and relevant responses, making time to first token and tokens per second critical metrics [4] - Consistency in response times is essential, even with thousands of concurrent users [4] - The fastest response for a customer is the ultimate measure of customer satisfaction [4] Data Intelligence Platform - DDN's Exascaler offers high throughput for training, with up to 16x faster data loading and checkpointing compared to other parallel file systems [5] - DDN's Infinia provides low latency for tokenization, vector search, and RAG lookups, with up to 30% lower latency [5] - The DDN data intelligence platform helps speed up data response times, enabling saturated GPUs to respond quickly [6]
Introducing the Data Intelligence Platform: DDN Infinia 2.0
DDN· 2025-05-15 19:49
uh I would like to talk about uh infinia uh most of you have probably seen we've done a pretty great announcement uh we delivered a product in version 20 lots of people ask why is it 2.0% well if it's already in production you shouldn't call it 1.0% at the time with G and so we gave it a 2.0% given that it's also a pretty significant large deployment we saw 2.0% is probably a a better number to start with so what is infinia uh infinia is our great software defined Next Generation data intelligence platform ...
Nebius and DDN: Partnering to Power the Future of Enterprise AI
DDN· 2025-05-01 21:27
[Music] I'm Laurel Roseman. I'm vice president of channel and alliances for North America for Nebius. I think the AI market today as we all know is extremely competitive. So to find partners and providers in the GPU space that can really be an advocate for the customer, build the product in the stack the way the customer needs to support their business and be nimble in how you work with them and provide enterprisegrade infrastructure. number one core um core importance there. That is how you're going to win ...