DDN Infinia
DDN One-Click RAG Pipeline Demo: DDN Infinia & NVIDIA NIMs
DDN· 2025-11-11 18:56
Welcome to this demonstration. Today we'll be showing how DDN enables a one-click, high-performance RAG pipeline for enterprise use. Our RAG pipeline solution is enterprise-class and easy to deploy and use in any cloud environment, whether AWS, GCP, Azure, any NCP cloud, and of course on-prem. Let's take a closer look at the architecture. This RAG pipeline solution is made up of several NVIDIA NeMo NIMs (NVIDIA Inference Microservices), which host embedding, reranking, and LLM models, a Milvus vector database, a front-end ...
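The stage order the excerpt describes (embedding, vector search, reranking, then handing context to an LLM) can be illustrated with a toy sketch. Everything below is a hypothetical in-memory stand-in: the corpus, the bag-of-words `embed`, and the overlap-based `rerank` are assumptions for illustration and are not NIM or vector database APIs.

```python
import math
from collections import Counter

# Hypothetical corpus standing in for documents indexed in a vector database.
DOCS = [
    "Infinia exposes S3-compatible object storage",
    "NIM microservices host embedding, reranking, and LLM models",
    "The vector database stores document embeddings for similarity search",
]

def embed(text):
    """Toy bag-of-words embedding; a real pipeline would call an embedding model."""
    return Counter(text.lower().replace(",", " ").split())

def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def retrieve(query, k=2):
    """Vector-search stage: return the k documents most similar to the query."""
    q = embed(query)
    return sorted(DOCS, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]

def rerank(query, candidates):
    """Toy reranker: order candidates by the number of terms shared with the query."""
    q_terms = set(embed(query))
    return sorted(candidates, key=lambda d: len(q_terms & set(embed(d))), reverse=True)

def build_prompt(query, k=2):
    """Assemble the text that would be sent to the LLM stage."""
    context = rerank(query, retrieve(query, k))
    return "Context:\n" + "\n".join(context) + "\n\nQuestion: " + query

print(build_prompt("which models do NIM microservices host"))
```

In a real deployment each function would be a network call to a separate service; the sketch only shows how the stages chain together.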
DDN Infinia on OCI: High-Performance AI Storage
DDN· 2025-11-11 18:56
Performance Overview
- DDN Infinia demonstrates strong performance on Oracle Cloud Infrastructure (OCI) with a small six-node cluster [7]
- Achieved a consistent 5 ms time to first byte (TTFB), which is excellent for S3 object I/O [6]

Throughput Metrics
- Achieved approximately 30 GB/s of PUT throughput during object population [5]
- Each client and Infinia node processed PUTs at roughly 5 GB/s [5]
- Sustained approximately 37.5 GB/s of GET throughput during the GET benchmark [6]
- Load was evenly distributed across all clients and Infinia nodes at around 6.5 GB/s of throughput during GET operations [6]

Infrastructure and Configuration
- The test used six BM.DenseIO.E5 compute instances as hosts for the Infinia cluster [2]
- Six BM.Standard.E5.192 instances, each with a single 100 Gb/s network connection, were used as clients to avoid networking bottlenecks [2]
- Only 32 of the 128 cores available in the DenseIO.E5 instances were used by the Infinia software [2]
- DDN is investigating other OCI instance types to avoid overallocating hardware [3]

Technology and Architecture
- The Infinia architecture provides data management capabilities, including data I/O paths, object file querying, a scale-out KV store, always-on encryption, and data reduction [2]
- Infinia is fully software-defined and containerized, so it can run on physical or virtualized hardware with Intel, AMD, or ARM processors [2]
- It implements high-performance erasure coding, custom fault domains, and the ability to use both TLC and QLC flash [2]

Testing Methodology
- I/O was generated using warp in distributed benchmarking mode to ensure a full mesh of I/O across all clients and Infinia cluster nodes [3]
- Parallel warp was run across all six clients and six Infinia nodes during the PUT and GET tests [4][5][6]

Disclaimer
- The information presented covers potential future integrations and is a tech preview [1]
- The overall capabilities, including the performance of this feature, can and will change [1]
- No timelines for delivering this capability should be inferred from this demo [1]
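As a quick sanity check on the throughput figures summarized above, the per-node and aggregate numbers can be related with simple arithmetic. This is a minimal sketch that assumes load is split evenly across the six nodes, as the summary states; the function names are illustrative only.

```python
NODES = 6  # Infinia nodes (and, equally, clients) in the test

def aggregate_gbps(per_node_gbps, nodes=NODES):
    """Aggregate throughput if every node sustains the same rate."""
    return per_node_gbps * nodes

def per_node_gbps(total_gbps, nodes=NODES):
    """Per-node share of an evenly distributed aggregate rate."""
    return total_gbps / nodes

put_total = aggregate_gbps(5.0)   # PUT: roughly 5 GB/s per node
get_share = per_node_gbps(37.5)   # GET: roughly 37.5 GB/s aggregate

print(f"PUT aggregate: {put_total} GB/s")
print(f"GET per node:  {get_share} GB/s")
```

The PUT figure reproduces the reported aggregate of about 30 GB/s. The GET figure works out to 6.25 GB/s per node, slightly below the "around 6.5 GB/s" quoted in the summary, which suggests one of the two GET numbers is rounded.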
What’s New and What’s Coming at DDN - Dr. James Coomer, DDN
DDN· 2025-09-18 15:11
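DDN EXAScaler Product Features and Advantages
- DDN EXAScaler is a parallel file system designed for large-scale data processing. It aims to accelerate GPU traffic and raise GPU productivity, and it is used across industries such as life sciences and finance [1]
- By eliminating I/O wait time, the technology keeps GPUs continuously supplied with data, which speeds up model training and inference and increases token generation rates [1]
- DDN's solutions aim to deliver maximum performance with minimal physical space, power consumption, and network footprint [1]
- DDN offers multiple flash configurations (TLC, QLC) and hybrid systems (HDD) to meet different customer requirements for cost, performance, and capacity, and supports presenting these media types under a single mount point [1][2]
- The DDN EXAScaler client is intelligent: it is aware of data location, so it can optimize the data access path and improve efficiency [2]
- DDN provides a data reduction system that uses client-side compression to compress data without affecting storage performance. Reduction ratios are typically 2x to 4x, and up to 50x for text and log data [2]
- DDN supports online upgrades, so the system can be upgraded while it is running, which is critical for customers who need continuous service [1][2]
- DDN provides the EMF analysis tool for comprehensive network testing, helping customers quickly find and resolve network problems and keep the system running stably [2]
- DDN EXAScaler supports access over multiple protocols, including S3, NFS, SMB, and the native parallel file system, and is compatible with open-source monitoring tools such as Prometheus and Grafana [2]
- DDN's monitoring system shows which users, clients, or jobs are putting pressure on the file system, helping cloud service providers ensure fair data access [2]

DDN AI400X3
- DDN introduced the AI400X3, designed for the NVIDIA Blackwell architecture, to meet the data storage and access demands created by rapid advances in GPU technology [1]
- The AI400X3 delivers 150 GB/s of network throughput in 2U and a checkpoint speed of 95 GB/s [1][3]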
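To put the quoted figures in context, the following sketch estimates how long a checkpoint write would take at the stated 95 GB/s and how much logical data fits at a given reduction ratio. The checkpoint size and raw capacity are hypothetical inputs, and the calculation assumes the quoted rate is sustained for the whole write, which real workloads may not achieve.

```python
CHECKPOINT_GBPS = 95.0  # checkpoint speed quoted for the AI400X3

def checkpoint_seconds(size_gb, rate_gbps=CHECKPOINT_GBPS):
    """Ideal time to write a checkpoint of size_gb at a sustained rate."""
    return size_gb / rate_gbps

def effective_capacity_tb(raw_tb, reduction_ratio):
    """Logical capacity for a given raw capacity and data reduction ratio."""
    return raw_tb * reduction_ratio

# Hypothetical 950 GB checkpoint and 100 TB of raw capacity.
print(checkpoint_seconds(950))            # seconds at 95 GB/s
for ratio in (2, 4, 50):                  # typical range, then the text/log best case
    print(ratio, effective_capacity_tb(100, ratio))
```

The ratios 2, 4, and 50 come from the summary; actual reduction depends on how compressible the data is.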
Ask the Experts: Turbocharge Performance with DDN Infinia on Oracle Cloud
DDN· 2025-09-11 15:32
The fastest AI isn’t just about GPUs. It’s about removing the I/O bottlenecks that slow your business objectives down. Discover how DDN Infinia and Oracle Cloud Infrastructure (OCI) are redefining AI performance. Join this Ask the Experts session to learn how Infinia’s high-performance, S3-compatible storage, along with Oracle’s powerful and scalable cloud infrastructure, delivers ultra-low latency, massive throughput, and linear scalability for the most demanding AI workloads. What you’ll learn: - The biggest challeng ...