NVIDIA cuVS
Search documents
KIOXIA Achieves 4.8 Billion High-Dimensional Vector Search Database on a Single Server, with 7.8x Index Build Time Acceleration via GPUs
Businesswire· 2026-03-16 20:45
Core Insights - KIOXIA has successfully demonstrated a high-dimensional vector search database capable of scaling to 4.8 billion vectors on a single server, utilizing its AiSAQ technology and NVIDIA GPU acceleration for significant improvements in index build time [1][3][4] Technology Advancements - The integration of KIOXIA's AiSAQ technology with NVIDIA's cuVS library allows for efficient indexing of 1024-dimensional vectors while minimizing DRAM usage [1][4] - KIOXIA achieved a 20x improvement in index build time, reducing the time from 28.4 days using CPU to 1.4 days with four NVIDIA Hopper GPUs, and end-to-end testing time from 31 days to 4 days [3][5] Market Implications - The advancements in vector databases are crucial for AI applications that require processing large volumes of vectorized information, potentially reaching tens of billions of vectors [3][5] - KIOXIA's technology addresses scalability challenges in retrieval augmented generation (RAG) applications, making large-scale deployments more practical and efficient [4][5] Future Developments - KIOXIA is committed to evolving its AiSAQ capabilities towards supporting trillion-vector deployments, indicating a focus on continuous innovation in storage-driven AI solutions [5]
VAST Data Introduces End-to-End Fully Accelerated AI Data Stack with NVIDIA
Globenewswire· 2026-02-25 18:30
Core Insights - VAST Data announced a fully CUDA accelerated AI data stack in collaboration with NVIDIA, enabling a unified platform for data ingestion, retrieval, analytics, and inference [1][2][3] Group 1: Product and Technology Advancements - The VAST AI Operating System simplifies the AI infrastructure by integrating data services and compute layers, reducing operational complexity [2][3] - The new CNode-X servers are designed to enhance AI infrastructure, providing high-performance storage services and enabling VAST AI OS to run directly on NVIDIA-powered servers [3][4] - VAST's architecture allows orchestration of AI pipelines, high-performance analytics, vector search, and agent runtimes as a single software stack [3][4] Group 2: Performance Improvements - The integration of NVIDIA software libraries and APIs within the VAST platform leads to higher performance, lower latency, and improved efficiency in real-time SQL analytics and AI inferencing workflows [4][5] - Early benchmarks of the Sirius query engine show up to a 44% reduction in query time and up to an 80% reduction in query cost, enhancing the VAST DataBase for modern analytics workloads [6] Group 3: Market Strategy and Partnerships - VAST plans to market CNode-X servers through OEM partners like Cisco and Supermicro, allowing customers to procure GPU-accelerated infrastructure while maintaining a consistent operational experience [8][9] - The collaboration with Cisco and Supermicro aims to deliver an integrated AI Data Platform that simplifies enterprise AI deployment and enhances performance [10] Group 4: Future Developments - VAST will support NVIDIA NIM microservices for scalable AI pipelines and open-source production-ready blueprints for various use cases, including video intelligence and genomics research [12]