Software Technology Improves Vector Database Scaling and Accuracy in RAG Workflows by Using SSDs

KIOXIA announced the open source release of its new All-in-Storage ANNS with Product Quantization (AiSAQ) technology. A novel “approximate nearest neighbor” search (ANNS) algorithm optimized for SSDs, KIOXIA AiSAQ software delivers scalable performance for retrieval-augmented generation (RAG) without placing index data in DRAM – and instead searching directly on SSDs.

“By leveraging SSD-based ANNS, we deliver performance comparable to leading in-memory solutions, but using a fraction of expensive DRAM—making large-scale RAG applications more accessible than ever.”

AI Tech Insights: Solidigm Extends Agreement with Broadcom on High-Capacity SSD Controllers for AI

Generative AI systems demand significant compute, memory, and storage resources. While they have the potential to drive transformative breakthroughs across various industries, their deployment often comes with high costs. RAG is a critical phase of AI that refines large language models (LLMs) with data specific to the company or application.

A central component of RAG is a vector database that accumulates and converts specific data into feature vectors in the database. RAG also utilizes an ANNS algorithm, which identifies vectors that improve the model based on similarity between the accumulated and target vectors. For RAG to be effective, it must rapidly retrieve the information most relevant to a query. Traditionally, ANNS algorithms are deployed in DRAM to achieve the high-speed performance required for these searches.

KIOXIA AiSAQ technology provides a scalable and efficient ANNS solution for billion-scale datasets with negligible memory usage and fast index switching capabilities.

AI Tech Insights: KIOXIA Showcases Flash Memory Solutions for AI & IoT at CES 2025

Key Benefits of KIOXIA AiSAQ technology:

  • Allows large-scale databases to operate without relying on limited DRAM resources, enhancing the performance of RAG systems.
  • Eliminates the need to load index data into DRAM, enabling the vector database to launch instantly. This supports seamless switching between user-specific or application-specific databases on the same server for efficient RAG service delivery.
  • Optimized for cloud systems by storing indexes in disaggregated storage for sharing across multiple servers. This approach dynamically adjusts vector database search performance for specific users or applications and facilitates the rapid migration of search instances between physical servers.

“Our AiSAQ solution unlocks the potential for RAG applications to scale nearly without limits, with flash-based SSDs at the heart of it all,” said Neville Ichhaporia, senior vice president and general manager of the SSD business unit at KIOXIA America, Inc. “By leveraging SSD-based ANNS, we deliver performance comparable to leading in-memory solutions, but using a fraction of expensive DRAM—making large-scale RAG applications more accessible than ever.”

AI Tech Insights: ADATA Reveals a New Perspective on The AI Era at CES 2025

Source – businesswire

To share your insights, please write to us at news@intentamplify.com