Distributed Execution of index.search(query_vector, k) Without Loading a Large Faiss Index into Memory #3567
ivishalanand started this conversation in General · 1 comment
I'm working with a large Parquet file containing several million embeddings, and I'm trying to perform

index.search(query_vector, k)

in a distributed fashion. The issue is the size of the Faiss index, which is approximately 16 GB: loading it entirely into memory on every worker is inefficient and creates a performance bottleneck. In addition, I want to avoid converting the embeddings into pandas or npy formats, for performance reasons.

I am looking for an efficient way to run the search without loading the entire Faiss index into memory or converting the embeddings into pandas or npy formats.
Any guidance or suggestions would be greatly appreciated. Thank you.
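For context, here is a minimal sketch of the kind of setup described above, assuming the index has been split into per-worker shard files and the queries live in a Parquet file with a list-of-floats column. The file names ("queries.parquet", "shard0.faiss", ...) and the column name "embedding" are placeholders, the shards are searched sequentially here for brevity, and the merge assumes an L2 index (smaller distance is better) with globally unique vector ids:

```python
import numpy as np
import pyarrow.parquet as pq
import faiss

# Read query vectors straight from Parquet into a float32 matrix,
# bypassing pandas entirely (file and column names are placeholders).
table = pq.read_table("queries.parquet", columns=["embedding"])
queries = np.asarray(table.column("embedding").to_pylist(), dtype="float32")

k = 10

def search_shard(shard_path):
    # Each worker opens only its own shard, never the full 16 GB index.
    index = faiss.read_index(shard_path)
    return index.search(queries, k)  # (distances, ids), each of shape (n, k)

# One shard per worker (run sequentially here), then merge the
# per-shard top-k lists into a global top-k by distance.
results = [search_shard(p) for p in ["shard0.faiss", "shard1.faiss"]]
all_d = np.concatenate([d for d, _ in results], axis=1)
all_i = np.concatenate([i for _, i in results], axis=1)
order = np.argsort(all_d, axis=1)[:, :k]  # ascending: assumes L2 metric
D = np.take_along_axis(all_d, order, axis=1)
I = np.take_along_axis(all_i, order, axis=1)
```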
Reply:

@ivishalanand please have a look at this page in the wiki: https://github.com/facebookresearch/faiss/wiki/Indexes-that-do-not-fit-in-RAM
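For reference, that wiki page covers, among other options, memory-mapping an index so its data stays on disk and is paged in on demand. A minimal sketch, assuming an IVF-type index already written to disk (the path "big_index.ivf" is a placeholder):

```python
import numpy as np
import faiss

# Open the index memory-mapped instead of reading it fully into RAM:
# with IO_FLAG_MMAP the OS faults index pages in on demand, so resident
# memory stays far below the ~16 GB file size.
index = faiss.read_index("big_index.ivf", faiss.IO_FLAG_MMAP)

# For IVF indexes, probing fewer inverted lists touches fewer on-disk pages.
index.nprobe = 16

queries = np.random.rand(5, index.d).astype("float32")  # dummy queries
D, I = index.search(queries, 10)
```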