I'm working on a FAISS index distributed across EC2 instances in AWS.
Each EC2 instance has 8 GB of RAM, and I have ~250 million vectors to index, with 128 dimensions each (after PCA).
In theory, 8 GB could hold:
8e9 bytes / (128 dims/vector * 4 bytes/dim) ≈ 15.6 million vectors
but I assume the index structure itself has additional memory overhead, and the search operations consume additional memory on top of that.

Are there any guidelines for how many "shards" I should distribute my index into?
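For concreteness, here's that arithmetic as a back-of-envelope sketch; the 1.5x overhead factor is purely an assumption to leave headroom for the index structure, the OS, and search-time scratch memory, not a measured number:

```python
import math

# Back-of-envelope shard count for a flat (uncompressed) index.
# OVERHEAD_FACTOR is an assumed headroom multiplier, not a measured value.

NUM_VECTORS = 250_000_000      # ~250M vectors
DIM = 128                      # after PCA
BYTES_PER_FLOAT = 4
RAM_PER_INSTANCE = 8e9         # 8 GB per EC2 instance, in bytes
OVERHEAD_FACTOR = 1.5          # assumed: index structure + OS + query scratch

raw_bytes = NUM_VECTORS * DIM * BYTES_PER_FLOAT        # 128 GB of raw vectors
num_shards = math.ceil(raw_bytes * OVERHEAD_FACTOR / RAM_PER_INSTANCE)
vectors_per_shard = NUM_VECTORS / num_shards

print(f"{num_shards} shards, ~{vectors_per_shard / 1e6:.1f}M vectors each")
# -> 24 shards, ~10.4M vectors each
```

(A compressed index type such as IVFPQ would shrink the per-vector footprint considerably, so the same headroom math would give far fewer shards.)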
Cheers!
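P.S. For context on what each "shard" would look like in my setup: an independent FAISS index over a disjoint slice of the vectors, with per-query results merged by distance on the client side. A minimal sketch of that merge step, assuming each shard returns (distances, ids) arrays from index.search and IDs were made globally unique (e.g. with add_with_ids):

```python
import numpy as np

# Merge top-k results from several shards searched independently.
# Assumes an L2 metric (smaller distance = better); for inner-product
# indexes the sort order would need to be reversed.

def merge_shard_results(per_shard_results, k):
    """per_shard_results: list of (distances, ids) pairs, each of shape (nq, k)."""
    all_d = np.concatenate([d for d, _ in per_shard_results], axis=1)
    all_i = np.concatenate([i for _, i in per_shard_results], axis=1)
    order = np.argsort(all_d, axis=1)[:, :k]           # best k across all shards
    return (np.take_along_axis(all_d, order, axis=1),
            np.take_along_axis(all_i, order, axis=1))
```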