Hi all,

I'm trying to figure out if there is a way to control the number of
segments in an index without explicitly calling forceMerge.

My use-case looks like this: I need to index a static dataset of ~1
billion documents. I know the exact number of docs before indexing starts.
I know the VM where this index is searched has 64 threads. I'd like to end
up with exactly 64 segments, so I can search them in a parallelized fashion.

I know that I could call forceMerge(64), but this takes an extremely long
time.

Is there a straightforward way to ensure that I end up with 64 threads
without force-merging after adding all of the documents?

Thanks in advance for any tips

Alex Klibisz

Reply via email to