If memory serves those are the phases that use POSIX sort right? Sort will try and do an in-memory sort as far as possible and fall back to a disk-based merge sort if not. Also we usually configure sort to run in parallel
If you try to process different indexing in parallel you would create a lot of memory and disk contention which would likely slowdown overall performance For sufficiently large data sets there is also a risk of exhausting disk space during the sort phase and building multiple indexes in parallel would only exacerbate this Rob On 28/10/2016 14:33, "A. Soroka" <[email protected]> wrote: I'm still learning about tdbloader2 and have another question about the index phase: is there any reason why the processes for the various index orderings (SPO, GSPO, etc.) couldn't go on in parallel? Or am I missing some switch or setting that already allows that? --- A. Soroka The University of Virginia Library
