It is true number of terms may be much more than N/10 (or even N for each core), but it is the number of docs per term that will really matter. So you can have N terms in each core but each term has 1/10 number of docs on avg.
2011/12/15 Yury Kats <yuryk...@yahoo.com>: > On 12/15/2011 1:07 PM, Robert Stewart wrote: > >> I think overall memory usage would be close to the same. > > Is this really so? I suspect that the consumed memory is in direct > proportion to the number of terms in the index. I also suspect that > if I divided 1 core with N terms into 10 smaller cores, each smaller > core would have much more than N/10 terms. Let's say I'm indexing > English texts, it's likely that all smaller cores would have almost > the same number of terms, close to the original N. Not so?