Mark, I don't understand this; of course it is use case specific, I haven't
seen any terrible behaviour with 8Gb... 32Mb is extremely small for
Nutch-SOLR -like applications, but it is acceptable for Liferay-SOLR...

Please note also, I have some documents with same IDs updated many thousands
times a day, and I believe (I hope) IndexWriter flushes "optimized" segment
instead of thousands "delete" and single "insert" in many small (32Mb) files
(especially with SOLR)...


> Hmm - came out worse than it looked. Here is a better attempt:
> 
> MergeFactor: 10
> 
> BUF   DOCS/S
> 32       37.40
> 80       39.91
> 120     40.74
> 512     38.25
> 
> Mark Miller wrote:
> > Here is an example using the Lucene benchmark package. Indexing 64,000
> > wikipedia docs (sorry for the formatting):
> >
> >      [java] ------------> Report sum by Prefix (MAddDocs) and Round (4
> > about 32 out of 256058)
> >      [java] Operation     round mrg  flush       runCnt
> > recsPerRun        rec/s  elapsedSec    avgUsedMem    avgTotalMem
> >      [java] MAddDocs_8000     0  10  32.00MB        8
> > 8000        37.40    1,711.22   124,612,472    182,689,792
> >      [java] MAddDocs_8000 -   1  10  80.00MB -  -   8 -  -  - 8000 -
> > -   39.91 -  1,603.76 - 266,716,128 -  469,925,888
> >      [java] MAddDocs_8000     2  10 120.00MB        8
> > 8000        40.74    1,571.02   348,059,488    548,233,216
> >      [java] MAddDocs_8000 -   3  10 512.00MB -  -   8 -  -  - 8000 -
> > -   38.25 -  1,673.05 - 746,087,808 -  926,089,216
> >
> > After about 32-40, you don't gain much, and it starts decreasing once
> > you start getting to high. 8GB is a terrible recommendation.
> >


Reply via email to