what is the size of the documents?

the documents are stored in the main memory until the merge, so if you
increase very much the merge factor, the memory could grow until
virtual memory is used, with the penalization that it
involves.

--
Jokin


On 2/10/07, Sairaj Sunil <[EMAIL PROTECTED]> wrote:
Hi,
I saw an article and it tells me that increasing the mergeFactor speeds up
the indexing. But the reverse had happened in my case.
To be more specific I had conducted some experiments for 1000 documents. The
time taken is quite large, due to pdf file indexing. I had changed the
indexwriter's parameters.

MergeFactor – default(10)
minMergeDocs – default(10)
Time taken – 690 sec

MergeFactor – 50
minMergeDocs – default(10)
Time taken – 765 sec
MergeFactor – default(10)
minMergeDocs – 100
Time taken – 670 sec

MergeFactor –100
minMergeDocs – 100
Time taken – 738 sec
Increasing the mergeFactor did not speed up, but increasing the minMergeDocs
had improved. I am using Lucene.Net.
Can you explain the behavior. I am confused.
just to give more info, I am using Lucene.Net 1.3 version, and not
1.9version. Can you tell me the best way to speed up the performance.
What are
the parameters that I should set. I know that this depends on the system,
but which parameter exactly speeds up the indexing performance.

thanks
--
Sairaj Sunil

Reply via email to