Nadav Har'El wrote:

> What I couldn't figure out how to use, however, was the abundant memory
> (2 GB) that this machine has.
>
> I tried playing with IndexWriter.setMaxBufferedDocs(), and noticed that
> there is no speed gain after I set it to 1000, at which point the running
> Lucene takes up just 70 MB of memory, or 140 MB for the two threads.
>
> Is there a way for Lucene to make use of the much larger memory I have, to
> speed up the indexing process? Does having a huge memory somehow improve
> the speed of huge merges, for example?

It may not be a Lucene limit per se, but a JVM limit instead. What are you using for the JVM's heap (via -Xms and -Xmx switches)? For example, I often run with java -Xmx1000m to let the heap grow to a gigabyte, if necessary.
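
For concreteness, here is roughly how the heap switch and Lucene's document buffer combine. This is only a sketch, assuming the 1.9/2.x-style IndexWriter constructor; the index path and buffer size are placeholders you would tune yourself:

    import org.apache.lucene.analysis.standard.StandardAnalyzer;
    import org.apache.lucene.index.IndexWriter;

    public class BigBufferIndexer {
        public static void main(String[] args) throws Exception {
            // Launch with something like:  java -Xms512m -Xmx1500m BigBufferIndexer
            // so the JVM is actually allowed to use most of the machine's 2 GB
            // (leave some headroom for the OS and for merges).
            IndexWriter writer = new IndexWriter("/path/to/index",
                                                 new StandardAnalyzer(), true);

            // Buffer more documents in RAM before each segment is flushed to
            // disk; larger values spend more heap but write fewer, larger
            // initial segments, so there is less merging later.
            writer.setMaxBufferedDocs(10000);

            // ... addDocument() calls go here ...

            writer.close();
        }
    }

Whether a larger buffer actually helps past the point you already measured is something only an experiment will tell, but at least the heap ceiling won't be the thing stopping you.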

Might I also suggest that you not try to index all of this data in a single invocation of a Java program. That is, index a portion, say 10 GB at a time, and then use addIndexes() later to bring the pieces together. Set the granularity by how much work you can stand to redo when the seemingly inevitable problems crop up. It sure would stink to be working on the 1000th GB of input data, have a power supply go out, and then have to start all the way over from the beginning! Other checkpointing schemes are possible too, if you have the time and inclination to be more clever ...
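
Roughly, that scheme looks like the following. Again just a sketch, assuming the older-style IndexWriter and FSDirectory API; the chunk paths are placeholders and the per-chunk document loop is elided:

    import org.apache.lucene.analysis.standard.StandardAnalyzer;
    import org.apache.lucene.index.IndexWriter;
    import org.apache.lucene.store.Directory;
    import org.apache.lucene.store.FSDirectory;

    public class ChunkedIndexer {
        public static void main(String[] args) throws Exception {
            // Each ~10 GB slice of the input goes into its own index directory,
            // so a crash only costs the chunk that was in progress.
            String[] chunkDirs = { "/indexes/part-000", "/indexes/part-001" };

            for (String path : chunkDirs) {
                IndexWriter writer = new IndexWriter(path, new StandardAnalyzer(), true);
                // ... addDocument() calls for this chunk's input files ...
                writer.close();
            }

            // Once all chunks are done, merge the partial indexes into one.
            IndexWriter merged = new IndexWriter("/indexes/final",
                                                 new StandardAnalyzer(), true);
            Directory[] parts = new Directory[chunkDirs.length];
            for (int i = 0; i < chunkDirs.length; i++) {
                // false = open the existing chunk index, don't create a new one
                parts[i] = FSDirectory.getDirectory(chunkDirs[i], false);
            }
            merged.addIndexes(parts);
            merged.close();
        }
    }

In practice you would also record which chunks have completed (a small "done" marker file per directory, say) so that a restart can skip straight past them.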

Good luck!

--MDC
