lucene norms cached twice

Cabansag, Ronald-Alvin R Fri, 29 Oct 2010 06:27:40 -0700

We are working with a large read-only Lucene index (single segment) that has a 
large number of fields and documents, and we are running into memory usage 
problems.


We found that when using a ReadOnlyDirectoryReader and IndexSearcher created 
using the same reader, the norms are cached twice - first by the reader itself 
and second by the reader's subreaders. Is there an easy way to avoid having the 
norms cached twice when we only have a single subreader?
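
To give a rough sense of the cost: norms take one byte per document for each 
field that has norms enabled, so a second cached copy doubles that footprint. 
The sketch below is only back-of-the-envelope arithmetic. The class name, the 
helper method, and the document and field counts are hypothetical placeholders, 
not figures from our index, and since norms are loaded lazily per field the 
result is an upper bound.

```java
public class NormsMemoryEstimate {

    // One norm byte per document for each field that has norms enabled.
    static long normsBytes(long maxDoc, long fieldsWithNorms) {
        return maxDoc * fieldsWithNorms;
    }

    public static void main(String[] args) {
        long maxDoc = 50_000_000L;   // hypothetical document count
        long fieldsWithNorms = 40L;  // hypothetical number of fields with norms

        long oneCopy = normsBytes(maxDoc, fieldsWithNorms);
        long twoCopies = 2 * oneCopy; // top-level reader cache + subreader cache

        System.out.println("one copy:   " + oneCopy / (1024 * 1024) + " MiB");
        System.out.println("two copies: " + twoCopies / (1024 * 1024) + " MiB");
    }
}
```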

We thought of the following options:
1.) Pass in the main reader as its own subreader when creating the 
IndexSearcher, e.g. 
new IndexSearcher(mainReader, new IndexReader[] {mainReader}, new int[] {0}).
2.) Override the ReadOnlyDirectoryReader.getSequentialSubReaders() method to 
return null. This tells the IndexSearcher to use the main reader (the 
ReadOnlyDirectoryReader) directly.
3.) Use SegmentReader.get(boolean, SegmentInfo, int) to create a 
ReadOnlySegmentReader and use that as our main reader instead.

Are there any negative implications to the above approaches? Or are there 
better approaches to the problem?

Thanks in advance for any help.

Alvin Cab
Cengage Learning


