On Fri, Sep 7, 2012 at 2:19 PM, Tom Burton-West <tburt...@umich.edu> wrote:
> Thanks Robert,
>
> I'll have to spend some time understanding the default codec for Solr 4.0.
> Did I miss something in the changes file?

http://lucene.apache.org/core/4_0_0-BETA/

see the file formats section, especially
http://lucene.apache.org/core/4_0_0-BETA/core/org/apache/lucene/codecs/lucene40/Lucene40PostingsFormat.html#Termdictionary

(since blocktree "covers" term dictionary and terms index)

>
>  I'll be digging into the default codec docs and testing sometime in next
> week  or two (with a 2 billion term index)  If I understand it well enough,
> I'll be happy to draft some changes up for either the wiki or Solr the
> example solrconfig.xml  file.

right i think we should remove these parameters.

>
> Does this mean that the default codec will reduce memory use for the terms
> index enough so I don't need to use either of these settings to deal with
> my > 2 billion term indexes?

probably. i dont know enough about your terms or how much RAM you have
to say for sure.

if not, just customize blocktree's params with a CodecFactory in solr,
or even pick another implementation (FixedGap, VariableGap, whatever).

the interval/divisor stuff is mostly only useful if you are not
reindexing from scratch: e.g. if you are gonna plop your 3.x index
into 4.x then you should set
those to whatever you were using before, since it will be using
PreflexCodec to read those.

-- 
lucidworks.com

Reply via email to