Robert Muir created LUCENE-4089:
-----------------------------------

             Summary: fix or document termsIndexInterval/Divisor for 4.0
                 Key: LUCENE-4089
                 URL: https://issues.apache.org/jira/browse/LUCENE-4089
             Project: Lucene - Java
          Issue Type: Bug
          Components: core/index
            Reporter: Robert Muir
             Fix For: 4.0


There are a few parameters on IndexWriterConfig/DirectoryReader that are going 
to be confusing unless we do something about it: at least documentation at the 
minimum:

* IWC.termsIndexInterval: really a codec parameter, actually ignored by 4.0's 
default impl (BlockTree)
* IWC.readerDivisor/DirectoryReader.divisor: really two things, if its -1 it 
means "don't load terms index", and this is respected by the current impls. 
Otherwise, it means "sample the terms index", and this is also actually ignored 
by 4.0's default impl (BlockTree)

I think people will be confused if they set these things and they do nothing. 
As far as fixing, I took a stab at this and its an annoyingly big change. But 
this is the rough sketch of one idea i had so far:
* remove interval: its only applicable if you customize codec and select a 
different terms index/dict impl anyway, so you can just pass this to FixedGap 
or whatever yourself.
* divisor: generalize this into something simple like a Map<String,String> of 
codec "parameters" that you set on IWC/IR. split divisor from "don't load terms 
index". define these as constants where they belong. I got unhappy here in the 
"splitting" part because I wanted the divisor part in TermsIndexReaderBase, but 
that doesnt extend FieldsProducer (where i wanted the "don't load" part) and 
wrap the terms dict, instead its backwards and terms dict wraps the 
TermsIndexReaderBase... maybe we should fix that too? I think this confusing 
the way it is but I didnt look at how difficult this would be.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to