Thanks Doug,

You gave me an important hint to my initial question. I think that one way to add this 
without affecting performance is to store the information in another file and use it 
whenever the search model needs it, thus the IndexReader reads the data on demand. If 
the search model doesn't use this information to calculate scores, then the file 
containing collection frequencies isn't loaded at all. Here, the advantage is that the 
actual index structure is not altered! 

Regards,
Samir

> -----Message d'origine-----
> De : Doug Cutting [mailto:[EMAIL PROTECTED]
> Envoyé : mercredi, 4. août 2004 23:04
> À : Lucene Developers List
> Objet : Re: RE : Term Collection Frequency?
> 
> Grant Ingersoll wrote:
> > Once again, I think a generic Metadata Reader/Writer interface would be
> > the ideal solution for all of these types of problems.
> >
> > See
> > http://issues.apache.org/eyebrowse/ReadMsg?listName=lucene-
> [EMAIL PROTECTED]&msgId=1777978
> >
> > I am more than willing to help w/ an implementation, but do not want to
> > go it alone w/o some consensus from the committers/Doug that such an
> > idea would be accepted as I think the change may be fairly involved.
> 
> My concern is that truly generic metadata of this sort would be big and
> slow.  But I'd love to see a proposal that performs well!
> 
> Adding, e.g., collection frequency to indexes would not be too hard:
> you'd need to add a field to TermInfo, extend TermInfosWriter,
> DocumentWriter, and SegmentMerger to maintain it, then extend
> SegmentTermEnum, IndexReader, SegmentReader and MultiReader to access
> it.  Indexes would be a little larger and a little slower, but not
> significantly.
> 
> Architecting things so that this same change could be easily made
> without modifying any internals is a much bigger challenge.  And, once
> this is done, making it so that index size and performance is little
> altered is harder yet.  If you have a design that achieves this, please
> share it.
> 
> Doug
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to