It looks like Lucene stores the string names of the posting lists in the
index.  How compact is this storage when there may be a very large number
of posting lists and the strings may be long?  For example, I may have an
entry that looks like "Cn:4593846->8957363,485".  I've seen other systems
hash the string, keep only the hash, and discard the string.  Does Lucene
have this option?

Thanks,

Sriram.
