It looks like Lucene stores the string names of the posting lists in the index. How compact is this storage when there may be a very large number of posting lists and the strings may be long? For example, I may have an entry that looks like "Cn:4593846->8957363,485". I've seen other systems hash the string, keep only the hash, and discard the string. Does Lucene have this option?
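To make the hashing idea concrete, here is a rough sketch of what I mean. This is not Lucene API: the class `TermHashSketch` and the method `hashTerm` are made-up names, and the choice of SHA-256 truncated to 8 bytes is only an assumption for illustration.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

public class TermHashSketch {
    // Hypothetical helper, not part of Lucene: maps an arbitrarily long
    // posting-list name to a fixed 8-byte key taken from its SHA-256 digest.
    static long hashTerm(String term) {
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-256");
            byte[] digest = md.digest(term.getBytes(StandardCharsets.UTF_8));
            // Keep only the first 8 bytes of the 32-byte digest.
            return ByteBuffer.wrap(digest).getLong();
        } catch (NoSuchAlgorithmException e) {
            // SHA-256 is required to be available on every Java platform.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        String term = "Cn:4593846->8957363,485";
        long key = hashTerm(term);
        System.out.println(term.length() + " chars -> 8-byte key "
                + Long.toHexString(key));
    }
}
```

The storage per name becomes constant regardless of string length, at the cost that two different names could collide and the original string cannot be recovered from the key.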
Thanks, Sriram.