Paul Elschot wrote:
> So, adding a document offset from the documents/frequencies into the
> positions/payloads for each document would allow:
> - bulk copying of the positions/payloads during merging, and
> - a more efficient implementation of TermPositions.skipTo(), in that
>   decoding the positions from the last available skip document to the
>   target of skipTo() could be avoided.
> Is that correct?
Yes, though this would also add the cost of computing, writing, and reading that new offset, and it would increase the index size.
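To make the trade-off concrete, here is a rough sketch in Java of a toy postings layout. It is not Lucene's actual file format or API: the class ToyPostings, the proxOffsets array, and the vintsDecoded counter are all made up for illustration, and documents are addressed by their ordinal in the postings list rather than by doc id. Positions are delta-encoded per document as variable-length ints, so without an offset the reader has to decode every intervening position just to find where the target document's positions start.

```java
import java.io.ByteArrayOutputStream;

// Toy model of one term's postings. Hypothetical layout, not Lucene's format.
final class ToyPostings {
    final int[] freqs;        // number of positions in each document
    final int[] proxOffsets;  // assumed extra field: byte offset of each doc's positions
    final byte[] prox;        // all positions, delta-encoded per document as VInts
    int vintsDecoded = 0;     // counts decode work, for comparison only

    ToyPostings(int[][] positions) {
        freqs = new int[positions.length];
        proxOffsets = new int[positions.length];
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        for (int i = 0; i < positions.length; i++) {
            freqs[i] = positions[i].length;
            proxOffsets[i] = out.size();   // the extra data that grows the index
            int prev = 0;
            for (int pos : positions[i]) {
                writeVInt(out, pos - prev);
                prev = pos;
            }
        }
        prox = out.toByteArray();
    }

    // Writes v in 7-bit groups, low bits first, high bit set on all but the last byte.
    static void writeVInt(ByteArrayOutputStream out, int v) {
        while ((v & ~0x7F) != 0) {
            out.write((v & 0x7F) | 0x80);
            v >>>= 7;
        }
        out.write(v);
    }

    // Reads one VInt at cursor[0] and advances the cursor past it.
    int readVInt(int[] cursor) {
        byte b = prox[cursor[0]++];
        int v = b & 0x7F;
        for (int shift = 7; (b & 0x80) != 0; shift += 7) {
            b = prox[cursor[0]++];
            v |= (b & 0x7F) << shift;
        }
        vintsDecoded++;
        return v;
    }

    // Decodes freq delta-encoded positions starting at the cursor.
    int[] decodeDoc(int[] cursor, int freq) {
        int[] result = new int[freq];
        int prev = 0;
        for (int i = 0; i < freq; i++) {
            prev += readVInt(cursor);
            result[i] = prev;
        }
        return result;
    }

    // No per-document offset: start at the skip point's prox pointer and
    // decode (then discard) the positions of every document before the target.
    int[] positionsByScanning(int skipDoc, int targetDoc) {
        int[] cursor = { proxOffsets[skipDoc] };
        for (int d = skipDoc; d < targetDoc; d++) {
            for (int i = 0; i < freqs[d]; i++) {
                readVInt(cursor);
            }
        }
        return decodeDoc(cursor, freqs[targetDoc]);
    }

    // With a per-document offset: jump straight to the target's positions.
    int[] positionsByOffset(int targetDoc) {
        int[] cursor = { proxOffsets[targetDoc] };
        return decodeDoc(cursor, freqs[targetDoc]);
    }
}
```

Both lookups return the same positions; the difference is how many VInts get decoded on the way, versus one extra stored offset per document. The same offsets would also let a merge copy a document's byte range from prox without decoding it.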
> That would indeed be invasive.
Yes. I think our time would likely be better spent working on using PForDelta for freq/prox.
Mike