Adrien Grand created LUCENE-7351:
------------------------------------
Summary: BKDWriter should compress doc ids when all values in a
block are the same
Key: LUCENE-7351
URL: https://issues.apache.org/jira/browse/LUCENE-7351
Project: Lucene - Core
Issue Type: Improvement
Reporter: Adrien Grand
Priority: Minor
BKDWriter writes doc ids using 4 bytes per document. I think it should compress
similarly to postings when all docs in a block have the same packed value. This
can happen either when a field has a default value which is common across
documents or when quantization makes the number of unique values so small that
a large index will necessarily have blocks that all contain the same value (eg.
there are only 63490 unique half-float values).
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]