Adrien Grand created LUCENE-7351:
------------------------------------

             Summary: BKDWriter should compress doc ids when all values in a block are the same
                 Key: LUCENE-7351
                 URL: https://issues.apache.org/jira/browse/LUCENE-7351
             Project: Lucene - Core
          Issue Type: Improvement
            Reporter: Adrien Grand
            Priority: Minor


BKDWriter writes doc ids using 4 bytes per document. I think it should compress 
them, similarly to postings, when all docs in a block have the same packed 
value. This can happen either when a field has a default value that is common 
across documents, or when quantization makes the number of unique values so 
small that a large index will necessarily have blocks in which every point has 
the same value (e.g. there are only 63490 unique half-float values).
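
To illustrate the kind of saving at stake, here is a minimal sketch of one 
possible encoding. It is not BKDWriter's actual code or on-disk format. It 
assumes that when every point in a block has the same packed value, the doc 
ids in that block can be stored in any order, so they can be sorted and 
written as deltas using variable-length ints, much like postings. The class 
and method names (DocIdBlockSketch, encodeSortedDeltas, decodeSortedDeltas) 
are hypothetical.

```java
import java.io.ByteArrayOutputStream;
import java.util.Arrays;

// Hypothetical sketch, not Lucene code: delta + vInt encoding of the doc ids
// of a block whose points all share the same packed value.
public class DocIdBlockSketch {

    /** Sorts a copy of the doc ids and writes each gap as a variable-length int. */
    public static byte[] encodeSortedDeltas(int[] docIds) {
        int[] sorted = docIds.clone();
        Arrays.sort(sorted);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        int previous = 0;
        for (int doc : sorted) {
            // Doc ids are non-negative and sorted, so every delta is >= 0.
            writeVInt(out, doc - previous);
            previous = doc;
        }
        return out.toByteArray();
    }

    /** Reads back {@code count} doc ids, returned in increasing order. */
    public static int[] decodeSortedDeltas(byte[] bytes, int count) {
        int[] docIds = new int[count];
        int pos = 0;
        int previous = 0;
        for (int i = 0; i < count; i++) {
            int delta = 0;
            int shift = 0;
            byte b;
            do {
                b = bytes[pos++];
                delta |= (b & 0x7F) << shift; // low 7 bits carry payload
                shift += 7;
            } while ((b & 0x80) != 0);        // high bit set: more bytes follow
            previous += delta;
            docIds[i] = previous;
        }
        return docIds;
    }

    /** Writes a non-negative int using 7 payload bits per byte. */
    private static void writeVInt(ByteArrayOutputStream out, int value) {
        while ((value & ~0x7F) != 0) {
            out.write((value & 0x7F) | 0x80);
            value >>>= 7;
        }
        out.write(value);
    }

    public static void main(String[] args) {
        int[] docIds = {1042, 1000, 1007, 1100, 1003};
        byte[] encoded = encodeSortedDeltas(docIds);
        System.out.println("fixed-width bytes: " + 4 * docIds.length
            + ", delta-encoded bytes: " + encoded.length);
        System.out.println(Arrays.toString(decodeSortedDeltas(encoded, docIds.length)));
    }
}
```

The gain depends on how close together the doc ids in a block are: small gaps 
fit in one byte each, while widely scattered doc ids still need several bytes 
per entry.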


