[
https://issues.apache.org/jira/browse/LUCENE-7839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16023643#comment-16023643
]
Adrien Grand commented on LUCENE-7839:
--------------------------------------
Agreed. At the moment the patch does not do delta compression; it just reads
plain byte/short/int/long values like master. The only difference with master
is that it splits values into blocks of 16384 and encodes each block
independently. It does not even specialize the case that norms are in 0..15,
since I first wanted to get an idea of the performance impact of leveraging the
iterator API so that a single outlier does not raise the number of bits per
value for every document.
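
To illustrate the block idea described above, here is a minimal sketch. It is not the code from LUCENE-7839.patch: the class and method names are hypothetical, per-block headers are ignored, and it assumes the non-blocked layout picks one plain width (byte/short/int/long) for all values. It only compares the raw size of the two layouts.

```java
// Hypothetical sketch, not taken from LUCENE-7839.patch.
final class BlockNormsSketch {
  // Block size mentioned in the comment above.
  static final int BLOCK_SIZE = 16384;

  // Smallest plain width (1, 2, 4 or 8 bytes) that holds every value in [from, to).
  static int bytesPerValue(long[] norms, int from, int to) {
    long min = Long.MAX_VALUE;
    long max = Long.MIN_VALUE;
    for (int i = from; i < to; i++) {
      min = Math.min(min, norms[i]);
      max = Math.max(max, norms[i]);
    }
    if (min >= Byte.MIN_VALUE && max <= Byte.MAX_VALUE) return 1;
    if (min >= Short.MIN_VALUE && max <= Short.MAX_VALUE) return 2;
    if (min >= Integer.MIN_VALUE && max <= Integer.MAX_VALUE) return 4;
    return 8;
  }

  // Raw size in bytes when a single width is chosen for all values (assumed baseline).
  static long globalSize(long[] norms) {
    return (long) norms.length * bytesPerValue(norms, 0, norms.length);
  }

  // Raw size in bytes when each block of BLOCK_SIZE values picks its own width.
  static long blockedSize(long[] norms) {
    long total = 0;
    for (int from = 0; from < norms.length; from += BLOCK_SIZE) {
      int to = Math.min(from + BLOCK_SIZE, norms.length);
      total += (long) (to - from) * bytesPerValue(norms, from, to);
    }
    return total;
  }
}
```

With four blocks of small norms and one large outlier in the first block, only that first block pays for the wider encoding; the other three stay at one byte per value.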
> Optimize the default NormsFormat for the case that all norms are in 0..16
> -------------------------------------------------------------------------
>
> Key: LUCENE-7839
> URL: https://issues.apache.org/jira/browse/LUCENE-7839
> Project: Lucene - Core
> Issue Type: Task
> Reporter: Adrien Grand
> Priority: Minor
> Attachments: LUCENE-7839.patch
>
>
> Given how we now store the length of the field in norms, we could optimize
> the default norms format for the case that all norms are in 0..16 and store
> them on 4 bits. This would be picked up for short fields that have fewer than 16
> terms (e.g. title fields) and would reduce disk utilization by a factor of 2.
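
The 4-bit storage proposed in the issue description could look roughly like the sketch below. It is an illustration only, with hypothetical names, and it is not the encoding used by any Lucene norms format. It assumes norms in 0..15, since 4 bits can represent 16 distinct values, and packs two norms per byte, which is where the factor of 2 relative to one byte per norm comes from.

```java
// Hypothetical sketch of 4-bit norm packing, two values per byte.
final class NibblePackingSketch {

  // Packs norms in 0..15 into half as many bytes (rounded up).
  static byte[] pack(byte[] norms) {
    byte[] packed = new byte[(norms.length + 1) / 2];
    for (int i = 0; i < norms.length; i++) {
      int v = norms[i];
      if (v < 0 || v > 15) {
        throw new IllegalArgumentException("norm out of 4-bit range: " + v);
      }
      // Even indices use the low nibble, odd indices the high nibble.
      int shift = (i & 1) == 0 ? 0 : 4;
      packed[i >> 1] |= (byte) (v << shift);
    }
    return packed;
  }

  // Reads back the norm stored at the given logical index.
  static int get(byte[] packed, int index) {
    int shift = (index & 1) == 0 ? 0 : 4;
    return (packed[index >> 1] >>> shift) & 0x0F;
  }
}
```

Any norm outside the assumed range is rejected here; a real format would need a fallback to a wider encoding for such fields.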