Robert Muir created LUCENE-5797:
-----------------------------------
Summary: improve speed of norms merging
Key: LUCENE-5797
URL: https://issues.apache.org/jira/browse/LUCENE-5797
Project: Lucene - Core
Issue Type: Improvement
Reporter: Robert Muir
Today we use the following procedure:
* track HashSet<Long> uniqueValues, until it exceeds 256 unique values.
* convert to array, sort and assign ordinals to each one
* create encoder map (HashMap<Long,Integer>) to encode each value.
This results in each value being hashed twice... but the vast majority of the
time people will just be using single-byte norms and a simple array is enough
for that range.
--
This message was sent by Atlassian JIRA
(v6.2#6252)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]