Locale-based sort by field with low memory overhead
---------------------------------------------------

                 Key: LUCENE-2369
                 URL: https://issues.apache.org/jira/browse/LUCENE-2369
             Project: Lucene - Java
          Issue Type: New Feature
          Components: Search
            Reporter: Toke Eskildsen
            Priority: Minor


The current implementation of locale-based sort in Lucene uses the FieldCache 
which keeps all sort terms in memory. Beside the huge memory overhead, 
searching requires comparison of terms with collator.compare every time, making 
searches with millions of hits fairly expensive.

This proposed alternative implementation is to create a packed list of 
pre-sorted ordinals for the sort terms and a map from document-IDs to entries 
in the sorted ordinals list. This results in very low memory overhead and 
faster sorted searches, at the cost of increased startup-time. As the ordinals 
can be resolved to terms after the sorting has been performed, this approach 
supports fillFields=true.

This issue is related to https://issues.apache.org/jira/browse/LUCENE-2335 
which contain previous discussions on the subject.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to