[ https://issues.apache.org/jira/browse/LUCENE-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12646525#action_12646525 ]
Michael McCandless commented on LUCENE-1435:
--------------------------------------------

Could we, alternatively, push this change into DocumentsWriter, such that on writing a segment it uses a per-field Collator (FieldInfo would be extended to record this) to sort the terms dict?

I haven't fully thought through the tradeoffs... but it seems like this would be simpler to use? I.e., rather than putting a CollationKeyFilter in your analyzer chain, and then doing the reverse of this for all searches at search time, you simply set the Collator on the fields (at indexing and searching time, since I agree we should for now not try to serialize into the index which field has which Collator)?

I guess there is a performance cost to using the Collator to do live binary search (during searching) and sorting (during indexing) vs. doing Unicode String comparisons, but in practice at search time this is probably a tiny part of the net cost of searching?

> CollationKeyFilter: convert tokens into CollationKeys encoded using IndexableBinaryStringTools
> ----------------------------------------------------------------------------------------------
>
>                 Key: LUCENE-1435
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1435
>             Project: Lucene - Java
>          Issue Type: New Feature
>    Affects Versions: 2.4
>            Reporter: Steven Rowe
>            Priority: Minor
>             Fix For: 2.9
>
>         Attachments: LUCENE-1435.patch, LUCENE-1435.patch
>
>
> Converts each token into its CollationKey using the provided collator, and
> then encodes the CollationKey with IndexableBinaryStringTools, to allow it to
> be stored as an index term.
> This will allow for efficient range searches and Sorts over fields that need
> collation for proper ordering.
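
To make the tradeoff in the comment above concrete, here is a minimal sketch using only java.text (no Lucene classes). It orders the same terms in two ways: by calling the Collator on every comparison, which is roughly what a Collator-sorted terms dict would do, and by precomputing a CollationKey per term and comparing the keys, which is the idea behind the patch. The class and method names are hypothetical, and Locale.US is an arbitrary choice for illustration; neither comes from the patch.

```java
import java.text.CollationKey;
import java.text.Collator;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration class; not part of Lucene or of the LUCENE-1435 patch.
public class CollationOrderSketch {

    // Compare the raw strings with the Collator on every comparison.
    static List<String> sortWithCollator(List<String> terms, Collator collator) {
        List<String> sorted = new ArrayList<String>(terms);
        Collections.sort(sorted, collator); // Collator is a Comparator<Object>
        return sorted;
    }

    // Compute one CollationKey per term up front, then compare keys only.
    static List<String> sortWithKeys(List<String> terms, Collator collator) {
        List<CollationKey> keys = new ArrayList<CollationKey>();
        for (String term : terms) {
            keys.add(collator.getCollationKey(term));
        }
        Collections.sort(keys); // CollationKey is Comparable<CollationKey>
        List<String> sorted = new ArrayList<String>();
        for (CollationKey key : keys) {
            sorted.add(key.getSourceString());
        }
        return sorted;
    }

    public static void main(String[] args) {
        Collator collator = Collator.getInstance(Locale.US);
        List<String> terms = Arrays.asList("cherry", "Banana", "apple");
        System.out.println(sortWithCollator(terms, collator));
        System.out.println(sortWithKeys(terms, collator));
    }
}
```

Both methods should produce the same order, since a CollationKey is defined so that comparing two keys agrees with the Collator that produced them. The difference is where the collation work is paid: once per term when the keys are built, or on every comparison during sorting and binary search.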