[ https://issues.apache.org/jira/browse/LUCENE-4752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13598252#comment-13598252 ]
David Smiley commented on LUCENE-4752:
--------------------------------------

Shai, I don't think it's enough to have a SortingSegmentMerger. That's the simple part, thanks to your just-finished SortingAtomicReader. You pointed out that sorting segments alone will result in segments that are internally sorted, and thus should compress a little better, but the index is still effectively random-access across segments. So if some thousand documents that are adjacent in sort order need to be retrieved, the read will probably have to touch every segment, versus one or two.

Granted, the newly rewritten title I chose for this issue, "merge segments to sort them", is limited to just making a SortingSegmentMerger. Should another issue be filed with a title such as "MergePolicy to sort across segments"? The key word is "across" -- I'm not sure how that could be clarified in a succinct title.

> Merge segments to sort them
> ---------------------------
>
>                 Key: LUCENE-4752
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4752
>             Project: Lucene - Core
>          Issue Type: New Feature
>          Components: core/index
>            Reporter: David Smiley
>            Assignee: Adrien Grand
>
> It would be awesome if Lucene could write the documents out in a segment
> based on a configurable order. This of course applies to merging segments
> too. The benefit is increased locality on disk of documents that are likely to
> be accessed together. This often applies to documents near each other in
> time, but also spatially.
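The difference between sorting within segments and sorting across segments can be illustrated with a small toy model. This is only a sketch and uses no Lucene API: the class `SegmentLocalityDemo`, its methods, and the document and segment counts are all hypothetical, and each "segment" is just an array of integer sort keys. It counts how many segments a read of 1,000 adjacent keys would touch under each layout.

```java
import java.util.ArrayList;
import java.util.List;

// Toy model only: a "segment" is an int[] of sort keys, not a Lucene segment.
public class SegmentLocalityDemo {

    // Counts how many segments hold at least one key in [lo, hi].
    static int segmentsTouched(List<int[]> segments, int lo, int hi) {
        int touched = 0;
        for (int[] segment : segments) {
            for (int key : segment) {
                if (key >= lo && key <= hi) {
                    touched++;
                    break; // one hit is enough to count this segment
                }
            }
        }
        return touched;
    }

    // Each segment is sorted internally, but keys are dealt round-robin,
    // so every segment spans the whole key range.
    // Assumes numDocs is divisible by numSegments.
    static List<int[]> sortedWithinSegments(int numDocs, int numSegments) {
        int perSegment = numDocs / numSegments;
        List<int[]> segments = new ArrayList<>();
        for (int s = 0; s < numSegments; s++) {
            int[] keys = new int[perSegment];
            for (int i = 0; i < perSegment; i++) {
                keys[i] = i * numSegments + s; // ascending within the segment
            }
            segments.add(keys);
        }
        return segments;
    }

    // Sorted across segments: each segment holds one contiguous key range.
    // Assumes numDocs is divisible by numSegments.
    static List<int[]> sortedAcrossSegments(int numDocs, int numSegments) {
        int perSegment = numDocs / numSegments;
        List<int[]> segments = new ArrayList<>();
        for (int s = 0; s < numSegments; s++) {
            int[] keys = new int[perSegment];
            for (int i = 0; i < perSegment; i++) {
                keys[i] = s * perSegment + i;
            }
            segments.add(keys);
        }
        return segments;
    }

    public static void main(String[] args) {
        int numDocs = 100_000;
        int numSegments = 10;
        int lo = 41_500;
        int hi = 42_499; // 1,000 adjacent keys

        System.out.println("sorted within segments: "
                + segmentsTouched(sortedWithinSegments(numDocs, numSegments), lo, hi));
        System.out.println("sorted across segments: "
                + segmentsTouched(sortedAcrossSegments(numDocs, numSegments), lo, hi));
    }
}
```

In this model the per-segment layout touches all 10 segments for the range, while the across-segment layout touches 1 (or 2 when the range straddles a segment boundary, for example 49,500 to 50,499). How a real merge policy would maintain such a layout is exactly the open question raised above and is not addressed by this sketch.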