[ 
https://issues.apache.org/jira/browse/LUCENE-8004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16215682#comment-16215682
 ] 

Michael McCandless commented on LUCENE-8004:
--------------------------------------------

Merges are usually compute bound, and a given merge is single threaded ... if 
you look in IndexWriter's info stream log you'll see which parts take the most 
time; it's usually postings in my experience.

Especially if you are merging away deleted docs, then we can't apply bulk copy 
optos for stored fields and term vectors.

We don't have a bulk copy opto for postings.

> IndexUpgraderTool should rewrite segments rather than forceMerge
> ----------------------------------------------------------------
>
>                 Key: LUCENE-8004
>                 URL: https://issues.apache.org/jira/browse/LUCENE-8004
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Erick Erickson
>
> Spinoff from LUCENE-7976. We help users get themselves into a corner by using 
> forceMerge on an index to rewrite all segments in the current Lucene format. 
> We should rewrite each individual segment instead. This would also help with 
> upgrading X-2->X-1, then X-1->X.
> Of course the preferred method is to re-index from scratch.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to