[ https://issues.apache.org/jira/browse/LUCENE-5580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13962743#comment-13962743 ]
Adrien Grand commented on LUCENE-5580: -------------------------------------- I agree this would be nice to do that on more index formats. I think I'll open a new issue since I would like to have at least this one in 4.8, and make sure it goes through enough Jenkins builds before the release. > Always verify stored fields' checksum on merge > ---------------------------------------------- > > Key: LUCENE-5580 > URL: https://issues.apache.org/jira/browse/LUCENE-5580 > Project: Lucene - Core > Issue Type: Improvement > Reporter: Adrien Grand > Assignee: Adrien Grand > Fix For: 4.8 > > Attachments: LUCENE-5580.patch > > > I have seen a couple of index corruptions over the last months, and most of > them happened on stored fields. The explanation might just be that since > stored fields are usually most of the index size, they are just more likely > to be corrupted due to a hardware/operating-system failure, but it might be > as well a sneaky bug on our side. > Lucene recently added checksums to index files, and you can enable integrity > verification upon merge, but this comes with a cost since you need to read > all index files twice instead of once. If you are merging a very large > segment and your merges are I/O-bound, this might be noticeable. > I would like to implement integrity checks for stored fields on merges on the > fly, so that the stored fields files need to be read only once. -- This message was sent by Atlassian JIRA (v6.2#6252) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org