[
https://issues.apache.org/jira/browse/LUCENE-5580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13963942#comment-13963942
]
ASF subversion and git services commented on LUCENE-5580:
---------------------------------------------------------
Commit 1585910 from [email protected] in branch 'dev/trunk'
[ https://svn.apache.org/r1585910 ]
LUCENE-5580: Always verify stored fields checksums on bulk merge.
> Always verify stored fields' checksum on merge
> ----------------------------------------------
>
> Key: LUCENE-5580
> URL: https://issues.apache.org/jira/browse/LUCENE-5580
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Adrien Grand
> Assignee: Adrien Grand
> Fix For: 4.8
>
> Attachments: LUCENE-5580.patch
>
>
> I have seen a couple of index corruptions over the last months, and most of
> them happened on stored fields. The explanation might just be that since
> stored fields are usually most of the index size, they are just more likely
> to be corrupted due to a hardware/operating-system failure, but it might be
> as well a sneaky bug on our side.
> Lucene recently added checksums to index files, and you can enable integrity
> verification upon merge, but this comes with a cost since you need to read
> all index files twice instead of once. If you are merging a very large
> segment and your merges are I/O-bound, this might be noticeable.
> I would like to implement integrity checks for stored fields on merges on the
> fly, so that the stored fields files need to be read only once.
--
This message was sent by Atlassian JIRA
(v6.2#6252)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]