[
https://issues.apache.org/jira/browse/LUCENE-8961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16921391#comment-16921391
]
Christine Poerschke commented on LUCENE-8961:
---------------------------------------------
Thanks [~jpountz] for your input.
The latest attached patch facilitates potential salvaging of terms by making
the {{CheckIndex}} class extensible so that developer's own deriving classes
could:
* customise the checkIntegrity call
* filter the fields being checked
* intercept any (field,term) pairs e.g. for logging purposes
It seems to me to be a rather awkward change though and if out-of-the-box
{{CheckIndex}} would not support id salvaging then a stand-alone tool just for
that purpose might be a cleaner solution? Either way, I won't have bandwidth to
pursue this further in the near future i.e. just sharing things 'as is' in case
it might help others in the meantime.
> CheckIndex: pre-exorcise document id salvage
> --------------------------------------------
>
> Key: LUCENE-8961
> URL: https://issues.apache.org/jira/browse/LUCENE-8961
> Project: Lucene - Core
> Issue Type: New Feature
> Reporter: Christine Poerschke
> Priority: Minor
> Attachments: LUCENE-8961.patch, LUCENE-8961.patch
>
>
> The
> [CheckIndex|https://github.com/apache/lucene-solr/blob/releases/lucene-solr/8.2.0/lucene/core/src/java/org/apache/lucene/index/CheckIndex.java]
> tool supports the exorcising of corrupt segments from an index.
> This ticket proposes to add an extra option which could first be used to
> potentially salvage the document ids of the segment(s) about to be exorcised.
> Re-ingestion for those documents could then be arranged so as to repair the
> data damage caused by the exorcising.
--
This message was sent by Atlassian Jira
(v8.3.2#803003)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]