Devesh Kumar Singh created HDDS-11036:
-----------------------------------------
Summary: Ozone Recon Data Recovery Tool
Key: HDDS-11036
URL: https://issues.apache.org/jira/browse/HDDS-11036
Project: Apache Ozone
Issue Type: Improvement
Components: Ozone Recon
Reporter: Devesh Kumar Singh
Recon do periodic sync with OM to keep OM namespace metadata like keys,
volumes, buckets up-to date and use it for various other tasks and derive data
out of it using various tasks (ContainerKeyMapperTask, NSSummaryTask,
OMDBInsightTask etc).
If due to unexpected errors at OM or Recon , Recon may lag or OM namespace
metadata may not be in sync either due to
# RocksDB lock issues.
# Recon has fetched OM DB snapshot since last sequence number, but before
processing those incremental events (PUT, DELETE, UPDATE), if some unexpected
error caused any processed or derived data failed to save in recon DB, Recon
will not fetch that incremental data again and will not process those events
again.
In such cases, Recon should have a recovery tool to fetch the OM DB snapshot
from last fetched sequence number and then explicitly process and run tasks on
fetched events for Recon.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]