Anton Vinogradov created IGNITE-17738:
-----------------------------------------
Summary: Historical rebalance must be able to fix the consistency
on cluster restart
Key: IGNITE-17738
URL: https://issues.apache.org/jira/browse/IGNITE-17738
Project: Ignite
Issue Type: Sub-task
Reporter: Anton Vinogradov
On cluster restart (because of power-off or some other problem) it's possible
to have PDS inconsistent (primary partitions may contain operations missed on
backups).
Currently, "historical rebalance" is able to sync the data to the highest LWM
for every partition.
Most likely, a primary will be chosen as a rebalance source, but the data after
the LWM will not be rebalanced. So, all updates between LWM and HWM will not be
synchronized.
A possible solution for the case when the cluster failed and restarted (same
baseline) is to fix counters to help "historical rebalance".
Counters should be set as
- HWM at primary and as LWM at backups for caches with 2+ backups,
- LWM at primary and as HWM at backups for caches with a single backup.
This can be implemented as an extension for the `--consistency finalize`
command, for example `--consistency finalize-on-restart`.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)