Rahul Kumar created PHOENIX-7750:
------------------------------------
Summary: Create PhoenixSyncTable MR Tool to help validate and sync
table data b/w source and target cluster
Key: PHOENIX-7750
URL: https://issues.apache.org/jira/browse/PHOENIX-7750
Project: Phoenix
Issue Type: Task
Affects Versions: 5.3.0, 5.2.1, 5.2.0
Reporter: Rahul Kumar
Assignee: Rahul Kumar
It is required to have a tool that is scalable and efficient to compare and
synchronize copies of tables within a pair of Phoenix clusters. The current
tools available from HBase are not sufficiently reliable and efficient to be
used in large scale production deployments.
HBase replication has traditionally been used to replicate tables between
Phoenix clusters for increased availability and disaster recovery. However,
this method has encountered issues, including operational problems,
misconfigurations, and replication bugs, which can lead to incomplete file
synchronization between Phoenix clusters.
Existing HBase tools have shortcomings to be used on a regular basis to verify
that tables are replicated correctly or resynchronize them when needed. This is
mainly because these tools are not reliable or sufficiently efficient when
tables are large.
The upcoming re-architected Phoenix HA feature incorporates a new replication
solution. This introduces an even greater need for an enhanced tool to verify
the correctness of this new solution.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)