Sankar Hariappan created HIVE-21269:
---------------------------------------
Summary: Hive replication should mandate -update and -delete as
DistCp options to avoid data inconsistency.
Key: HIVE-21269
URL: https://issues.apache.org/jira/browse/HIVE-21269
Project: Hive
Issue Type: Bug
Components: repl
Affects Versions: 4.0.0
Reporter: Sankar Hariappan
Assignee: Sankar Hariappan
Currently, external tables replication, copies the data in directory level. So,
if target directory exist, then DistCp should compare and update or skip data
files in the directory instead of creating new directory inside pre-existing
target directory.
This can be achieved using -update.
Also, -delete option is needed to delete the files missing in source directory
but present in target.
Hive should mandate these DistCp options even if user passes other options.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)