[
https://issues.apache.org/jira/browse/HIVE-21269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16768888#comment-16768888
]
mahesh kumar behera commented on HIVE-21269:
--------------------------------------------
02.patch looks fine to me.
+1
> Mandate -update and -delete as DistCp options to sync data files for
> external tables replication.
> --------------------------------------------------------------------------------------------------
>
> Key: HIVE-21269
> URL: https://issues.apache.org/jira/browse/HIVE-21269
> Project: Hive
> Issue Type: Bug
> Components: repl
> Affects Versions: 4.0.0
> Reporter: Sankar Hariappan
> Assignee: Sankar Hariappan
> Priority: Major
> Labels: DR, pull-request-available, replication
> Attachments: HIVE-21269.01.patch, HIVE-21269.02.patch
>
> Time Spent: 10m
> Remaining Estimate: 0h
>
> Currently, external tables replication, copies the data in directory level.
> So, if target directory exist, then DistCp should compare and update or skip
> data files in the directory instead of creating new directory inside
> pre-existing target directory.
> This can be achieved using -update.
> Also, -delete option is needed to delete the files missing in source
> directory but present in target.
> Hive should mandate these DistCp options even if user passes other options.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)