[
https://issues.apache.org/jira/browse/HIVE-24895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17304673#comment-17304673
]
Ayush Saxena commented on HIVE-24895:
-------------------------------------
Tried things out as per your suggestions. The present state looks like this:
{noformat}
For Table 1:
2021-03-19T01:46:54,748 INFO [HiveServer2-Background-Pool: Thread-163]
repl.DirCopyTask: Completed DirCopyTask for source:
hdfs://localhost:9000/user/hive/warehouse/source01.db/dest_1 to target:
hdfs://localhost:9000/tmp/ext7/user/hive/warehouse/source01.db/dest_1. Took
0:12.600s. DistCp JobId job_1616095793401_0005
For Table 2:
2021-03-19T01:47:12,377 INFO [HiveServer2-Background-Pool: Thread-163]
repl.DirCopyTask: Completed DirCopyTask for source:
hdfs://localhost:9000/user/hive/warehouse/source01.db/dest_2 to target:
hdfs://localhost:9000/tmp/ext7/user/hive/warehouse/source01.db/dest_2. Took
0:17.629s. DistCp JobId job_1616095793401_0006
The final log after all DirCopyTasks:
2021-03-19T01:47:12,390 INFO [HiveServer2-Background-Pool: Thread-163]
ReplState: REPL::DATA_COPY_END: {"time":"Total time taken for all external
table copy tasks: 0:30.270s"}{noformat}
In the finally block of each DirCopyTask, I log the source, destination, time
taken and the jobId.
The jobId will only be available once HADOOP-17594 is in.
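
For illustration only, here is a minimal sketch of the pattern described above: time the copy, then emit the completion line from a finally block so it is logged whether or not the copy succeeds. This is not the actual patch. The class name `CopyTimingSketch`, its methods, the `Supplier` standing in for the DistCp call, and the "unknown" placeholder for a missing jobId are all assumptions made for the example. The `m:ss.SSS` duration layout is inferred from the log lines above.

```java
import java.util.Locale;
import java.util.concurrent.TimeUnit;
import java.util.function.Supplier;

/** Hypothetical illustration; not the real Hive DirCopyTask code. */
class CopyTimingSketch {

  /** Formats milliseconds as m:ss.SSS plus a trailing "s", e.g. 12600 -> "0:12.600s". */
  static String formatElapsed(long millis) {
    long minutes = millis / 60_000;
    long seconds = (millis % 60_000) / 1000;
    long ms = millis % 1000;
    return String.format(Locale.ROOT, "%d:%02d.%03ds", minutes, seconds, ms);
  }

  /** Builds a completion message shaped like the log lines in this comment. */
  static String completionMessage(String source, String target,
                                  long elapsedMillis, String jobId) {
    return "Completed DirCopyTask for source: " + source
        + " to target: " + target
        + ". Took " + formatElapsed(elapsedMillis)
        + ". DistCp JobId " + jobId;
  }

  /**
   * Runs the copy and logs from the finally block, so the line is emitted
   * even if the copy throws. The supplier is assumed to return the job id.
   */
  static String runAndLog(String source, String target, Supplier<String> copy) {
    long start = System.nanoTime();
    String jobId = "unknown"; // assumption: placeholder when no id is available
    try {
      jobId = copy.get();
      return jobId;
    } finally {
      long elapsedMillis = TimeUnit.NANOSECONDS.toMillis(System.nanoTime() - start);
      System.out.println(completionMessage(source, target, elapsedMillis, jobId));
    }
  }
}
```

Calling `CopyTimingSketch.runAndLog(src, dst, () -> "job_x")` prints one completion line per copy and returns the job id. A total across all copies could be formatted with the same `formatElapsed` helper.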
> Add a DataCopyEnd stage in ReplStateLogTask for external table replication
> --------------------------------------------------------------------------
>
> Key: HIVE-24895
> URL: https://issues.apache.org/jira/browse/HIVE-24895
> Project: Hive
> Issue Type: Improvement
> Reporter: Ayush Saxena
> Assignee: Ayush Saxena
> Priority: Major
> Labels: pull-request-available
> Time Spent: 10m
> Remaining Estimate: 0h
>
> Add a task to mark the end of external table copy.