[
https://issues.apache.org/jira/browse/HIVE-24502?focusedWorklogId=527555&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-527555
]
ASF GitHub Bot logged work on HIVE-24502:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 23/Dec/20 09:12
Start Date: 23/Dec/20 09:12
Worklog Time Spent: 10m
Work Description: pkumarsinha commented on a change in pull request #1759:
URL: https://github.com/apache/hive/pull/1759#discussion_r547838835
##########
File path: ql/src/java/org/apache/hadoop/hive/ql/exec/repl/ReplDumpTask.java
##########
@@ -607,9 +606,7 @@ private Long incrementalDump(Path dumpRoot, DumpMetaData
dmd, Path cmRoot, Hive
dmd.setDump(DumpType.INCREMENTAL, work.eventFrom, lastReplId, cmRoot,
executionId);
// If repl policy is changed (oldReplScope is set), then pass the current
replication policy,
// so that REPL LOAD would drop the tables which are not included in
current policy.
- if (work.oldReplScope != null) {
- dmd.setReplScope(work.replScope);
- }
+ dmd.setReplScope(work.replScope);
Review comment:
Earlier current ReplicationScope was being passed for load to consume
and figure out if it need to drop extra tables. For which it makes a call to
metastore to get all tables and filter the names by applying the expression in
ReplicationScope. With this change, this comparison will happen all the times.
While this will still work, it will introduce a new cost to get all tables and
compare against the load. Can we maintain two expressions or one expression
and one flag to convey if there have been a change in the expression or not?
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 527555)
Time Spent: 20m (was: 10m)
> Store table level regular expression used during dump for table level
> replication
> ---------------------------------------------------------------------------------
>
> Key: HIVE-24502
> URL: https://issues.apache.org/jira/browse/HIVE-24502
> Project: Hive
> Issue Type: Task
> Reporter: Aasha Medhi
> Assignee: Aasha Medhi
> Priority: Major
> Labels: pull-request-available
> Attachments: HIVE-24502.01.patch, HIVE-24502.02.patch
>
> Time Spent: 20m
> Remaining Estimate: 0h
>
> Store include table list and exclude table list as part of dump meta data file
--
This message was sent by Atlassian Jira
(v8.3.4#803005)