[ 
https://issues.apache.org/jira/browse/HIVE-21598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Bapat updated HIVE-21598:
----------------------------------
    Status: In Progress  (was: Patch Available)

> CTAS on ACID table during incremental does not replicate data
> -------------------------------------------------------------
>
>                 Key: HIVE-21598
>                 URL: https://issues.apache.org/jira/browse/HIVE-21598
>             Project: Hive
>          Issue Type: Bug
>          Components: HiveServer2, repl
>            Reporter: Ashutosh Bapat
>            Assignee: Ashutosh Bapat
>            Priority: Major
>              Labels: CTAS, Repl
>         Attachments: HIVE-21598.01.patch, HIVE-21598.02.patch, 
> HIVE-21598.03.patch, HIVE-21598.04.patch
>
>
> Scenario
> create database dumpdb with dbproperties('repl.source.for'='1,2,3');
> use dumpdb;
> create table t1 (id int) clustered by(id) into 3 buckets stored as orc 
> tblproperties ("transactional"="true");
> insert into t1 values(1);
> insert into t1 values(2);
> repl dump dumpdb;
> repl load loaddb from <bootstrap load directory>;
> use loaddb;
> select * from t1;
> +--------+
> | t6.id |
> +--------+
> | 1 |
> | 2 |
> +--------
> use dumpdb;
> create table t6 stored as orc tblproperties ("transactional"="true") as 
> select * from t1;
> select * from t6;
> +--------+
> | t6.id |
> +--------+
> | 1 |
> | 2 |
> +--------+
> repl dump dumpdb from <last repl id>
> repl load loaddb from <inc dump location>;
> use loaddb;
> select * from t6;
> +--------+
> | t6.id |
> +--------+
> +--------+
> t6 gets created but there's no data.
>  
> On further investigation, I see that the CommitTxnEvent's dump directory has 
> _files but it is empty. Looks like we do not log names of the files created 
> as part of CTAS.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to