[ 
https://issues.apache.org/jira/browse/HIVE-21213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

mahesh kumar behera updated HIVE-21213:
---------------------------------------
    Description: The current implementation of compaction uses the txn id in 
the directory name. This is used to isolate the queries from reading the 
directory until compaction has finished and to avoid the compactor marking used 
earlier. In case of replication, during bootstrap , directory is copied as it 
is with the same name from source to destination cluster. But the directory 
created by compaction with txn id can not be copied as the txn list at target 
may be different from source. The txn id which is valid at source may be an 
aborted txn at target. So conversion logic is required to create a new 
directory with valid txn at target and dump the data to the newly created 
directory.  (was: The current implementation of compaction uses the txn id in 
the directory name. This is used to isolate the queries from reading the 
directory until compaction has finished and to avoid the compactor marking used 
earlier. In case of replication, the directory can not be copied as the txn 
list at target may be different from source. So conversion logic is required to 
create a new directory with valid txn at target and dump the data to the newly 
created directory.)

> Acid table bootstrap replication needs to handle directory created by 
> compaction with txn id
> --------------------------------------------------------------------------------------------
>
>                 Key: HIVE-21213
>                 URL: https://issues.apache.org/jira/browse/HIVE-21213
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Hive, HiveServer2, repl
>            Reporter: mahesh kumar behera
>            Assignee: mahesh kumar behera
>            Priority: Major
>
> The current implementation of compaction uses the txn id in the directory 
> name. This is used to isolate the queries from reading the directory until 
> compaction has finished and to avoid the compactor marking used earlier. In 
> case of replication, during bootstrap , directory is copied as it is with the 
> same name from source to destination cluster. But the directory created by 
> compaction with txn id can not be copied as the txn list at target may be 
> different from source. The txn id which is valid at source may be an aborted 
> txn at target. So conversion logic is required to create a new directory with 
> valid txn at target and dump the data to the newly created directory.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to