Sankar Hariappan created HIVE-17196:
---------------------------------------

             Summary: CM: ReplCopyTask should retain the original file names 
even if copied from CM path.
                 Key: HIVE-17196
                 URL: https://issues.apache.org/jira/browse/HIVE-17196
             Project: Hive
          Issue Type: Sub-task
          Components: repl
    Affects Versions: 2.1.0
            Reporter: Sankar Hariappan
            Assignee: Sankar Hariappan
             Fix For: 3.0.0


Consider the following scenario:
1. Insert into table T1 with value(X).
2. Insert into table T1 with value(X).
3. Truncate the table T1.
– This step backs up 2 files with the same content to cmroot. Because their 
checksums match, only one file ends up in cmroot.
4. Run an incremental repl covering the above 3 operations.
– In this step, the files for both insert events are read from cmroot. Copying 
one overwrites the other, because both have the same file name in the cm path 
(the checksum is used as the file name).
This leads to data loss, so it is necessary to retain the original file names 
even when copying from the cm path.
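The scenario above can be sketched as a small simulation. This is not Hive code: 
the dict-based cmroot, the helper names (backup_to_cmroot, replicate), the 
SHA-256 checksum and the data file names are all assumptions made for 
illustration. It only shows why using the cm file name as the destination name 
loses a file, while keeping the original name does not.

```python
import hashlib


def checksum(content: bytes) -> str:
    # Stand-in for the checksum used to name files in cmroot (assumed SHA-256).
    return hashlib.sha256(content).hexdigest()


def backup_to_cmroot(cmroot: dict, content: bytes) -> str:
    # Files are stored under their checksum, so identical content
    # collapses into a single cmroot entry.
    cm_name = checksum(content)
    cmroot[cm_name] = content
    return cm_name


def replicate(events, cmroot, keep_original_names: bool) -> dict:
    # Copy each event's file from cmroot into the (simulated) target directory.
    target = {}
    for original_name, cm_name in events:
        dest_name = original_name if keep_original_names else cm_name
        target[dest_name] = cmroot[cm_name]  # same dest_name => overwrite
    return target


cmroot = {}
events = []
# Two inserts of the same value produce two files with identical content.
# The file names here are hypothetical.
for original_name in ("000000_0", "000000_0_copy_1"):
    cm_name = backup_to_cmroot(cmroot, b"X\n")  # truncate backs the file up
    events.append((original_name, cm_name))

lossy = replicate(events, cmroot, keep_original_names=False)
fixed = replicate(events, cmroot, keep_original_names=True)

print(len(cmroot), len(lossy), len(fixed))  # 1 1 2
```

With the checksum as the destination name, the target ends up with one file for 
two insert events; with the original names retained, both files survive.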



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
