ShivaKumar SS created HBASE-20844:
-------------------------------------

             Summary: Duplicate rows returned while hbase snapshot reads
                 Key: HBASE-20844
                 URL: https://issues.apache.org/jira/browse/HBASE-20844
             Project: HBase
          Issue Type: Bug
          Components: mapreduce, spark
    Affects Versions: 1.3.1
         Environment: Cluster Details 

Java    1.7
Hbase     1.3.1
Spark      1.6.1
            Reporter: ShivaKumar SS


We are trying to take snapshot from code and read data using MR and spark, both 
approaches are returning duplicate records.

On the API side, {{org.apache.hadoop.hbase.mapreduce.TableSnapshotInputFormat 
}} is used. 

Snapshot was taken during the table is being in the region split state. 

We suspect it is due to data is being returned for both parent and daughter 
regions.




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to