ShivaKumar SS created HBASE-20844: ------------------------------------- Summary: Duplicate rows returned while hbase snapshot reads Key: HBASE-20844 URL: https://issues.apache.org/jira/browse/HBASE-20844 Project: HBase Issue Type: Bug Components: mapreduce, spark Affects Versions: 1.3.1 Environment: Cluster Details
Java 1.7 Hbase 1.3.1 Spark 1.6.1 Reporter: ShivaKumar SS We are trying to take snapshot from code and read data using MR and spark, both approaches are returning duplicate records. On the API side, {{org.apache.hadoop.hbase.mapreduce.TableSnapshotInputFormat }} is used. Snapshot was taken during the table is being in the region split state. We suspect it is due to data is being returned for both parent and daughter regions. -- This message was sent by Atlassian JIRA (v7.6.3#76005)