[
https://issues.apache.org/jira/browse/HBASE-16011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892252#comment-15892252
]
Zheng Hu commented on HBASE-16011:
----------------------------------
After running the provided UT, I think it's a bug and should be fixed. The
problem is that TableSnapshotScanner sorts all regions and scans them one by
one, but does not consider that a region may have been split, so rows of a
split parent are read again from its daughters. [~enis], [~tedyu], [~Apache9],
[[email protected]]
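
To illustrate the problem, here is a minimal, self-contained sketch. It does
not use the HBase API: SketchRegion, SnapshotScanSketch, the splitParent flag
and the sample rows are all hypothetical names made up for this example. It
assumes a split parent can be recognized by a flag on the region, and shows
that scanning every region in the snapshot returns each row twice, while
skipping split parents returns each row once. The real fix may look different.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical stand-in for a region listed in a snapshot (not an HBase class).
final class SketchRegion {
    final String startKey;      // inclusive start key, "" = start of table
    final boolean splitParent;  // assumption: split parents carry a flag
    final List<String> rows;    // row keys stored in this region

    SketchRegion(String startKey, boolean splitParent, List<String> rows) {
        this.startKey = startKey;
        this.splitParent = splitParent;
        this.rows = rows;
    }
}

final class SnapshotScanSketch {
    // A snapshot that contains a split parent and both of its daughters.
    static List<SketchRegion> sampleRegions() {
        List<SketchRegion> regions = new ArrayList<>();
        regions.add(new SketchRegion("", true, List.of("a", "b", "c", "d")));
        regions.add(new SketchRegion("", false, List.of("a", "b")));
        regions.add(new SketchRegion("c", false, List.of("c", "d")));
        return regions;
    }

    // Sorts regions by start key and scans them one by one.
    // With skipSplitParents == false this mirrors the behaviour described
    // above; with true, split parents are left out before scanning.
    static List<String> scanAll(List<SketchRegion> regions, boolean skipSplitParents) {
        List<SketchRegion> sorted = new ArrayList<>(regions);
        sorted.sort(Comparator.comparing((SketchRegion r) -> r.startKey));
        List<String> result = new ArrayList<>();
        for (SketchRegion region : sorted) {
            if (skipSplitParents && region.splitParent) {
                continue; // the daughters already cover these rows
            }
            result.addAll(region.rows);
        }
        return result;
    }
}
```

Calling SnapshotScanSketch.scanAll(SnapshotScanSketch.sampleRegions(), false)
returns eight row keys for four distinct rows; passing true returns the four
rows once each.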
> TableSnapshotScanner and TableSnapshotInputFormat can produce duplicate rows
> ----------------------------------------------------------------------------
>
> Key: HBASE-16011
> URL: https://issues.apache.org/jira/browse/HBASE-16011
> Project: HBase
> Issue Type: Bug
> Components: snapshots
> Affects Versions: 2.0.0, 1.2.2
> Reporter: Youngjoon Kim
> Assignee: Zheng Hu
> Attachments: snapshot_bug_test.patch
>
>
> A snapshot of a table that has been split (not a pre-split table) can
> include both a parent region and its daughter regions. If
> TableSnapshotScanner or TableSnapshotInputFormat is run on such a snapshot,
> duplicate rows are produced.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)