When we query a table by a particular row key, there is only one row returned 
by HBase, which is expected.
However, when we query a snapshot for that same table, by the same particular 
row key, five duplicate rows are returned.  Why ?




In the log of the master server, we see some snapshot-related error:
===================== ERROR START =====================
ERROR [master:sh-bs-3-b8-namenode-17-208:60000.archivedHFileCleaner] 
snapshot.SnapshotHFileCleaner: Exception while checking if files were valid, 
keeping them just in case.
./hbase-root-master-sh-bs-3-b8-namenode-17-208.log.7:org.apache.hadoop.hbase.snapshot.CorruptedSnapshotException:
 Couldn't read snapshot info 
from:hdfs://master1.hh:8020/hbase/.hbase-snapshot/.tmp/hb_anchor_original_total_7days_stat_1526423587063/.snapshotinfo
./hbase-root-master-sh-bs-3-b8-namenode-17-208.log.7-   at 
org.apache.hadoop.hbase.snapshot.SnapshotDescriptionUtils.readSnapshotInfo(SnapshotDescriptionUtils.java:325)
./hbase-root-master-sh-bs-3-b8-namenode-17-208.log.7-   at 
org.apache.hadoop.hbase.snapshot.SnapshotReferenceUtil.getHFileNames(SnapshotReferenceUtil.java:328)
./hbase-root-master-sh-bs-3-b8-namenode-17-208.log.7-   at 
org.apache.hadoop.hbase.master.snapshot.SnapshotHFileCleaner$1.filesUnderSnapshot(SnapshotHFileCleaner.java:85)
./hbase-root-master-sh-bs-3-b8-namenode-17-208.log.7-   at 
org.apache.hadoop.hbase.master.snapshot.SnapshotFileCache.getSnapshotsInProgress(SnapshotFileCache.java:303)
./hbase-root-master-sh-bs-3-b8-namenode-17-208.log.7-   at 
org.apache.hadoop.hbase.master.snapshot.SnapshotFileCache.getUnreferencedFiles(SnapshotFileCache.java:194)
./hbase-root-master-sh-bs-3-b8-namenode-17-208.log.7-   at 
org.apache.hadoop.hbase.master.snapshot.SnapshotHFileCleaner.getDeletableFiles(SnapshotHFileCleaner.java:62)
./hbase-root-master-sh-bs-3-b8-namenode-17-208.log.7-   at 
org.apache.hadoop.hbase.master.cleaner.CleanerChore.checkAndDeleteFiles(CleanerChore.java:233)
./hbase-root-master-sh-bs-3-b8-namenode-17-208.log.7-   at 
org.apache.hadoop.hbase.master.cleaner.CleanerChore.checkAndDeleteEntries(CleanerChore.java:157)
...
===================== ERROR END =====================
And we find a related issue for this error: 
https://issues.apache.org/jira/browse/HBASE-16464?attachmentSortBy=fileName


However, there is no proof that the error in the log is related to our problem 
of having duplicate records from a snapshot.
Our HBase version is 0.98.18-hadoop2.


Could you help give some hint why we are having duplicate records from the 
snapshot ?







Reply via email to