[
https://issues.apache.org/jira/browse/HBASE-18098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Pavel Salimov updated HBASE-18098:
----------------------------------
Description:
Somehow it happened that a region, say AB, was split onto A and B, but some
portion of B's data were missing in B's H-files, still presenting in A's files.
I am not completely sure, was it a result of just a failure during split, or of
applying hbck repair after that.
Anyway, data of some rows belonging to B were missing accessing normally. I was
able to access them making a scan type query regarding these rows to region A
(despite the rows are after its end). Hbck repairs, splitting A onto A0, A1 and
(full) compaction did not change the situation: the data were still missing in
B migrated to A1.
Copying the file from A1 dir to B finally made the data accessible.
I am reporting the issue in hope to get an advice on how to detect such an
inconsistency as well as hoping to clarify what leaded to such a state (and
fixed if it is a bug).
was:
Somehow it happened that a region, say AB, was split onto A and B, but some
portion of B's data were missing in B's H-files, still presenting in A's files.
I am not completely sure, was it a result of just a failure during split, or of
applying hbck repair after that.
Anyway, data of some rows belonging to B were missing accessing normally. I was
able to access them making a scan type query regarding these rows to region A
(despite the rows are after its end). Hbck repairs, splitting A onto A0, A1 and
(full) compaction did not change the situation: the data were still missing in
B migrated to A1.
I am reporting the issue in hope to get an advice on how to detect such an
inconsistency as well as hoping to clarify what leaded to such a state (and
fixed if it is a bug).
> Region data ended up in a file of an another region
> ---------------------------------------------------
>
> Key: HBASE-18098
> URL: https://issues.apache.org/jira/browse/HBASE-18098
> Project: HBase
> Issue Type: Bug
> Affects Versions: 1.2.0
> Reporter: Pavel Salimov
>
> Somehow it happened that a region, say AB, was split onto A and B, but some
> portion of B's data were missing in B's H-files, still presenting in A's
> files. I am not completely sure, was it a result of just a failure during
> split, or of applying hbck repair after that.
> Anyway, data of some rows belonging to B were missing accessing normally. I
> was able to access them making a scan type query regarding these rows to
> region A (despite the rows are after its end). Hbck repairs, splitting A onto
> A0, A1 and (full) compaction did not change the situation: the data were
> still missing in B migrated to A1.
> Copying the file from A1 dir to B finally made the data accessible.
>
> I am reporting the issue in hope to get an advice on how to detect such an
> inconsistency as well as hoping to clarify what leaded to such a state (and
> fixed if it is a bug).
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)