I have been trying for some time to figure out an issue where region splits
fail because the same cache key gets cached with two different values. At this point I
am making very slow progress so I figured I would try to reach out for
help. I will try to explain all that I know/have found out:

What I know/have observed:
This occurs when prefetch on open is true
Both daughter store openers will try to access and cache the first and last
values in the store (HalfStoreFile). [1] [2] [3]
Both daughter store openers will access the same key (the splitkey, I
* However they will access two different paths and retrieve two different
Then when they try to cache these values, it fails because the other thread
has already cached its value and the two values aren't equivalent. [4]
This is past the point of no return for the region split so it tries to
roll forward, but fails. These regions are then corrupted and cannot be

One thing I have noticed is that these blocks are all leaf index blocks. I
guess these index blocks would be different if they were in different files.
This is on HBase 1.3.1.

It is definitely a concurrency bug as both daughter openers need to try to
get the block from disk before the other caches the block. Why would it be
trying to access two different paths for the same key? Is that normal in a
reference file?
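
To make the suspected interleaving concrete, here is a minimal Java sketch of
it. This is not HBase code: the class SplitCacheRace, the cacheBlock method,
the key string and the byte arrays are all hypothetical stand-ins, and the
barrier only exists to force both "openers" to miss the cache before either
one caches its block. It assumes a cache that rejects a second, different
value for a key that is already present, which is what I appear to be hitting.

```java
import java.util.Arrays;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.CyclicBarrier;
import java.util.concurrent.atomic.AtomicInteger;

class SplitCacheRace {
    // Hypothetical stand-in for a block cache keyed by a string.
    static final ConcurrentHashMap<String, byte[]> cache = new ConcurrentHashMap<>();

    // Caches the block; fails if the key is already cached with different contents.
    static void cacheBlock(String key, byte[] block) {
        byte[] existing = cache.putIfAbsent(key, block);
        if (existing != null && !Arrays.equals(existing, block)) {
            throw new IllegalStateException("Cached block contents differ for key " + key);
        }
    }

    // Runs two simulated daughter openers and returns how many failed to cache.
    static int runOpeners() {
        cache.clear();
        CyclicBarrier bothHaveMissed = new CyclicBarrier(2);
        AtomicInteger failures = new AtomicInteger();
        String sharedKey = "splitkey-block";            // same key for both openers (assumed)
        byte[][] blocksOnDisk = { {1, 2, 3}, {4, 5, 6} }; // different contents per path (assumed)

        Thread[] openers = new Thread[2];
        for (int i = 0; i < openers.length; i++) {
            final byte[] block = blocksOnDisk[i];
            openers[i] = new Thread(() -> {
                try {
                    if (cache.get(sharedKey) == null) {   // cache miss, so "read from disk"
                        bothHaveMissed.await();           // both miss before either caches
                        cacheBlock(sharedKey, block);
                    }
                } catch (IllegalStateException e) {
                    failures.incrementAndGet();           // the losing opener ends up here
                } catch (Exception e) {
                    throw new RuntimeException(e);
                }
            });
            openers[i].start();
        }
        for (Thread t : openers) {
            try {
                t.join();
            } catch (InterruptedException e) {
                throw new RuntimeException(e);
            }
        }
        return failures.get();
    }

    public static void main(String[] args) {
        System.out.println("failed openers: " + runOpeners()); // prints "failed openers: 1"
    }
}
```

Whichever thread caches second finds a different value already stored under
the same key and fails. Without the barrier the failure would only show up
when the timing happens to line up, which matches how intermittent this is.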

Does anyone have any idea as to what might be occurring?
If you need more info, I can show you the code path that it is running


