[
https://issues.apache.org/jira/browse/HBASE-5986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13283881#comment-13283881
]
Enis Soztutar commented on HBASE-5986:
--------------------------------------
Here are the test results for 0.94:
{code}
Tests run: 551, Failures: 0, Errors: 0, Skipped: 0
...
Tests run: 932, Failures: 1, Errors: 2, Skipped: 9
Failed tests:
testShutdownSimpleFixup(org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster):
expected:<1> but was:<0>
Tests in error:
testDelayedRpcImmediateReturnValue(org.apache.hadoop.hbase.ipc.TestDelayedRpc):
Call to /127.0.0.1:53586 failed on socket timeout exception:
java.net.SocketTimeoutException: 1000 millis timeout while waiting for channel
to be ready for read. ch : java.nio.channels.SocketChannel[connected
local=/127.0.0.1:53623 remote=/127.0.0.1:53586]
testLocalHBaseCluster(org.apache.hadoop.hbase.TestLocalHBaseCluster): Master
not initialized after 200 seconds
{code}
I rerun the tests locally with success, except TestLocalHBaseCluster. But it
fails on 0.94 HEAD as well for me.
For 0.92:
{code}
Results :
Failed tests:
testMultipleResubmits(org.apache.hadoop.hbase.master.TestSplitLogManager)
testcomputeHDFSBlocksDistribution(org.apache.hadoop.hbase.util.TestFSUtils)
Tests in error:
testClusterRestart(org.apache.hadoop.hbase.master.TestRestartCluster):
org.apache.hadoop.hbase.PleaseHoldException: Master is initializing
testWholesomeSplit(org.apache.hadoop.hbase.regionserver.TestSplitTransaction):
Failed delete of
/homes/hortonde/enis/code/hbase-0.92/target/test-data/af023188-0b23-4f9d-a9bc-a074e94e57f8/org.apache.hadoop.hbase.regionserver.TestSplitTransaction/table/7c59b6677ad46bf3f652a83de1e62bcb
testRollback(org.apache.hadoop.hbase.regionserver.TestSplitTransaction):
Target HLog directory already exists:
/homes/hortonde/enis/code/hbase-0.92/target/test-data/af023188-0b23-4f9d-a9bc-a074e94e57f8/org.apache.hadoop.hbase.regionserver.TestSplitTransaction/logs
testRollback(org.apache.hadoop.hbase.regionserver.TestSplitTransaction)
loadTest[0](org.apache.hadoop.hbase.util.TestMiniClusterLoadSequential): test
timed out after 120000 milliseconds
loadTest[0](org.apache.hadoop.hbase.util.TestMiniClusterLoadParallel): test
timed out after 120000 milliseconds
Tests run: 1135, Failures: 2, Errors: 6, Skipped: 8
{code}
Also run those failed tests locally with success. It seems we can go ahead with
0.92 and 0.94 if you don't have any concerns.
> Clients can see holes in the META table when regions are being split
> --------------------------------------------------------------------
>
> Key: HBASE-5986
> URL: https://issues.apache.org/jira/browse/HBASE-5986
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.92.1, 0.96.0, 0.94.1
> Reporter: Enis Soztutar
> Assignee: Enis Soztutar
> Fix For: 0.96.0
>
> Attachments: 5986-v2.txt, HBASE-5986-0.92.patch,
> HBASE-5986-0.94.patch, HBASE-5986-test_v1.patch, HBASE-5986_v3.patch
>
>
> We found this issue when running large scale ingestion tests for HBASE-5754.
> The problem is that the .META. table updates are not atomic while splitting a
> region. In SplitTransaction, there is a time lap between the marking the
> parent offline, and adding of daughters to the META table. This can result in
> clients using MetaScanner, of HTable.getStartEndKeys (used by the
> TableInputFormat) missing regions which are made just offline, but the
> daughters are not added yet.
> This is also related to HBASE-4335.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira