[ https://issues.apache.org/jira/browse/HBASE-5986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13283881#comment-13283881 ]
Enis Soztutar commented on HBASE-5986: -------------------------------------- Here are the test results for 0.94: {code} Tests run: 551, Failures: 0, Errors: 0, Skipped: 0 ... Tests run: 932, Failures: 1, Errors: 2, Skipped: 9 Failed tests: testShutdownSimpleFixup(org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster): expected:<1> but was:<0> Tests in error: testDelayedRpcImmediateReturnValue(org.apache.hadoop.hbase.ipc.TestDelayedRpc): Call to /127.0.0.1:53586 failed on socket timeout exception: java.net.SocketTimeoutException: 1000 millis timeout while waiting for channel to be ready for read. ch : java.nio.channels.SocketChannel[connected local=/127.0.0.1:53623 remote=/127.0.0.1:53586] testLocalHBaseCluster(org.apache.hadoop.hbase.TestLocalHBaseCluster): Master not initialized after 200 seconds {code} I rerun the tests locally with success, except TestLocalHBaseCluster. But it fails on 0.94 HEAD as well for me. For 0.92: {code} Results : Failed tests: testMultipleResubmits(org.apache.hadoop.hbase.master.TestSplitLogManager) testcomputeHDFSBlocksDistribution(org.apache.hadoop.hbase.util.TestFSUtils) Tests in error: testClusterRestart(org.apache.hadoop.hbase.master.TestRestartCluster): org.apache.hadoop.hbase.PleaseHoldException: Master is initializing testWholesomeSplit(org.apache.hadoop.hbase.regionserver.TestSplitTransaction): Failed delete of /homes/hortonde/enis/code/hbase-0.92/target/test-data/af023188-0b23-4f9d-a9bc-a074e94e57f8/org.apache.hadoop.hbase.regionserver.TestSplitTransaction/table/7c59b6677ad46bf3f652a83de1e62bcb testRollback(org.apache.hadoop.hbase.regionserver.TestSplitTransaction): Target HLog directory already exists: /homes/hortonde/enis/code/hbase-0.92/target/test-data/af023188-0b23-4f9d-a9bc-a074e94e57f8/org.apache.hadoop.hbase.regionserver.TestSplitTransaction/logs testRollback(org.apache.hadoop.hbase.regionserver.TestSplitTransaction) loadTest[0](org.apache.hadoop.hbase.util.TestMiniClusterLoadSequential): test timed out after 120000 milliseconds loadTest[0](org.apache.hadoop.hbase.util.TestMiniClusterLoadParallel): test timed out after 120000 milliseconds Tests run: 1135, Failures: 2, Errors: 6, Skipped: 8 {code} Also run those failed tests locally with success. It seems we can go ahead with 0.92 and 0.94 if you don't have any concerns. > Clients can see holes in the META table when regions are being split > -------------------------------------------------------------------- > > Key: HBASE-5986 > URL: https://issues.apache.org/jira/browse/HBASE-5986 > Project: HBase > Issue Type: Bug > Affects Versions: 0.92.1, 0.96.0, 0.94.1 > Reporter: Enis Soztutar > Assignee: Enis Soztutar > Fix For: 0.96.0 > > Attachments: 5986-v2.txt, HBASE-5986-0.92.patch, > HBASE-5986-0.94.patch, HBASE-5986-test_v1.patch, HBASE-5986_v3.patch > > > We found this issue when running large scale ingestion tests for HBASE-5754. > The problem is that the .META. table updates are not atomic while splitting a > region. In SplitTransaction, there is a time lap between the marking the > parent offline, and adding of daughters to the META table. This can result in > clients using MetaScanner, of HTable.getStartEndKeys (used by the > TableInputFormat) missing regions which are made just offline, but the > daughters are not added yet. > This is also related to HBASE-4335. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira