[ https://issues.apache.org/jira/browse/HBASE-5875?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13399075#comment-13399075 ]
rajeshbabu commented on HBASE-5875: ----------------------------------- Test suite result : {code} Results : Failed tests: testExceptionFromCoprocessorDuringPut(org.apache.hadoop.hbase.coprocessor.TestRegionServerCoprocessorExceptionWithAbort): The put should have failed, as the coprocessor is buggy testDrainingServerOffloading(org.apache.hadoop.hbase.TestDrainingServer): expected:<1> but was:<0> testTaskResigned(org.apache.hadoop.hbase.master.TestSplitLogManager): version1=2, version=2 testNullReturn(org.apache.hadoop.hbase.regionserver.TestServerCustomProtocol): Results should contain region test,bbb,1340328821040.9fe2c292d7f212976859364f8aef27a3. for row 'bbb' testRowMutationMultiThreads(org.apache.hadoop.hbase.regionserver.TestAtomicOperation): expected:<0> but was:<3> testPermMask(org.apache.hadoop.hbase.util.TestFSUtils): expected:<rwx------> but was:<rwxrwxrwx> Tests in error: testWholesomeSplit(org.apache.hadoop.hbase.regionserver.TestSplitTransaction): Failed delete of /mnt/F/hbaseTrunkNew/hbase-server/target/test-data/a9504511-b767-40bb-8c4b-4550baa22da2/org.apache.hadoop.hbase.regionserver.TestSplitTransaction/table/7fcde0d5873845498b313524c3416091 testRollback(org.apache.hadoop.hbase.regionserver.TestSplitTransaction): Failed delete of /mnt/F/hbaseTrunkNew/hbase-server/target/test-data/74d5334b-a9d3-4213-b568-8315e066df68/org.apache.hadoop.hbase.regionserver.TestSplitTransaction/table/9d8fa21602ce5ba40d1fa704094c8e25 testOffPeakCompactionRatio(org.apache.hadoop.hbase.regionserver.TestCompactSelection): Target HLog directory already exists: /mnt/F/hbaseTrunkNew/hbase-server/target/test-data/89a77fb2-2048-414c-8f94-6b9a43a51937/TestCompactSelection/logs testMultiRowMutationMultiThreads(org.apache.hadoop.hbase.regionserver.TestAtomicOperation): java.io.FileNotFoundException: /mnt/F/hbaseTrunkNew/hbase-server/target/classes/hbase-default.xml (Too many open files) testCacheOnWriteInSchema[1](org.apache.hadoop.hbase.regionserver.TestCacheOnWriteInSchema): Target HLog directory already exists: /mnt/F/hbaseTrunkNew/hbase-server/target/test-data/1480ac68-4774-454e-9127-e9bfd20864f6/TestCacheOnWriteInSchema/logs testCacheOnWriteInSchema[2](org.apache.hadoop.hbase.regionserver.TestCacheOnWriteInSchema): Target HLog directory already exists: /mnt/F/hbaseTrunkNew/hbase-server/target/test-data/1480ac68-4774-454e-9127-e9bfd20864f6/TestCacheOnWriteInSchema/logs loadTest[0](org.apache.hadoop.hbase.util.TestMiniClusterLoadParallel): test timed out after 120000 milliseconds loadTest[1](org.apache.hadoop.hbase.util.TestMiniClusterLoadParallel): test timed out after 120000 milliseconds Tests run: 1577, Failures: 6, Errors: 8, Skipped: 9 {code} ran failed test cases individually these test cases passes. {code} Running org.apache.hadoop.hbase.TestDrainingServer Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 38.338 sec Running org.apache.hadoop.hbase.coprocessor.TestRegionServerCoprocessorExceptionWithAbort Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 31.07 sec Running org.apache.hadoop.hbase.regionserver.TestServerCustomProtocol Tests run: 7, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 23.353 sec Running org.apache.hadoop.hbase.master.TestSplitLogManager Tests run: 12, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 20.243 sec {code} tests in below test cases are failing but these are not related to this issue. I will check these. TestMiniClusterLoadParallel,TestAtomicOperation,TestCacheOnWriteInSchema,TestCompactSelection,TestFSUtils > Process RIT and Master restart may remove an online server considering it as > a dead server > ------------------------------------------------------------------------------------------ > > Key: HBASE-5875 > URL: https://issues.apache.org/jira/browse/HBASE-5875 > Project: HBase > Issue Type: Bug > Affects Versions: 0.92.1 > Reporter: ramkrishna.s.vasudevan > Assignee: ramkrishna.s.vasudevan > Fix For: 0.94.1 > > Attachments: HBASE-5875.patch, HBASE-5875_0.94.patch, > HBASE-5875_0.94_1.patch, HBASE-5875_trunk.patch, HBASE-5875v2.patch > > > If on master restart it finds the ROOT/META to be in RIT state, master tries > to assign the ROOT region through ProcessRIT. > Master will trigger the assignment and next will try to verify the Root > Region Location. > Root region location verification is done seeing if the RS has the region in > its online list. > If the master triggered assignment has not yet been completed in RS then the > verify root region location will fail. > Because it failed > {code} > splitLogAndExpireIfOnline(currentRootServer); > {code} > we do split log and also remove the server from online server list. Ideally > here there is nothing to do in splitlog as no region server was restarted. > So master, though the server is online, master just invalidates the region > server. > In a special case, if i have only one RS then my cluster will become non > operative. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira