[jira] [Updated] (HBASE-4855) SplitLogManager hangs on cluster restart.
[ https://issues.apache.org/jira/browse/HBASE-4855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ramkrishna.s.vasudevan updated HBASE-4855:

    Affects Version/s: 0.92.0
    Fix Version/s: 0.92.0

SplitLogManager hangs on cluster restart.

    Key: HBASE-4855
    URL: https://issues.apache.org/jira/browse/HBASE-4855
    Project: HBase
    Issue Type: Bug
    Affects Versions: 0.92.0
    Reporter: ramkrishna.s.vasudevan
    Assignee: ramkrishna.s.vasudevan
    Fix For: 0.92.0

1. Start a master and an RS.
2. The RS goes down (kill -9).
3. Wait for ServerShutDownHandler to create the splitlog nodes. As no RS is running, they cannot be processed.
4. Restart the master and bring up an RS.

The master hangs in SplitLogManager.waitforTasks(). I suspect that batch.done is not being incremented properly, but I have not dug into it fully yet.
This may be the reason for the occasional failure of TestDistributedLogSplitting.testWorkerAbort().

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HBASE-4855) SplitLogManager hangs on cluster restart.
[ https://issues.apache.org/jira/browse/HBASE-4855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ramkrishna.s.vasudevan updated HBASE-4855:

    Fix Version/s: (was: 0.92.0)
[jira] [Updated] (HBASE-4855) SplitLogManager hangs on cluster restart.
[ https://issues.apache.org/jira/browse/HBASE-4855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ramkrishna.s.vasudevan updated HBASE-4855:

    Attachment: HBASE-4855.patch

TestDistributedLogSplitting is passing. I will get the results of the other test cases in the morning.
[jira] [Updated] (HBASE-4855) SplitLogManager hangs on cluster restart due to batch.installed doubly counted
[ https://issues.apache.org/jira/browse/HBASE-4855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ted Yu updated HBASE-4855:

    Summary: SplitLogManager hangs on cluster restart due to batch.installed doubly counted (was: SplitLogManager hangs on cluster restart.)
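The revised summary attributes the hang to batch.installed being counted twice. The following is a minimal Java sketch of why that would block a waiter, assuming the wait loop simply compares a "done" counter against an "installed" counter. The class TaskBatchSketch and its methods are hypothetical; they are not taken from SplitLogManager or from the attached patch, and a timeout is added only so that the example terminates.

```java
import java.util.concurrent.TimeUnit;

/** Hypothetical stand-in for a task batch; NOT the real HBase class. */
public class TaskBatchSketch {
    private int installed; // tasks the waiter expects to finish
    private int done;      // tasks reported as finished

    public synchronized void taskInstalled() {
        installed++;
    }

    public synchronized void taskDone() {
        done++;
        notifyAll(); // wake up any waiter so it can re-check the counters
    }

    /**
     * Waits until done == installed, or until the timeout expires.
     * Returns true if the batch completed, false on timeout.
     */
    public synchronized boolean waitForTasks(long timeoutMillis)
            throws InterruptedException {
        long deadline = System.nanoTime()
                + TimeUnit.MILLISECONDS.toNanos(timeoutMillis);
        while (done != installed) {
            long remaining =
                    TimeUnit.NANOSECONDS.toMillis(deadline - System.nanoTime());
            if (remaining <= 0) {
                return false; // without a timeout this would wait forever
            }
            wait(remaining);
        }
        return true;
    }

    public static void main(String[] args) throws InterruptedException {
        // One task, counted once: the waiter returns immediately.
        TaskBatchSketch ok = new TaskBatchSketch();
        ok.taskInstalled();
        ok.taskDone();
        System.out.println("counted once: " + ok.waitForTasks(100));

        // One task, counted twice: done can never catch up with installed.
        TaskBatchSketch bad = new TaskBatchSketch();
        bad.taskInstalled();
        bad.taskInstalled(); // the same task is counted a second time
        bad.taskDone();
        System.out.println("counted twice: " + bad.waitForTasks(100));
    }
}
```

Under these assumptions the first call returns true and the second times out and returns false. A wait loop with no timeout would block indefinitely in the second case, which matches the hang reported in this issue.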
[jira] [Updated] (HBASE-4855) SplitLogManager hangs on cluster restart due to batch.installed doubly counted
[ https://issues.apache.org/jira/browse/HBASE-4855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ted Yu updated HBASE-4855:

    Resolution: Fixed
    Hadoop Flags: Reviewed
    Status: Resolved (was: Patch Available)
[jira] [Updated] (HBASE-4855) SplitLogManager hangs on cluster restart due to batch.installed doubly counted
[ https://issues.apache.org/jira/browse/HBASE-4855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ramkrishna.s.vasudevan updated HBASE-4855:

    Fix Version/s: 0.92.0

Thanks, Ted, for reviewing and committing the patch. Updating the fix version to 0.92.