[jira] Updated: (HBASE-3263) Stack overflow in AssignmentManager
[ https://issues.apache.org/jira/browse/HBASE-3263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HBASE-3263: - Attachment: 3263.txt Patch to bound the attempts at reassign recursions. Not pretty but should prevent this runaway from happening. Stack overflow in AssignmentManager --- Key: HBASE-3263 URL: https://issues.apache.org/jira/browse/HBASE-3263 Project: HBase Issue Type: Bug Components: master Affects Versions: 0.90.0 Reporter: Todd Lipcon Priority: Blocker Fix For: 0.90.0 Attachments: 3263.txt, stackoverflow-log.txt My test cluster experienced a switch outage earlier this week which threw the master into a really bad state. In the catch clause of AssignmentManager.assign, we recurse, and if all of the region servers are inaccessible, we do so until we get a stack overflow. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Updated: (HBASE-3263) Stack overflow in AssignmentManager
[ https://issues.apache.org/jira/browse/HBASE-3263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HBASE-3263: --- Attachment: stackoverflow-log.txt Here's a log showing the beginning of the runaway recursion. It goes like this until it gets a stack overflow error. Stack overflow in AssignmentManager --- Key: HBASE-3263 URL: https://issues.apache.org/jira/browse/HBASE-3263 Project: HBase Issue Type: Bug Components: master Affects Versions: 0.90.0 Reporter: Todd Lipcon Priority: Blocker Attachments: stackoverflow-log.txt My test cluster experienced a switch outage earlier this week which threw the master into a really bad state. In the catch clause of AssignmentManager.assign, we recurse, and if all of the region servers are inaccessible, we do so until we get a stack overflow. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Updated: (HBASE-3263) Stack overflow in AssignmentManager
[ https://issues.apache.org/jira/browse/HBASE-3263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HBASE-3263: - Fix Version/s: 0.90.0 Stack overflow in AssignmentManager --- Key: HBASE-3263 URL: https://issues.apache.org/jira/browse/HBASE-3263 Project: HBase Issue Type: Bug Components: master Affects Versions: 0.90.0 Reporter: Todd Lipcon Priority: Blocker Fix For: 0.90.0 Attachments: stackoverflow-log.txt My test cluster experienced a switch outage earlier this week which threw the master into a really bad state. In the catch clause of AssignmentManager.assign, we recurse, and if all of the region servers are inaccessible, we do so until we get a stack overflow. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.