[jira] Updated: (HBASE-3263) Stack overflow in AssignmentManager

2010-11-29 Thread stack (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-3263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-3263:
-

Attachment: 3263.txt

Patch to bound the number of recursive reassign attempts. Not pretty, but it 
should prevent this runaway recursion from happening.
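
For readers without the attachment, here is a minimal sketch of the general 
idea of bounding retries. It is not the contents of 3263.txt and does not use 
the real AssignmentManager API: the class BoundedAssign, the servers list, the 
openRegion predicate, and the maxAttempts parameter are all hypothetical 
stand-ins. It replaces "recurse from the catch clause" with a loop that gives 
up after a fixed number of attempts, so stack depth stays constant even when 
every region server is unreachable.

```java
import java.util.List;
import java.util.Optional;
import java.util.function.Predicate;

public class BoundedAssign {

    /**
     * Tries to place a region on one of the given servers.
     * All names here are illustrative assumptions, not HBase APIs.
     *
     * @param servers     candidate region servers (hypothetical names)
     * @param openRegion  stand-in for the RPC that asks a server to open
     *                    the region; may return false or throw on failure
     * @param maxAttempts upper bound on the total number of tries
     * @return the server that accepted the region, or empty if none did
     */
    static Optional<String> assign(List<String> servers,
                                   Predicate<String> openRegion,
                                   int maxAttempts) {
        if (servers.isEmpty()) {
            return Optional.empty();
        }
        for (int attempt = 0; attempt < maxAttempts; attempt++) {
            // Rotate through the candidates instead of retrying only one.
            String server = servers.get(attempt % servers.size());
            try {
                if (openRegion.test(server)) {
                    return Optional.of(server);
                }
            } catch (RuntimeException e) {
                // A recursive version would call assign() again from here.
                // Falling through to the next loop iteration keeps the
                // stack depth constant instead.
            }
        }
        // Every attempt failed; report that to the caller rather than
        // retrying without limit.
        return Optional.empty();
    }

    public static void main(String[] args) {
        List<String> servers = List.of("rs1", "rs2", "rs3");

        // All servers unreachable: gives up after 10 attempts.
        System.out.println(assign(servers, s -> false, 10).isPresent());

        // Only rs3 is reachable: found on the third attempt.
        System.out.println(assign(servers, s -> s.equals("rs3"), 10).get());
    }
}
```

In this sketch the caller decides what to do when the result is empty, for 
example leaving the region unassigned until a later pass picks it up.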

 Stack overflow in AssignmentManager
 ---

 Key: HBASE-3263
 URL: https://issues.apache.org/jira/browse/HBASE-3263
 Project: HBase
  Issue Type: Bug
  Components: master
Affects Versions: 0.90.0
Reporter: Todd Lipcon
Priority: Blocker
 Fix For: 0.90.0

 Attachments: 3263.txt, stackoverflow-log.txt


 My test cluster experienced a switch outage earlier this week which threw the 
 master into a really bad state. In the catch clause of 
 AssignmentManager.assign, we recurse, and if all of the region servers are 
 inaccessible, we do so until we get a stack overflow.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HBASE-3263) Stack overflow in AssignmentManager

2010-11-23 Thread Todd Lipcon (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-3263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Todd Lipcon updated HBASE-3263:
---

Attachment: stackoverflow-log.txt

Here's a log showing the beginning of the runaway recursion. It continues like 
this until it ends in a stack overflow error.

 Stack overflow in AssignmentManager
 ---

 Key: HBASE-3263
 URL: https://issues.apache.org/jira/browse/HBASE-3263
 Project: HBase
  Issue Type: Bug
  Components: master
Affects Versions: 0.90.0
Reporter: Todd Lipcon
Priority: Blocker
 Attachments: stackoverflow-log.txt






[jira] Updated: (HBASE-3263) Stack overflow in AssignmentManager

2010-11-23 Thread stack (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-3263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-3263:
-

Fix Version/s: 0.90.0

 Stack overflow in AssignmentManager
 ---

 Key: HBASE-3263
 URL: https://issues.apache.org/jira/browse/HBASE-3263
 Project: HBase
  Issue Type: Bug
  Components: master
Affects Versions: 0.90.0
Reporter: Todd Lipcon
Priority: Blocker
 Fix For: 0.90.0

 Attachments: stackoverflow-log.txt


