[ https://issues.apache.org/jira/browse/HBASE-3345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12971980#action_12971980 ]
stack commented on HBASE-3345:
------------------------------
I should have gotten context from you, Todd, when I was sitting beside you. We
could make a change like this:
{code}
diff --git a/src/main/java/org/apache/hadoop/hbase/master/AssignmentManager.java b/src/main/java/org/apache/hadoop/hbase/master/AssignmentManager.java
index ea064f2..1df65e7 100644
--- a/src/main/java/org/apache/hadoop/hbase/master/AssignmentManager.java
+++ b/src/main/java/org/apache/hadoop/hbase/master/AssignmentManager.java
@@ -1079,6 +1079,12 @@ public class AssignmentManager extends ZooKeeperListener {
     HServerInfo server = null;
     synchronized (this.regions) {
       server = regions.get(region);
+      if (server == null) {
+        LOG.error("Can't unassign region " + region.getRegionNameAsString() +
+          " from a null server; removing from regions!");
+        regions.remove(region);
+        return;
+      }
     }
     try {
       // TODO: We should consider making this look more like it does for the
{code}
... so we flag the problem earlier but just keep going. I wonder how server
got to be null at all. That's what I'd look in the logs for.
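
To make the idea easier to poke at outside the master, here is a minimal,
self-contained sketch of the same guard. It is not the real AssignmentManager:
the class name UnassignGuardSketch, the assign/isTracked helpers, and the use
of plain Strings in place of HRegionInfo/HServerInfo are all assumptions made
for illustration, and the point where the close would be sent is only a comment.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for the master's region -> server bookkeeping.
// Names and types are illustrative only; this is not HBase code.
class UnassignGuardSketch {
  private final Map<String, String> regions = new HashMap<String, String>();

  // Record that a region is hosted on a server (server may be null, which is
  // the bad state the guard below is meant to catch).
  void assign(String region, String server) {
    synchronized (regions) {
      regions.put(region, server);
    }
  }

  // Returns true if a close would be sent to a server, false if the region
  // had no server and was dropped from the map instead.
  boolean unassign(String region) {
    String server;
    synchronized (regions) {
      server = regions.get(region);
      if (server == null) {
        // Flag the problem here rather than letting a null reach the
        // code that sends the close, then keep going.
        System.err.println("Can't unassign region " + region
            + " from a null server; removing from regions!");
        regions.remove(region);
        return false;
      }
    }
    // The real code would ask the server to close the region at this point.
    return true;
  }

  boolean isTracked(String region) {
    synchronized (regions) {
      return regions.containsKey(region);
    }
  }
}
```

With this shape, a region mapped to a null server (or not mapped at all) is
logged and dropped rather than triggering the NPE, while a region with a
server proceeds as before. It does not explain how the null got there.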
> Master crash with NPE during table disable
> ------------------------------------------
>
> Key: HBASE-3345
> URL: https://issues.apache.org/jira/browse/HBASE-3345
> Project: HBase
> Issue Type: Bug
> Components: master
> Affects Versions: 0.90.0
> Reporter: Todd Lipcon
> Priority: Blocker
>
> Running on a config that triggers lots of splits, I attempted to disable a
> table while it was getting a lot of load and injected failures. Got the
> following NPE in master, followed by an abort:
> 2010-12-13 12:52:27,323 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: Starting unassignment of region usertable,user1182862181,1292273503885.d223f1dc4d9003508f2db7566518b05d. (offlining)
> 2010-12-13 12:52:27,323 FATAL org.apache.hadoop.hbase.master.HMaster: Remote unexpected exception
> java.lang.NullPointerException: Passed server is null
>         at org.apache.hadoop.hbase.master.ServerManager.sendRegionClose(ServerManager.java:581)
>         at org.apache.hadoop.hbase.master.AssignmentManager.unassign(AssignmentManager.java:1085)
>         at org.apache.hadoop.hbase.master.AssignmentManager.unassign(AssignmentManager.java:1032)
>         at org.apache.hadoop.hbase.master.handler.DisableTableHandler$BulkDisabler$1.run(DisableTableHandler.java:132)
>         at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
>         at java.lang.Thread.run(Thread.java:619)