[ 
https://issues.apache.org/jira/browse/HDFS-4138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13489054#comment-13489054
 ] 

Kihwal Lee commented on HDFS-4138:
----------------------------------

When FSNamesystem#startCommonServices() is called from initialize(), the 
following error occurs while DefaultMetricsSystem.instance().register() is 
being invoked. A MiniDFSCluster was up and ready before the backup node was 
started.

{panel}
2012-11-01 12:59:43,397 ERROR lib.MethodMetric (MethodMetric.java:snapshot(118)) - Error invoking method getTransactionsSinceLastLogRoll
java.lang.reflect.InvocationTargetException
...
Caused by: java.lang.IllegalStateException: Bad state: UNINITIALIZED
        at com.google.common.base.Preconditions.checkState(Preconditions.java:172)
        at org.apache.hadoop.hdfs.server.namenode.FSEditLog.getCurSegmentTxId(FSEditLog.java:452)
        at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getTransactionsSinceLastLogRoll(FSNamesystem.java:3488)
{panel} 
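
The trace above is consistent with a metrics getter being invoked through reflection before the object behind it has left its initial state. The following is a minimal JDK-only sketch of that pattern, not Hadoop's actual code: EditLogLike, State, open() and snapshot() are hypothetical stand-ins, and the state check is written by hand rather than with Guava's Preconditions.

```java
import java.lang.reflect.InvocationTargetException;
import java.lang.reflect.Method;

public class UninitializedGetterSketch {
    // Hypothetical lifecycle states; only the UNINITIALIZED name comes from the log.
    public enum State { UNINITIALIZED, OPEN }

    // Hypothetical stand-in for an edit log whose getter is valid only once opened.
    public static class EditLogLike {
        private State state = State.UNINITIALIZED;
        private long curSegmentTxId = 0L;

        public void open(long firstTxId) {
            curSegmentTxId = firstTxId;
            state = State.OPEN;
        }

        public long getCurSegmentTxId() {
            // Hand-written equivalent of a checkState-style guard.
            if (state != State.OPEN) {
                throw new IllegalStateException("Bad state: " + state);
            }
            return curSegmentTxId;
        }
    }

    // Calls a no-arg getter reflectively, as a metrics snapshot might, and
    // reports the wrapped cause instead of propagating it.
    public static String snapshot(Object source, String methodName) {
        try {
            Method m = source.getClass().getMethod(methodName);
            return String.valueOf(m.invoke(source));
        } catch (InvocationTargetException e) {
            // The getter's own exception arrives wrapped; getCause() unwraps it.
            return "Error invoking method " + methodName + ": " + e.getCause();
        } catch (ReflectiveOperationException e) {
            return "Reflection failure: " + e;
        }
    }

    public static void main(String[] args) {
        EditLogLike log = new EditLogLike();
        // Getter called before open(): the guard throws and is reported.
        System.out.println(snapshot(log, "getCurSegmentTxId"));
        log.open(42L);
        // Getter called after open(): the value is returned.
        System.out.println(snapshot(log, "getCurSegmentTxId"));
    }
}
```

In this sketch the failure goes away once the getter is only reached after open(). Whether the corresponding fix for the backup node is a matter of ordering or of guarding the metric is not established by the log alone.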

It then tries to enter the active state, which fails.

{panel}
2012-11-01 12:59:43,445 FATAL namenode.NameNode (NameNode.java:doImmediateShutdown(1291)) - Error encountered requiring NN shutdown. Shutting down immediately.
java.lang.NullPointerException
        at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.startActiveServices(FSNamesystem.java:645)
        at org.apache.hadoop.hdfs.server.namenode.NameNode$NameNodeHAContext.startActiveServices(NameNode.java:1315)
        at org.apache.hadoop.hdfs.server.namenode.ha.ActiveState.enterState(ActiveState.java:61)
{panel}

The resulting shutdown is caught by ExitUtil, and the test fails.
                
> TestBackupNode.testCheckpointNode fails due to a backup node startup issue.
> ---------------------------------------------------------------------------
>
>                 Key: HDFS-4138
>                 URL: https://issues.apache.org/jira/browse/HDFS-4138
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: ha, name-node
>    Affects Versions: 2.0.3-alpha
>            Reporter: Kihwal Lee
>
> When a backup node is starting up, it tries to enter the active state and 
> start common services. It fails to start the services and exits, and the 
> exit is caught by the exit util.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
