Forgot to add that Hadoop distribution is cdh3u3  ...

Thanks
-- Alex

On Wed, May 9, 2012 at 1:58 PM, Alex Levin <ale...@gmail.com> wrote:
> Hi,
>
> I have an issue with crashing secondary namenode due to a simple move
> operation ....
> Appreciate any ideas on the resolution ...
>
> Details bellow:
> I was moving old backups to a separate folder, exact command:
>
>     sudo -u hdfs hadoop fs -mv /hbase-bak /backup/
>
> and shortly after the command secondary namenode crashed with following 
> message:
>
>  2012-05-09 09:37:44,168 INFO
> org.apache.hadoop.hdfs.server.common.Storage: Edits file
> /NNBak/current/edits of size 7680232 edits # 45318 loaded in 1
> seconds.
> 2012-05-09 09:37:44,232 INFO
> org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Number of
> transactions: 0 Total time for transactions(ms): 0Number of
> transactions batched in S
> 2012-05-09 09:37:45,449 ERROR
> org.apache.hadoop.hdfs.server.common.Storage: Unable to save image for
> /NNBak
> java.io.IOException: saveLeases found path
> /backup/base-bak/.logs/data1,60020,1304443405002/data1%3A60020.1308869024750
> but no matching entry in namespace.
>        at 
> org.apache.hadoop.hdfs.server.namenode.FSNamesystem.saveFilesUnderConstruction(FSNamesystem.java:5449)
>        at 
> org.apache.hadoop.hdfs.server.namenode.FSImage.saveFSImage(FSImage.java:1070)
>        at 
> org.apache.hadoop.hdfs.server.namenode.FSImage.saveCurrent(FSImage.java:1172)
>        at 
> org.apache.hadoop.hdfs.server.namenode.FSImage.saveNamespace(FSImage.java:1120)
>        at 
> org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode$CheckpointStorage.doMerge(SecondaryNameNode.java:731)
>        at 
> org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode$CheckpointStorage.access$500(SecondaryNameNode.java:628)
>        at 
> org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode.doMerge(SecondaryNameNode.java:505)
>        at 
> org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode.doCheckpoint(SecondaryNameNode.java:469)
>        at 
> org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode.doWork(SecondaryNameNode.java:333)
>        at 
> org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode.run(SecondaryNameNode.java:297)
>        at java.lang.Thread.run(Thread.java:662)
> 2012-05-09 09:37:45,450 WARN
> org.apache.hadoop.hdfs.server.common.Storage: Removing storage dir
> /NNBak
> 2012-05-09 09:37:45,450 FATAL
> org.apache.hadoop.hdfs.server.namenode.FSNamesystem: No edit streams
> are accessible
> java.lang.Exception: No edit streams are accessible
>        at 
> org.apache.hadoop.hdfs.server.namenode.FSEditLog.fatalExit(FSEditLog.java:410)
>        at 
> org.apache.hadoop.hdfs.server.namenode.FSEditLog.exitIfNoStreams(FSEditLog.java:429)
>        at 
> org.apache.hadoop.hdfs.server.namenode.FSEditLog.open(FSEditLog.java:374)
>        at 
> org.apache.hadoop.hdfs.server.namenode.FSImage.saveNamespace(FSImage.java:1158)
>        at 
> org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode$CheckpointStorage.doMerge(SecondaryNameNode.java:731)
>        at 
> org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode$CheckpointStorage.access$500(SecondaryNameNode.java:628)
>        at 
> org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode.doMerge(SecondaryNameNode.java:505)
>        at 
> org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode.doCheckpoint(SecondaryNameNode.java:469)
>        at 
> org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode.doWork(SecondaryNameNode.java:333)
>        at 
> org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode.run(SecondaryNameNode.java:297)
>        at java.lang.Thread.run(Thread.java:662)
> 2012-05-09 09:37:45,451 INFO
> org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode:
> SHUTDOWN_MSG:
>
>
> looks like it is  expecting the file
> /backup/base-bak/.logs/data1,60020,1304443405002/data1%3A60020.1308869024750
> which never existed
> but there is   
> /backup/hbase-bak/.logs/data1,60020,1304443405002/data1%3A60020.1308869024750
> which was moved
>
>
> fsck / and fsck /backup  returns no issues
> I can backup  fsimage and edits from
>  http://namenode:50070/getimage?getimage=1
>  http://namnoede:50070/getimage?getedit=1
>
>
> but all attempts to start the secondary namenode resulted in the same crash 
> ...
>
>
> Thanks
> -- Alex

Reply via email to