[ https://issues.apache.org/jira/browse/HADOOP-5730?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12702198#action_12702198 ]
Wang Xu commented on HADOOP-5730: --------------------------------- > 1. What happens if all directories are removed on SecondareNameNode? It's quite a problem, and do you think SecondaryNameNode should throw Exception or kill itself ? > 2. Why do you remove directories only if mkdir() fails? What if rename() > fails before mkdir() for example. I think rename maybe should also be "try...catch" > 3. You cannot just remove a list entry while iterating, this will cause > ConcurrentModificationException > on the next iteration of the loop. oh. I am sorry for that, I will change its position. I will modify it and upload another patch. And I wonder whether it is OK if we only record this problem in logfiles and ignore it. > SecondaryNameNode: should not throw exception and exit if only one makedir > failure > ----------------------------------------------------------------------------------- > > Key: HADOOP-5730 > URL: https://issues.apache.org/jira/browse/HADOOP-5730 > Project: Hadoop Core > Issue Type: Bug > Components: dfs > Affects Versions: 0.19.1 > Reporter: Wang Xu > Assignee: Wang Xu > Fix For: 0.19.2 > > Attachments: secondarynamenode-startcp.patch > > Original Estimate: 2h > Remaining Estimate: 2h > > In CheckpointStorage.startCheckPointing(), if one mkdir failed, it > will throw an exception and exit. > However, because the editlog has been closed before, the editStreams > of FSEditLog of NameNode will becomes empty as a result, which > will affect any further logSync operations. > Hence we think it should only print WARN message instead of > throw the exception -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.