[
https://issues.apache.org/jira/browse/HADOOP-4045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12689724#action_12689724
]
Boris Shkolnik commented on HADOOP-4045:
----------------------------------------
Manual testing done:
1. Mount two directories (one for Edits and Image, one for Edits only).
2. create some files
3. unmount one of them and wait for checkpoint (or create a file) , verify that
failed dir is removed
4. unmount another one (optional) - more verifications
5. mount one back - (checkpoint or new files), verify that checkpointtime is
updated and files have the same size and MD5
6. mount the other one (optional) - more verifications
7. repeat 3 and 5
8. check WebUI all the time.
> Increment checkpoint if we see failures in rollEdits
> ----------------------------------------------------
>
> Key: HADOOP-4045
> URL: https://issues.apache.org/jira/browse/HADOOP-4045
> Project: Hadoop Core
> Issue Type: Bug
> Components: dfs
> Reporter: Lohit Vijayarenu
> Assignee: Boris Shkolnik
> Priority: Critical
> Fix For: 0.19.2
>
> Attachments: HADOOP-4045-1.patch, HADOOP-4045.patch
>
>
> In _FSEditLog::rollEdits_, if we encounter an error during opening edits.new,
> we remove the store directory associated with it. At this point we should
> also increment checkpoint on all other directories.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.