[ https://issues.apache.org/jira/browse/YARN-4552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Junping Du updated YARN-4552:
-----------------------------
    Attachment: YARN-4552.patch

Uploaded a quick patch to fix it. It does not include any tests so far.

> NM ResourceLocalizationService should check and initialize local filecache
> dir (and log dir) even if NM recover is enabled.
> ---------------------------------------------------------------------------------------------------------------------------
>
>                 Key: YARN-4552
>                 URL: https://issues.apache.org/jira/browse/YARN-4552
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>            Reporter: Junping Du
>            Assignee: Junping Du
>            Priority: Critical
>         Attachments: YARN-4552.patch
>
>
> In some cases, users clean up the localized file cache for debugging or
> troubleshooting purposes while the NM is down. However, after the NM is
> brought back up (with recovery enabled), job submission can fail with an
> exception like the one below:
> {noformat}
> Diagnostics: java.io.FileNotFoundException: File
> /disk/12/yarn/local/filecache does not exist.
> {noformat}
> This is because the filecache dir is only created when recovery is not
> enabled, at the time ResourceLocalizationService is initialized/started.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
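
For illustration only, here is a minimal sketch of the idea in the issue title: check for and create the filecache dir under each local dir on every start, whether or not recovery is enabled. This is not the attached patch and does not use the NodeManager's own classes. The class name `LocalDirInitializer`, the method `ensureFileCacheDirs`, and the `recoveryEnabled` parameter are hypothetical, and the sketch uses plain `java.nio.file` rather than Hadoop's filesystem API.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of Hadoop.
public class LocalDirInitializer {

    // Subdirectory name taken from the path in the reported exception.
    static final String FILECACHE = "filecache";

    /**
     * Makes sure every local dir contains a filecache subdirectory.
     * The directory check is deliberately NOT gated on recoveryEnabled.
     */
    public static void ensureFileCacheDirs(List<Path> localDirs,
                                           boolean recoveryEnabled)
            throws IOException {
        if (!recoveryEnabled) {
            // Assumption: any fresh-start-only work would go here.
            // It is omitted from this sketch.
        }
        for (Path localDir : localDirs) {
            // createDirectories succeeds without changes if the directory
            // already exists, so it is safe to call on every start.
            Files.createDirectories(localDir.resolve(FILECACHE));
        }
    }

    public static void main(String[] args) throws IOException {
        // Simulate a local dir whose filecache was removed during downtime.
        Path localDir = Files.createTempDirectory("nm-local");
        ensureFileCacheDirs(Arrays.asList(localDir), true);
        System.out.println(Files.isDirectory(localDir.resolve(FILECACHE)));
    }
}
```

Because the creation call is idempotent, running it on a recovered NM leaves existing cache contents untouched and only fills in directories that are missing.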