Daryn Sharp created YARN-3760:
---------------------------------
Summary: Log aggregation failures
Key: YARN-3760
URL: https://issues.apache.org/jira/browse/YARN-3760
Project: Hadoop YARN
Issue Type: Bug
Components: nodemanager
Affects Versions: 2.4.0
Reporter: Daryn Sharp
Priority: Critical
The aggregated log file does not appear to be properly closed when writes fail.
This leaves a lease renewer active in the NM that spams the NN with lease
renewals. If the token is marked not to be cancelled, the renewals appear to
continue until the token expires. If the token is cancelled, the periodic
renew spam turns into a flood of failed connections until the lease renewer
gives up.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)