Daryn Sharp created YARN-3760: --------------------------------- Summary: Log aggregation failures Key: YARN-3760 URL: https://issues.apache.org/jira/browse/YARN-3760 Project: Hadoop YARN Issue Type: Bug Components: nodemanager Affects Versions: 2.4.0 Reporter: Daryn Sharp Priority: Critical
The aggregated log file does not appear to be properly closed when writes fail. This leaves a lease renewer active in the NM that spams the NN with lease renewals. If the token is marked not to be cancelled, the renewals appear to continue until the token expires. If the token is cancelled, the periodic renew spam turns into a flood of failed connections until the lease renewer gives up. -- This message was sent by Atlassian JIRA (v6.3.4#6332)