Jason Lowe created YARN-4414:

             Summary: Nodemanager connection errors are retried at multiple levels
                 Key: YARN-4414
                 URL: https://issues.apache.org/jira/browse/YARN-4414
             Project: Hadoop YARN
          Issue Type: Bug
          Components: nodemanager
    Affects Versions: 2.6.2, 2.7.1
            Reporter: Jason Lowe

This is related to YARN-3238.  I ran into more scenarios where connection 
errors, such as NoRouteToHostException, are being retried at multiple levels.  
The fix for YARN-3238 was too specific, and I think we need a more general 
solution that catches a wider array of connection errors, so that they are not 
retried both at the RPC layer and at the NM proxy layer.
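
To illustrate what a more general check might look like, here is a minimal 
sketch. It is not the actual fix and not existing Hadoop code: the class name 
ConnectionErrors, the method isConnectionFailure, and the particular set of 
exception types are all assumptions made for illustration. The idea is that 
the outer (NM proxy) layer could walk the cause chain of a failure and decline 
to retry anything that looks like a connection-level error, on the assumption 
that the RPC layer has already retried it.

```java
import java.net.ConnectException;
import java.net.NoRouteToHostException;
import java.net.UnknownHostException;

// Hypothetical helper, not part of Hadoop: classifies a failure as a
// connection-level error by inspecting the exception and its causes.
final class ConnectionErrors {
    // Bound the walk so a cyclic cause chain cannot loop forever.
    private static final int MAX_CAUSE_DEPTH = 16;

    private ConnectionErrors() {}

    // Returns true if the throwable, or any cause within MAX_CAUSE_DEPTH,
    // is one of the connection-related types listed below. The list is an
    // assumption; a real fix would need to decide which types belong here.
    static boolean isConnectionFailure(Throwable t) {
        Throwable cur = t;
        for (int depth = 0; cur != null && depth < MAX_CAUSE_DEPTH; depth++) {
            if (cur instanceof ConnectException
                    || cur instanceof NoRouteToHostException
                    || cur instanceof UnknownHostException) {
                return true;
            }
            cur = cur.getCause();
        }
        return false;
    }
}
```

With something like this, the proxy-level retry logic could fail fast when 
ConnectionErrors.isConnectionFailure(e) is true and keep its own retries for 
other kinds of errors.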

This message was sent by Atlassian JIRA
