Jason Lowe created YARN-4414:
--------------------------------

             Summary: Nodemanager connection errors are retried at multiple 
levels
                 Key: YARN-4414
                 URL: https://issues.apache.org/jira/browse/YARN-4414
             Project: Hadoop YARN
          Issue Type: Bug
          Components: nodemanager
    Affects Versions: 2.6.2, 2.7.1
            Reporter: Jason Lowe


This is related to YARN-3238.  Ran into more scenarios where connection errors 
are being retried at multiple levels, like NoRouteToHostException.  The fix for 
YARN-3238 was too specific, and I think we need a more general solution to 
catch a wider array of connection errors that can occur to avoid retrying them 
both at the RPC layer and at the NM proxy layer.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to