Sergey Shelukhin created HIVE-18078:
---------------------------------------

             Summary: WM getSession needs some retry logic
                 Key: HIVE-18078
                 URL: https://issues.apache.org/jira/browse/HIVE-18078
             Project: Hive
          Issue Type: Sub-task
            Reporter: Sergey Shelukhin


When we get a bad session (e.g. no registry info because the AM has gone
catatonic), the timeout future fails, and that failure fails the getSession call.
The retry model in TezTask is that it gets a session (which in the original
model can be completely unusable, but we still get the object), and then retries
(reopens) if it's a lemon. If the reopen fails, we fail.
getSession is not covered by this retry scheme, and should thus do its own
retries (or the retry logic needs to be changed).
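
A minimal sketch of what "getSession does its own retries" could look like,
assuming the bad-session case surfaces as a TimeoutException from the timeout
future. The class GetSessionRetrySketch, the helper getWithRetry and the
maxAttempts parameter are hypothetical names used only for illustration; they
are not existing Hive APIs, and the session is stood in for by a plain String.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.TimeoutException;
import java.util.concurrent.atomic.AtomicInteger;

public class GetSessionRetrySketch {

  /**
   * Hypothetical helper: runs the attempt up to maxAttempts times and retries
   * only when it fails with a TimeoutException (assumed here to be how a bad
   * session shows up). Any other exception propagates immediately.
   */
  public static <T> T getWithRetry(Callable<T> attempt, int maxAttempts) throws Exception {
    if (maxAttempts < 1) {
      throw new IllegalArgumentException("maxAttempts must be at least 1");
    }
    Exception last = null;
    for (int i = 1; i <= maxAttempts; i++) {
      try {
        return attempt.call();
      } catch (TimeoutException e) {
        last = e; // bad session; fall through and try again
      }
    }
    throw last; // every attempt timed out, so fail like the single-shot call would
  }

  public static void main(String[] args) throws Exception {
    AtomicInteger calls = new AtomicInteger();
    // Stand-in for getSession: the first two attempts time out, the third succeeds.
    String session = getWithRetry(() -> {
      if (calls.incrementAndGet() < 3) {
        throw new TimeoutException("no registry info");
      }
      return "session-" + calls.get();
    }, 3);
    System.out.println(session + " after " + calls.get() + " attempts");
  }
}
```

The retry bound is kept explicit so that a persistently bad AM still fails the
call instead of looping forever. Whether the retries belong inside getSession
or in the TezTask retry scheme is the open question of this issue.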



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
