Michael Ho created IMPALA-9006:
----------------------------------

             Summary: Consolidate the Statestore subscriber's retry logic
                 Key: IMPALA-9006
                 URL: https://issues.apache.org/jira/browse/IMPALA-9006
             Project: IMPALA
          Issue Type: Improvement
          Components: Distributed Exec
    Affects Versions: Impala 3.4.0
            Reporter: Michael Ho
            Assignee: Michael Ho


Currently, a Statestore subscriber starts a separate thread after the initial 
registration with Statestore to periodically check whether the Statestore may 
have failed, and re-registers with Statestore if necessary. Similarly, the 
function {{StatestoreSubscriber::Register()}} also relies on the old Thrift 
client's retry logic to retry failed RPC attempts to Statestore. This is needed 
because the initial registration has to wait for Statestore to start up in case 
an Impala daemon starts before the Statestore. These two retry mechanisms 
should be consolidated.
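
As a rough illustration of what a single consolidated retry path could look 
like, here is a minimal sketch in Python. It is not Impala's implementation: 
{{register_with_retry}}, {{register_fn}} and the backoff parameters are 
hypothetical names chosen for this example, and {{register_fn}} merely stands 
in for the registration RPC.

```python
import time
from typing import Callable


def register_with_retry(
    register_fn: Callable[[], bool],
    max_attempts: int = 10,
    initial_delay_s: float = 0.5,
    max_delay_s: float = 8.0,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Call register_fn until it succeeds; return the number of attempts used.

    register_fn is a hypothetical stand-in for the registration RPC. It
    returns True on success, and returns False or raises ConnectionError
    when the Statestore is unreachable or rejects the registration.
    """
    delay = initial_delay_s
    for attempt in range(1, max_attempts + 1):
        try:
            if register_fn():
                return attempt
        except ConnectionError:
            pass  # Statestore may not be up yet; fall through and retry.
        if attempt < max_attempts:
            sleep(delay)
            delay = min(delay * 2, max_delay_s)  # capped exponential backoff
    raise TimeoutError(f"registration failed after {max_attempts} attempts")
```

The same loop could serve both the initial registration and re-registration 
after a suspected Statestore failure, so the retry policy lives in one place 
rather than being split between a dedicated thread and the RPC client.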

Last but not least, the current registration logic at Statestore doesn't check 
whether the address provided by the subscriber can actually be resolved. In 
certain deployment scenarios, it's possible that the address passed by a 
subscriber is not yet resolvable (e.g. a Kubernetes pod whose readiness probe 
has failed). Statestore should check whether the address is resolvable and fail 
the registration if it is not. The subscriber can then keep retrying until its 
address can be resolved by Statestore. This is particularly useful in 
configurations where the readiness probe of a Kubernetes pod is used to 
implement a warm backup.
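
The proposed check can be sketched as follows, again in Python and only under 
stated assumptions: {{is_resolvable}}, {{handle_register}} and the 
{{subscribers}} dictionary are hypothetical names for illustration and do not 
correspond to Impala's Statestore code. The only real API used is Python's 
{{socket.getaddrinfo}}; the resolver is passed in as a parameter so the logic 
can be exercised without DNS.

```python
import socket
from typing import Callable, Dict, Tuple


def is_resolvable(
    host: str, port: int, resolver: Callable = socket.getaddrinfo
) -> bool:
    """Return True if host resolves to at least one address."""
    try:
        return len(resolver(host, port, proto=socket.IPPROTO_TCP)) > 0
    except socket.gaierror:
        return False  # name lookup failed, e.g. the pod is not ready yet


def handle_register(
    subscribers: Dict[str, Tuple[str, int]],
    subscriber_id: str,
    host: str,
    port: int,
    resolver: Callable = socket.getaddrinfo,
) -> bool:
    """Hypothetical registration handler: reject unresolvable addresses."""
    if not is_resolvable(host, port, resolver):
        return False  # the subscriber is expected to retry later
    subscribers[subscriber_id] = (host, port)
    return True
```

A rejected registration leaves no state behind on the Statestore side, so a 
subscriber that keeps retrying is only registered once its address resolves.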



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
