Tim Armstrong created IMPALA-7857:
-------------------------------------

             Summary: Log more information about statestore failure detector
                 Key: IMPALA-7857
                 URL: https://issues.apache.org/jira/browse/IMPALA-7857
             Project: IMPALA
          Issue Type: Bug
          Components: Distributed Exec
            Reporter: Tim Armstrong
            Assignee: Tim Armstrong


For debugging heartbeat failures (or non-failures) it would be useful to log 
enough information to infer the current state of the failure detector from 
logs. Specifically:
* Upon a failure, we should log the number of consecutive failures recorded by 
the failure detector, and possibly also how many more failures remain before 
the peer is considered failed.
* We should log when the failure count is reset to 0 by a successful heartbeat.

Currently, if there are occasional failures, it's hard to tell with certainty 
whether the failure count was reset correctly.
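
As a rough illustration of the two log messages proposed above, here is a 
minimal sketch of a missed-heartbeat counter. Everything in it is an 
assumption made for illustration: the class name HeartbeatFailureTracker, the 
max_missed threshold, the peer IDs and the message wording are hypothetical 
and are not taken from Impala's actual failure detector code. The method 
returns the log line instead of writing it, so the example has no dependency 
on a logging library.

```cpp
#include <map>
#include <sstream>
#include <string>

// Hypothetical sketch, not Impala's real failure detector. Tracks consecutive
// missed heartbeats per peer and builds the log line for each state change.
class HeartbeatFailureTracker {
 public:
  // max_missed: consecutive failures after which a peer is considered failed.
  explicit HeartbeatFailureTracker(int max_missed) : max_missed_(max_missed) {}

  // Records one heartbeat result and returns the message to log. Returns an
  // empty string when there is nothing to report (success with count already 0).
  std::string Update(const std::string& peer, bool success) {
    int& missed = missed_[peer];  // value-initialized to 0 on first use
    std::ostringstream msg;
    if (success) {
      if (missed > 0) {
        // Log the reset so it is visible that the count went back to 0.
        msg << "Heartbeat to " << peer << " succeeded; resetting failure "
            << "count from " << missed << " to 0";
        missed = 0;
      }
      return msg.str();
    }
    ++missed;
    const int remaining = max_missed_ - missed;
    msg << "Heartbeat to " << peer << " failed; consecutive failures: "
        << missed;
    if (remaining > 0) {
      msg << ", " << remaining << " more until considered failed";
    } else {
      msg << "; peer is considered failed";
    }
    return msg.str();
  }

  // Current consecutive failure count for the peer (0 if never seen).
  int MissedCount(const std::string& peer) const {
    auto it = missed_.find(peer);
    return it == missed_.end() ? 0 : it->second;
  }

 private:
  const int max_missed_;
  std::map<std::string, int> missed_;
};
```

With a threshold of 3, two failed heartbeats followed by a success would 
produce two failure lines carrying the running count and one reset line, which 
is the information needed to reconstruct the detector state from the logs.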



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
