Tim Armstrong created IMPALA-9611:
-------------------------------------

             Summary: Hang in HandoffToProbesAndWait()
                 Key: IMPALA-9611
                 URL: https://issues.apache.org/jira/browse/IMPALA-9611
             Project: IMPALA
          Issue Type: Bug
          Components: Backend
    Affects Versions: Impala 3.4.0
            Reporter: Tim Armstrong
            Assignee: Tim Armstrong


I saw a hang triggered by test_failpoints in 
JoinBuilder::HandofftoProbesAndWait(), where the thread was blocked but 
build_side_state->is_cancelled_ is true.

The sequence of events leading to the bug is as follows:
 * Thread A is in HandoffToProbesAndWait(), reads is_cancelled_ and sees false.
 * Thread B in RuntimeState::Cancel() sets is_cancelled_ = true, acquires 
cancellation_cvs_lock_, then calls NotifyAll() on the condition variable
 * Thread A calls Wait() on the cv, blocks forever.

I think this is most likely if thread A is de-scheduled at the wrong time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to