wzhfy removed a comment on pull request #29580:
URL: https://github.com/apache/spark/pull/29580#issuecomment-686211231


   > > Then other threads can not process messages in that inbox, which causes 
the endpoint to hang
   > 
   > Other than what inbox has been stopped, this would not happen.
   > Are you referring to this ? Or any other cases ?
   
   In our case, the `DriverEndpoint` can't process any message after OOM 
happens in a dispatcher thread. Cluster's spark version is 2.3, but I think 
same problem would exist in 2.4, and for other endpoints in 3.0 
(`DriverEndpoint` becomes an `IsolatedRpcEndpoint` instead of 
`ThreadSafeRpcEndpoint`).
   
   IIUC, an inbox is stopped only when it's unregistered.
   When a dispatcher thread is processing messages in an inbox, if a fatal 
error (e.g. OOM) happens, it will just throw the error. 
   I don't find any place to stop the inbox when this case happens.
   Please correct me if I'm wrong, thanks!


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to