GitHub user zsxwing opened a pull request:
https://github.com/apache/spark/pull/19762
[SPARK-22535][PySpark] Sleep before killing the python worker in
PythonRunner.MonitorThread
## What changes were proposed in this pull request?
`PythonRunner.MonitorThread` should give the task a little time to finish
before forcibly killing the python worker. This will reduce the chance of the
race condition a lot. I also improved the log a bit to find out the task to
blame when it's stuck.
## How was this patch tested?
Jenkins
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/zsxwing/spark SPARK-22535
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/19762.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #19762
----
commit a1f054ba9aff2bef5caf4534ef56dfd5e22eb96e
Author: Shixiong Zhu <[email protected]>
Date: 2017-11-16T00:35:09Z
Sleep before killing the python worker in PythonRunner.MonitorThread
----
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]