GitHub user jiangxb1987 opened a pull request:
https://github.com/apache/spark/pull/21927
[SPARK-24820][Core] Fail fast when submitted job contains
PartitionPruningRDD in a barrier stage
## What changes were proposed in this pull request?
`PartitionPruningRDD` may launch tasks on partial partitions, we shall
check on job submit to make sure we are not launching a barrier stage that
contains PartitionPruningRDD, otherwise shall fail fast.
## How was this patch tested?
Add test cases in `BarrierStageOnSubmittedSuite`.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/jiangxb1987/spark SPARK-24820
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/21927.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #21927
----
commit 0733bfb06c8641969a70f59a3f8c5b2e4c7a5eca
Author: Xingbo Jiang <xingbo.jiang@...>
Date: 2018-07-31T03:27:33Z
Fail fast when submitted job contains PartitionPruningRDD in a barrier stage
----
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]