Github user ala commented on a diff in the pull request:
https://github.com/apache/spark/pull/20664#discussion_r170277224
--- Diff: core/src/test/scala/org/apache/spark/rdd/RDDSuite.scala ---
@@ -1129,6 +1129,36 @@ class RDDSuite extends SparkFunSuite with
SharedSparkContext {
}.collect()
}
+ test("SPARK-23496: order of input partitions can result in severe skew
in coalesce") {
--- End diff --
The test is in fact deterministic. The seed is already fixed here:
https://github.com/apache/spark/blob/049f243c59737699fee54fdc9d65cbd7c788032a/core/src/main/scala/org/apache/spark/rdd/CoalescedRDD.scala#L163
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]