[GitHub] spark issue #21698: [SPARK-23243][Core] Fix RDD.repartition() data correctne...

cloud-fan Wed, 11 Jul 2018 01:21:27 -0700

Github user cloud-fan commented on the issue:

    https://github.com/apache/spark/pull/21698
  
    For `zip`, it's hard to define what the "correct" result is, given that an 
RDD is unordered. I think `sample` is similar.
    
    `repartition` is special because we may change the number of outputs if 
this bug is triggered, which is obviously a wrong result.
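
To make the `repartition` concern concrete, here is a toy sketch in plain Python. It is not Spark code: `round_robin` is a hypothetical helper that assigns records to partitions purely by their position in the input. The sketch assumes a retried upstream task sees the same records in a different order, and that one downstream partition read the first attempt while the other reads the retry.

```python
def round_robin(records, num_partitions):
    """Assign each record to a partition based only on its input position."""
    partitions = [[] for _ in range(num_partitions)]
    for position, record in enumerate(records):
        partitions[position % num_partitions].append(record)
    return partitions


# Same upstream data, seen in two different orders by two task attempts.
first_attempt = round_robin(["a", "b", "c", "d"], 2)
retry_attempt = round_robin(["b", "a", "d", "c"], 2)

# Both downstream partitions read the same attempt: every record appears once.
consistent_output = first_attempt[0] + first_attempt[1]

# Downstream partition 0 read the first attempt, partition 1 reads the retry.
mixed_output = first_attempt[0] + retry_attempt[1]

print(sorted(consistent_output))  # ['a', 'b', 'c', 'd']
print(sorted(mixed_output))       # ['a', 'a', 'c', 'c']
```

In this simplified model the mixed read drops `b` and `d` and duplicates `a` and `c`, so the output no longer matches the input under any ordering. That is different from `zip`, where every input ordering yields some defensible pairing.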



