[GitHub] spark issue #21698: [SPARK-23243][Core] Fix RDD.repartition() data correctne...

mridulm Wed, 11 Jul 2018 10:40:25 -0700

Github user mridulm commented on the issue:

    https://github.com/apache/spark/pull/21698
  
    @jiangxb1987 Different number of output rows is due to data loss - it is 
not another valid run.
    A complete re-execution of the job in this case could result in a different 
ordering, but consistent output characterstics (number of rows for example).



---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark issue #21698: [SPARK-23243][Core] Fix RDD.repartition() data correctne...

Reply via email to