Github user mridulm commented on the issue:
https://github.com/apache/spark/pull/21698
@jiangxb1987 Different number of output rows is due to data loss - it is
not another valid run.
A complete re-execution of the job in this case could result in a different
ordering, but consistent output characterstics (number of rows for example).
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]