Github user gaborgsomogyi commented on the issue:
https://github.com/apache/spark/pull/22331
I've taken a look at the things and I think the issue solved in the
mentioned PR but not yet documented. If somebody would like to use the output
directory of a spark application which uses a file sink (with exactly-once),
then it must read the metadata first to get the list of valid files.
Considering these this PR can be closed.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]