GitHub user yhuai opened a pull request:
https://github.com/apache/spark/pull/6964
[SPSARK-8578] [SQL] Should ignore user defined output committer when
appending data
https://issues.apache.org/jira/browse/SPARK-8578
It is not very safe to use a custom output committer when append data to an
existing dir. This changes adds the logic to check if we are appending data,
and if so, we use the output committer associated with the file output format.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/yhuai/spark SPARK-8578
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/6964.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #6964
----
commit d55652efd41670c1cef3d812e012226c80423749
Author: Yin Huai <[email protected]>
Date: 2015-06-23T22:38:14Z
Test.
commit d2e9cce7e787abe355461a148d1f65d8e87eb87e
Author: Yin Huai <[email protected]>
Date: 2015-06-23T23:09:08Z
Do not use custom output commiter when append data.
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]