GitHub user rxin opened a pull request:
https://github.com/apache/spark/pull/16622
[SPARK-18917][SQL] Remove schema check in appending data
## What changes were proposed in this pull request?
In append mode, we check whether the schema of the write is compatible with
the schema of the existing data. It can be a significant performance issue in
cloud environment to find the existing schema for files. This patch removes the
check.
Note that for catalog tables, we always do the check, as discussed in
https://github.com/apache/spark/pull/16339#discussion_r96208357
## How was this patch tested?
N/A
Closes #16339.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/rxin/spark SPARK-18917
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/16622.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #16622
----
commit 25272e978c3e5e4e663a9f0d91ae1cc2f4a2a35d
Author: Reynold Xin <[email protected]>
Date: 2017-01-17T18:57:51Z
[SPARK-18917][SQL] Remove schema check in appending data
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]