GitHub user gatorsmile opened a pull request:
https://github.com/apache/spark/pull/10344
[SPARK-12218] [SQL] [Backport-1.5] Fixed the Parquet's filter generation
rule when `Not` is included in Parquet filter pushdown
Added a test case that exposes possible data loss in the following scenario:
when a filter contains the operator `Not`, the current generation rule for
Parquet filters simply applies `Not` to all of the underlying filters.
Note: the fix will be submitted after this test case is shown to fail.
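
To illustrate how applying `Not` blindly can drop rows, here is a minimal
sketch in Python. It is a toy model, not Spark or Parquet code: the classes
`Pred`, `And`, `Not` and the functions `naive_pushdown` and `scan` are
hypothetical names invented for this example. It assumes that the converter
keeps only the convertible side of an `And` (safe by itself, since the result
is a weaker filter) and then wraps whatever it got in `Not` (unsafe, since
negating a weaker filter gives a stronger one).

```python
from dataclasses import dataclass
from typing import Callable, Optional, Union

Row = dict


@dataclass(frozen=True)
class Pred:
    """A leaf predicate; `convertible` marks whether the source can evaluate it."""
    name: str
    test: Callable[[Row], bool]
    convertible: bool = True


@dataclass(frozen=True)
class And:
    left: "Filter"
    right: "Filter"


@dataclass(frozen=True)
class Not:
    child: "Filter"


Filter = Union[Pred, And, Not]


def evaluate(f: Filter, row: Row) -> bool:
    """Evaluate a filter tree against a single row."""
    if isinstance(f, Pred):
        return f.test(row)
    if isinstance(f, And):
        return evaluate(f.left, row) and evaluate(f.right, row)
    return not evaluate(f.child, row)


def naive_pushdown(f: Filter) -> Optional[Filter]:
    """Hypothetical converter: And keeps whichever sides convert,
    and Not simply wraps the converted child."""
    if isinstance(f, Pred):
        return f if f.convertible else None
    if isinstance(f, And):
        left, right = naive_pushdown(f.left), naive_pushdown(f.right)
        if left is not None and right is not None:
            return And(left, right)
        return left if left is not None else right
    child = naive_pushdown(f.child)
    return Not(child) if child is not None else None


def scan(rows, query: Filter, pushed: Optional[Filter]):
    """The source applies the pushed filter first; the engine then
    re-applies the full query to whatever rows survive."""
    survivors = [r for r in rows if pushed is None or evaluate(pushed, r)]
    return [r for r in survivors if evaluate(query, r)]


rows = [{"a": 2, "b": "1"}, {"a": 2, "b": "2"}, {"a": 3, "b": "1"}]

a_eq_2 = Pred("a = 2", lambda r: r["a"] == 2)
# Assumption for this sketch: the source cannot convert this predicate.
b_in_1 = Pred("b IN ('1')", lambda r: r["b"] in ("1",), convertible=False)

query = Not(And(a_eq_2, b_in_1))

expected = [r for r in rows if evaluate(query, r)]
pushed = naive_pushdown(query)          # becomes Not(a = 2)
actual = scan(rows, query, pushed)

lost = [r for r in expected if r not in actual]
```

In this model the pushed filter degenerates to `Not(a = 2)`, so the row with
a = 2 and b = '2' is discarded at the source even though it satisfies the
original query, and the later re-check cannot bring it back.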
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/gatorsmile/spark parquetPushDown
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/10344.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #10344
----
commit 660eef52fca74b7ef61d5bd0784df62c126a4a51
Author: gatorsmile <[email protected]>
Date: 2015-12-17T03:16:38Z
added a test case that could cause data loss.
----