GitHub user liancheng opened a pull request:
https://github.com/apache/spark/pull/863
[SPARK-1913][SQL] Bug fix: column pruning error in Parquet support
JIRA issue: [SPARK-1913](https://issues.apache.org/jira/browse/SPARK-1913)
When scanning Parquet tables, attributes referenced only in predicates that
are pushed down are not passed to the `ParquetTableScan` operator and causes
exception.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/liancheng/spark spark-1913
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/863.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #863
----
commit ae60ab38e729c977cac83e956c553e53254dd01f
Author: Cheng Lian <[email protected]>
Date: 2014-05-23T10:57:29Z
[SPARK-1913] Attributes referenced only in predicates pushed down should
remain in ParquetTableScan operator
commit f5b257dc7830b663f1a52c37371e0821da0a684b
Author: Cheng Lian <[email protected]>
Date: 2014-05-23T11:25:26Z
Added back comments deleted by mistake
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---