[
https://issues.apache.org/jira/browse/PARQUET-473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wes McKinney updated PARQUET-473:
---------------------------------
Summary: [C++] Develop external predicate pushdown API for column readers
(was: Develop external predicate pushdown API for column readers)
> [C++] Develop external predicate pushdown API for column readers
> ----------------------------------------------------------------
>
> Key: PARQUET-473
> URL: https://issues.apache.org/jira/browse/PARQUET-473
> Project: Parquet
> Issue Type: New Feature
> Components: parquet-cpp
> Reporter: Wes McKinney
> Priority: Major
>
> This will happen significantly downstream of where we are at right now, but
> we should be planning ahead to facilitate scanning Parquet files with
> externally-defined predicates as a primary use case.
> I suggest that the most general (and high performance) predicate will be
> batch-oriented; i.e. the predicate will be passed a batch of materialized
> values from one or more columns, and it returns an array of booleans
> indicating whether or not the predicate is true. We can also develop a
> row-by-row "scalar" predicate API if users need that.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)