[ 
https://issues.apache.org/jira/browse/PARQUET-473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney updated PARQUET-473:
---------------------------------
    Summary: [C++] Develop external predicate pushdown API for column readers  
(was: Develop external predicate pushdown API for column readers)

> [C++] Develop external predicate pushdown API for column readers
> ----------------------------------------------------------------
>
>                 Key: PARQUET-473
>                 URL: https://issues.apache.org/jira/browse/PARQUET-473
>             Project: Parquet
>          Issue Type: New Feature
>          Components: parquet-cpp
>            Reporter: Wes McKinney
>            Priority: Major
>
> This will happen significantly downstream of where we are at right now, but 
> we should be planning ahead to facilitate scanning Parquet files with 
> externally-defined predicates as a primary use case. 
> I suggest that the most general (and high performance) predicate will be 
> batch-oriented; i.e. the predicate will be passed a batch of materialized 
> values from one or more columns, and it returns an array of booleans 
> indicating whether or not the predicate is true. We can also develop a 
> row-by-row "scalar" predicate API if users need that. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to