[ https://issues.apache.org/jira/browse/ASTERIXDB-3575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17944636#comment-17944636 ]
ASF subversion and git services commented on ASTERIXDB-3575:
------------------------------------------------------------
Commit e30e9029fb605475d0b925f7d0798b7322da6a37 in asterixdb's branch
refs/heads/master from Peeyush Gupta
[ https://gitbox.apache.org/repos/asf?p=asterixdb.git;h=e30e9029fb ]
[ASTERIXDB-3575][EXT] Pushdown predicates for Parquet external datasets to
filter row groups
- user model changes: no
- storage format changes: no
- interface changes: yes
Ext-ref: MB-65316
Change-Id: I2c3214e2a351252fb1929aa1562cbab2d67fa9aa
Reviewed-on: https://asterix-gerrit.ics.uci.edu/c/asterixdb/+/19633
Integration-Tests: Jenkins <[email protected]>
Tested-by: Jenkins <[email protected]>
Reviewed-by: Peeyush Gupta <[email protected]>
Reviewed-by: Ali Alsuliman <[email protected]>
> Pushdown predicates for Parquet external datasets to filter row groups
> ----------------------------------------------------------------------
>
> Key: ASTERIXDB-3575
> URL: https://issues.apache.org/jira/browse/ASTERIXDB-3575
> Project: Apache AsterixDB
> Issue Type: Improvement
> Components: EXT - External data
> Reporter: Peeyush Gupta
> Assignee: Peeyush Gupta
> Priority: Major
> Labels: triaged
>
> Parquet files store min/max statistics for each column in each row group. This
> information can be used to skip row groups that cannot satisfy the predicates
> in the query.
> We should push down predicates to the external data scan for Parquet so that
> such row groups are filtered out.
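
The row-group filtering idea described above can be sketched in a few lines. This is only an illustration of min/max pruning in general, not the code from the commit: the names ColumnStats, RowGroup, may_match and prune_row_groups are hypothetical, the statistics are hard-coded integers rather than read from a Parquet footer, and only simple comparison predicates on a single column are handled.

```python
from dataclasses import dataclass


@dataclass
class ColumnStats:
    """Hypothetical stand-in for per-column min/max statistics of a row group."""
    min_value: int
    max_value: int


@dataclass
class RowGroup:
    """Hypothetical row group: an index plus statistics keyed by column name."""
    index: int
    stats: dict


def may_match(stats, op, value):
    """Return False only when [min, max] proves no row satisfies `column op value`."""
    if op == "=":
        return stats.min_value <= value <= stats.max_value
    if op == "<":
        return stats.min_value < value
    if op == "<=":
        return stats.min_value <= value
    if op == ">":
        return stats.max_value > value
    if op == ">=":
        return stats.max_value >= value
    # Unknown operator: keep the row group rather than risk dropping matches.
    return True


def prune_row_groups(row_groups, column, op, value):
    """Return the indexes of row groups that still have to be read."""
    kept = []
    for rg in row_groups:
        stats = rg.stats.get(column)
        # Missing statistics prove nothing, so the row group must be kept.
        if stats is None or may_match(stats, op, value):
            kept.append(rg.index)
    return kept


row_groups = [
    RowGroup(0, {"age": ColumnStats(1, 20)}),
    RowGroup(1, {"age": ColumnStats(18, 45)}),
    RowGroup(2, {"age": ColumnStats(50, 90)}),
]

print(prune_row_groups(row_groups, "age", ">", 40))
```

Pruning of this kind is conservative: a kept row group may still contain no matching rows, so the predicate has to be re-evaluated on the rows that are actually read.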
--
This message was sent by Atlassian Jira
(v8.20.10#820010)