[ 
https://issues.apache.org/jira/browse/IMPALA-3900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Armstrong resolved IMPALA-3900.
-----------------------------------
    Resolution: Duplicate

> Add per-split runtime filtering to HdfsParquetScanner::ProcessSplit()
> ---------------------------------------------------------------------
>
>                 Key: IMPALA-3900
>                 URL: https://issues.apache.org/jira/browse/IMPALA-3900
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Backend
>    Affects Versions: Impala 2.6.0
>            Reporter: Henry Robinson
>            Assignee: Henry Robinson
>            Priority: Minor
>              Labels: runtime-filters
>
> If a partition filter arrives after a footer scan range for a Parquet file has 
> been issued, but before {{HdfsParquetScanner::ProcessSplit()}} runs, there's an 
> opportunity to filter out all the scan ranges that would otherwise be issued 
> when reading that footer, by adding a call to {{ScanRangeIsFilteredOut()}} at 
> the top of that method.
> Care must be taken to ensure that all scan ranges are marked as done, since 
> they won't be processed by their own scanner instances. This will avoid a 
> recurrence of IMPALA-3804.
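
The early-exit check described above could look roughly like the following. This is a minimal standalone sketch, not Impala code: the ScanRange, ParquetFile and PartitionFilter types, both function signatures, and the return value of ProcessSplit() are assumptions made for illustration. Only the names ProcessSplit() and ScanRangeIsFilteredOut() come from the issue text.

```cpp
#include <cassert>
#include <functional>
#include <string>
#include <vector>

// Hypothetical stand-ins for illustration; these are not Impala's real classes.
struct ScanRange {
  bool done = false;  // set once the range no longer needs processing
};

struct ParquetFile {
  std::string partition;                 // partition key value of the file
  ScanRange footer;                      // footer range, already issued
  std::vector<ScanRange> column_ranges;  // ranges that reading the footer would issue
};

// A partition filter returns true if the partition may contain matching rows.
using PartitionFilter = std::function<bool(const std::string&)>;

// Assumed signature: true if any filter that has arrived rejects the partition.
bool ScanRangeIsFilteredOut(const ParquetFile& file,
                            const std::vector<PartitionFilter>& arrived) {
  for (const auto& may_match : arrived) {
    if (!may_match(file.partition)) return true;
  }
  return false;
}

// Toy version of the split-processing step. Returns how many column ranges
// were issued for the file.
int ProcessSplit(ParquetFile& file, const std::vector<PartitionFilter>& arrived) {
  // Check at the top: a filter may have arrived after the footer range was issued.
  if (ScanRangeIsFilteredOut(file, arrived)) {
    // No other scanner instance will ever see these ranges, so every one of
    // them has to be marked done here or the scan would wait on them forever.
    file.footer.done = true;
    for (auto& range : file.column_ranges) range.done = true;
    return 0;
  }
  // Normal path: the footer is consumed and the column ranges are issued
  // for later processing, so they stay not-done for now.
  file.footer.done = true;
  return static_cast<int>(file.column_ranges.size());
}
```

In this sketch, calling ProcessSplit() on a file whose partition is rejected by an arrived filter issues no column ranges and leaves every range flagged as done, while a file that passes the filter has its ranges issued as usual.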


