[ 
https://issues.apache.org/jira/browse/IMPALA-10898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Quanlong Huang updated IMPALA-10898:
------------------------------------
    Attachment: TPCDS-42-orc-snap-IN-list-filter-performance-result.txt

> Runtime IN-list filters for ORC tables
> --------------------------------------
>
>                 Key: IMPALA-10898
>                 URL: https://issues.apache.org/jira/browse/IMPALA-10898
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Backend
>            Reporter: Quanlong Huang
>            Assignee: Quanlong Huang
>            Priority: Critical
>         Attachments: TPCDS-42-orc-snap-IN-list-filter-performance-result.txt
>
>
> Currently Impala has two kinds of runtime filters: bloom filter and min-max 
> filter. Unfortunately they can't leverage the bloom filters in ORC files. 
> Only EQUALS and IN-list 
> predicates can leverage them to skip unrelated ORC RowGroups.
> This JIRA aims to add runtime IN-list filters for small build side (e.g. 
> #rows <= 1024) of a hash join.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to