[ 
https://issues.apache.org/jira/browse/IMPALA-10873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17402965#comment-17402965
 ] 

Quanlong Huang commented on IMPALA-10873:
-----------------------------------------

Note that only EQUALS and IN-list predicates will be evaluated on ORC file's 
bloom filters:
{code:cpp}
  static bool shouldEvaluateBloomFilter(PredicateLeaf::Operator op,
                                        TruthValue result,
                                        const BloomFilter * bloomFilter) {
    // evaluate bloom filter only when
    // 1) Bloom filter is available
    // 2) Min/Max evaluation yield YES or MAYBE
    // 3) Predicate is EQUALS or IN list
    // 4) Decimal type stores its string representation
    //    but has inconsistency in trailing zeros
{code}
IMPALA-6505 only pushes down non-equal binary predicates, e.g. <, <=, >, etc. 
Apart from the IN-list predicates, we can also push down EQUALS predicates for 
ORC to make good use of its bloom filters.

> Push down IN-list predicate to ORC reader
> -----------------------------------------
>
>                 Key: IMPALA-10873
>                 URL: https://issues.apache.org/jira/browse/IMPALA-10873
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Backend
>            Reporter: Quanlong Huang
>            Priority: Major
>
> IMPALA-6505 pushs down the min-max predicates into the ORC reader. Since 
> ORC's SearchArguments also support IN-list predicates, we can consider 
> pushing down IN-list and not IN-list predicates into it.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to