[ 
https://issues.apache.org/jira/browse/ORC-422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16725210#comment-16725210
 ] 

Owen O'Malley commented on ORC-422:
-----------------------------------

After looking at the patch, I think it needs a little more work. In particular, 
the hard cases are things like:

Assuming:

String limit of 4 bytes for ease of reading.

name col1, values: \{"abcdef", "gggggg"}, lower bound: "abcd", upper bound: 
"gggh"
 * col1 > "abcd" -> true
 * col1 = "abcd" -> false
 * col1 <= "abcd" -> false
 * col1 >= "gggh" -> false
 * col1 = "gggh" -> false
 * col1 < "gggh" -> true
 * col1 <= "gggh" -> true

> Predicate push down can return incorrect value when lower/upper bounds are set
> ------------------------------------------------------------------------------
>
>                 Key: ORC-422
>                 URL: https://issues.apache.org/jira/browse/ORC-422
>             Project: ORC
>          Issue Type: Bug
>    Affects Versions: 1.6.0
>            Reporter: Sandeep More
>            Assignee: Sandeep More
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> As part of [ORC-203|https://issues.apache.org/jira/browse/ORC-203] change, 
> Min and/or Max returned by StringStatistics will be null if lower and/or 
> upper bounds are set.
> This can cause predicate push down to return incorrect value.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to