[ https://issues.apache.org/jira/browse/ORC-422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16725210#comment-16725210 ]
Owen O'Malley commented on ORC-422:
-----------------------------------
After looking at the patch, I think it needs a little more work. In particular,
the hard cases are things like:
Assuming a string limit of 4 bytes for ease of reading:
column name: col1, values: {"abcdef", "gggggg"}, lower bound: "abcd", upper bound: "gggh"
* col1 > "abcd" -> true
* col1 = "abcd" -> false
* col1 <= "abcd" -> false
* col1 >= "gggh" -> false
* col1 = "gggh" -> false
* col1 < "gggh" -> true
* col1 <= "gggh" -> true
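The cases above are consistent with treating truncated bounds as exclusive: every stored value is strictly greater than the lower bound and strictly less than the upper bound. A minimal Python sketch of that reasoning follows. It is not the ORC implementation; the function name `evaluate` and the "YES"/"NO"/"MAYBE" results are hypothetical, it assumes both bounds were produced by truncation, and it compares Python strings by code point, which matches byte order only for ASCII data such as this example.

```python
def evaluate(op: str, literal: str, lower: str, upper: str) -> str:
    """Evaluate `col <op> literal` using only truncated statistics.

    Assumption (not taken from the ORC code): both bounds are exclusive,
    i.e. every value v in the column satisfies lower < v < upper.
    Returns "YES" if every value matches, "NO" if no value can match,
    and "MAYBE" if the statistics cannot decide.
    """
    # literal <= lower < v, so every value is strictly greater than literal.
    below = literal <= lower
    # v < upper <= literal, so every value is strictly less than literal.
    above = literal >= upper

    if op in (">", ">="):
        return "YES" if below else "NO" if above else "MAYBE"
    if op in ("<", "<="):
        return "YES" if above else "NO" if below else "MAYBE"
    if op == "=":
        return "NO" if below or above else "MAYBE"
    raise ValueError(f"unsupported operator: {op}")


if __name__ == "__main__":
    # The hard cases from the comment, with lower "abcd" and upper "gggh".
    cases = [
        (">", "abcd"), ("=", "abcd"), ("<=", "abcd"),
        (">=", "gggh"), ("=", "gggh"), ("<", "gggh"), ("<=", "gggh"),
    ]
    for op, literal in cases:
        print(f'col1 {op} "{literal}" -> {evaluate(op, literal, "abcd", "gggh")}')
```

Here "YES" corresponds to true and "NO" to false in the list above. A literal that falls strictly between the two bounds yields "MAYBE", because the truncated statistics alone cannot decide it.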
> Predicate push down can return incorrect value when lower/upper bounds are set
> ------------------------------------------------------------------------------
>
> Key: ORC-422
> URL: https://issues.apache.org/jira/browse/ORC-422
> Project: ORC
> Issue Type: Bug
> Affects Versions: 1.6.0
> Reporter: Sandeep More
> Assignee: Sandeep More
> Priority: Major
> Time Spent: 10m
> Remaining Estimate: 0h
>
> As part of [ORC-203|https://issues.apache.org/jira/browse/ORC-203] change,
> Min and/or Max returned by StringStatistics will be null if lower and/or
> upper bounds are set.
> This can cause predicate push down to return incorrect value.