[ 
https://issues.apache.org/jira/browse/HIVE-10331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14495590#comment-14495590
 ] 

Prasanth Jayachandran commented on HIVE-10331:
----------------------------------------------

The original patch had check for hasHashNull() but the default value for 
hasNull was wrong.
https://github.com/apache/hive/blob/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/orc/ColumnStatisticsImpl.java#L819

Can you change the default value to true in constructor and reset() method?

> ORC : Is null SARG filters out all row groups written in old ORC format
> -----------------------------------------------------------------------
>
>                 Key: HIVE-10331
>                 URL: https://issues.apache.org/jira/browse/HIVE-10331
>             Project: Hive
>          Issue Type: Bug
>          Components: Hive
>    Affects Versions: 1.1.0
>            Reporter: Mostafa Mokhtar
>            Assignee: Prasanth Jayachandran
>             Fix For: 1.2.0
>
>         Attachments: HIVE-10331.01.patch
>
>
> Queries are returning wrong results as all row groups gets filtered out and 
> no rows get scanned.
> {code}
> SELECT 
>   count(*)
>     FROM
>         store_sales
>     WHERE
>         ss_addr_sk IS NULL
> {code}
> With hive.optimize.index.filter disabled we get the correct results
> In pickRowGroups stats show that hasNull_ is fales, while the rowgroup 
> actually has null.
> Same query runs fine for newly loaded ORC tables.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to