[ https://issues.apache.org/jira/browse/HIVE-10331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14495590#comment-14495590 ]
Prasanth Jayachandran commented on HIVE-10331:
----------------------------------------------
The original patch had a check for hasHasNull(), but the default value for
hasNull was wrong.
https://github.com/apache/hive/blob/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/orc/ColumnStatisticsImpl.java#L819
Can you change the default value to true in the constructor and the reset() method?
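
For illustration only, here is a minimal sketch of the idea, assuming a simplified stand-in class rather than the real ColumnStatisticsImpl. The names ColumnStatsSketch, fromSerialized and canSkipForIsNull are hypothetical, and the Optional argument stands in for a serialized field that old files may not carry. The point is that when the flag is absent, hasNull stays at the conservative default of true, so an IS NULL predicate cannot prune the row group.

```java
import java.util.Optional;

/**
 * Simplified stand-in for ORC column statistics.
 * This is NOT the real ColumnStatisticsImpl; all names here are hypothetical.
 */
class ColumnStatsSketch {
    // Conservative default: assume nulls may be present until told otherwise.
    private boolean hasNull = true;

    ColumnStatsSketch() {
    }

    /**
     * Builds statistics from a serialized flag. The Optional is empty for
     * files written before the flag existed (an assumption of this sketch).
     */
    static ColumnStatsSketch fromSerialized(Optional<Boolean> serializedHasNull) {
        ColumnStatsSketch stats = new ColumnStatsSketch();
        // Override the default only when the writer actually recorded the flag.
        serializedHasNull.ifPresent(value -> stats.hasNull = value);
        return stats;
    }

    /** Restores the same conservative default as the constructor. */
    void reset() {
        hasNull = true;
    }

    boolean hasNull() {
        return hasNull;
    }

    /** An IS NULL predicate may skip a row group only if it provably has no nulls. */
    static boolean canSkipForIsNull(ColumnStatsSketch stats) {
        return !stats.hasNull();
    }
}
```

With this default, statistics read from an old-format file (flag absent) keep hasNull as true, so the row group is still scanned, while a file that explicitly records hasNull as false can still be skipped.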
> ORC : Is null SARG filters out all row groups written in old ORC format
> -----------------------------------------------------------------------
>
> Key: HIVE-10331
> URL: https://issues.apache.org/jira/browse/HIVE-10331
> Project: Hive
> Issue Type: Bug
> Components: Hive
> Affects Versions: 1.1.0
> Reporter: Mostafa Mokhtar
> Assignee: Prasanth Jayachandran
> Fix For: 1.2.0
>
> Attachments: HIVE-10331.01.patch
>
>
> Queries are returning wrong results because all row groups get filtered out and
> no rows get scanned.
> {code}
> SELECT
> count(*)
> FROM
> store_sales
> WHERE
> ss_addr_sk IS NULL
> {code}
> With hive.optimize.index.filter disabled we get the correct results.
> In pickRowGroups the stats show that hasNull_ is false, while the row group
> actually has nulls.
> Same query runs fine for newly loaded ORC tables.