Ryan Blue created PARQUET-645:
---------------------------------
Summary: DictionaryFilter incorrectly handles null
Key: PARQUET-645
URL: https://issues.apache.org/jira/browse/PARQUET-645
Project: Parquet
Issue Type: Bug
Components: parquet-mr
Affects Versions: 1.9.0
Reporter: Ryan Blue
Assignee: Ryan Blue
Fix For: 1.9.0
DictionaryFilter checks whether a column can match a query and filters out row
groups that can't match. Equality checks don't currently handle null correctly,
which is never in the dictionary and is encoded by the definition level. This
is causing row groups to be filtered when they should not be because "col is
null" is always true.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)