Cheng Lian created SPARK-4493:
---------------------------------
Summary: Don't pushdown Eq, NotEq, Lt, LtEq, Gt and GtEq
predicates with nulls for Parquet
Key: SPARK-4493
URL: https://issues.apache.org/jira/browse/SPARK-4493
Project: Spark
Issue Type: Bug
Components: SQL
Affects Versions: 1.1.0
Reporter: Cheng Lian
Predicates like {{a = NULL}} and {{a < NULL}} can't be pushed down since
Parquet {{Operators.ColumnFilterPredicate\[_\]}} doesn't accept null value.
However, normally this issue doesn't cause NPE because any value compared to
{{NULL}} results {{NULL}}, and Spark SQL automatically optimizes out {{NULL}}
predicate in the {{SimplifyFilters}} rule. Only testing code that intentionally
disables the optimizer may trigger this issue. (That's why this issue is not
marked as blocker).
On the other hand, {{a IS NULL}} and {{a IS NOT NULL}} should be pushed down.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]