[ 
https://issues.apache.org/jira/browse/SPARK-16152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dongjoon Hyun closed SPARK-16152.
---------------------------------
    Resolution: Invalid

Hi, [~fushar]. 
This seems to be a SQL question. [~kevinyu98] is right. Spark/PostgreSQL/MySQL 
are consistent with this. `NULL IN (NULL)` is NULL. Please run the following 
query. The result is also `TRUE` for the above SQL engines.
{code}
SELECT (NULL IN (NULL)) IS NULL
{code}

> `In` predicate does not work with null values
> ---------------------------------------------
>
>                 Key: SPARK-16152
>                 URL: https://issues.apache.org/jira/browse/SPARK-16152
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 1.6.1
>            Reporter: Ashar Fuadi
>
> According to 
> https://github.com/apache/spark/blob/v1.6.1/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/predicates.scala#L134..L136:
> {code}
>  override def eval(input: InternalRow): Any = {
>     val evaluatedValue = value.eval(input)
>     if (evaluatedValue == null) {
>       null
>     } else {
>       ...
> {code}
> we always return {{null}} when the current value is null, ignoring the 
> elements of {{list}}. Therefore, we cannot have a predicate which tests 
> whether a column contains values in e.g. {{[1, 2, 3, null]}}
> Is this a bug, or is this actually the expected behavior?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to