[
https://issues.apache.org/jira/browse/HIVE-16029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15884347#comment-15884347
]
Edward Capriolo commented on HIVE-16029:
----------------------------------------
I do not think you should change this now. This will change the behavior of many
applications. I am -1 on the patch in its current form.
I suggest you do this instead:
CollectSet(a) <- original behavior (nulls are dropped)
CollectSet(a,true) <- allow nulls.
This way you get your feature and the result set you want, and existing
applications are not affected.
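
The two-form proposal above can be sketched outside Hive as follows. This is only an illustration of the suggested semantics, not Hive's implementation and not the attached patch: the Python function `collect_set`, its `allow_nulls` parameter, and the use of `None` for SQL NULL are assumptions made for the example. It also keeps first-seen order for readability, which Hive does not promise.

```python
def collect_set(values, allow_nulls=False):
    """Return the distinct values of `values` in first-seen order.

    Hypothetical sketch: None stands in for SQL NULL. With the default
    allow_nulls=False, nulls are skipped (the existing behavior described
    in the issue). With allow_nulls=True, a single null is kept.
    """
    seen = set()
    result = []
    for value in values:
        # Default path: drop nulls so existing callers see no change.
        if value is None and not allow_nulls:
            continue
        # Keep only the first occurrence of each value (including None).
        if value not in seen:
            seen.add(value)
            result.append(value)
    return result


rows = [1, 2, None, 4, None]   # column `a` from the test case in the issue
print(collect_set(rows))        # [1, 2, 4]
print(collect_set(rows, True))  # [1, 2, None, 4]
```

Because the flag defaults to the old behavior, a one-argument call returns the same result as before, and only callers that opt in see nulls in the output.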
> COLLECT_SET and COLLECT_LIST does not return NULL in the result
> ---------------------------------------------------------------
>
> Key: HIVE-16029
> URL: https://issues.apache.org/jira/browse/HIVE-16029
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.1.1
> Reporter: Eric Lin
> Assignee: Eric Lin
> Priority: Minor
> Attachments: HIVE-16029.patch
>
>
> See the test case below:
> {code}
> 0: jdbc:hive2://localhost:10000/default> select * from collect_set_test;
> +---------------------+
> | collect_set_test.a  |
> +---------------------+
> | 1                   |
> | 2                   |
> | NULL                |
> | 4                   |
> | NULL                |
> +---------------------+
> 0: jdbc:hive2://localhost:10000/default> select collect_set(a) from collect_set_test;
> +---------------+
> | _c0           |
> +---------------+
> | [1,2,4]       |
> +---------------+
> {code}
> The correct result should be:
> {code}
> 0: jdbc:hive2://localhost:10000/default> select collect_set(a) from collect_set_test;
> +---------------+
> | _c0           |
> +---------------+
> | [1,2,null,4]  |
> +---------------+
> {code}