[
https://issues.apache.org/jira/browse/PIG-1014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12766069#action_12766069
]
Dmitriy V. Ryaboy commented on PIG-1014:
----------------------------------------
A link that talks about some of the more "interesting" behaviors of NULL in
SQL: http://thoughts.j-davis.com/2009/08/02/what-is-the-deal-with-nulls/
The difference between COUNT and COUNT_STAR is that COUNT_STAR counts nulls. I
think this ticket boils down to the question, "what do we consider a null
tuple?". At the moment, we consider A.$0 to determine whether the tuple is
null; that doesn't seem right, and surprises users. We have two options that
both make sense -- a null tuple is a tuple in which all fields are null, or a
null tuple is a tuple which is completely null (ie, doesn't even have any
fields). I am in favor of the first definition, which is a superset of the
second.
> Pig should convert COUNT(relation) to COUNT_STAR(relation) so that all
> records are counted without considering nullness of the fields in the records
> ----------------------------------------------------------------------------------------------------------------------------------------------------
>
> Key: PIG-1014
> URL: https://issues.apache.org/jira/browse/PIG-1014
> Project: Pig
> Issue Type: Bug
> Affects Versions: 0.4.0
> Reporter: Pradeep Kamath
>
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.