[
https://issues.apache.org/jira/browse/PIG-1014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12765241#action_12765241
]
Pradeep Kamath commented on PIG-1014:
-------------------------------------
Here are the options we have in my mind:
1) status Quo which is confusing - so not quite an option
2) We disallow anything other than counting a single column with COUNT - So
COUNT(A), COUNT(A.*), COUNT(A.($0,$1)) all would throw an error saying COUNT
does not support this - I this case would COUNT_STAR be the suggestion to
users? It would seem so for COUNT(A) and COUNT(A.*) not so sure for
COUNT(A.($0,$1))
3) Same as 2) but instead of throwing an error, the parser replaces COUNT with
COUNT_STAR - here again we need to decide what we should do with
COUNT(A.($0,$1))
4) Change COUNT to look at all fields in the bag and only if all fields are
null not count that towards the result - In this case, COUNT(A) and
COUNT_STAR(A) will not give same results but we could document this.
> Pig should convert COUNT(relation) to COUNT_STAR(relation) so that all
> records are counted without considering nullness of the fields in the records
> ----------------------------------------------------------------------------------------------------------------------------------------------------
>
> Key: PIG-1014
> URL: https://issues.apache.org/jira/browse/PIG-1014
> Project: Pig
> Issue Type: Bug
> Affects Versions: 0.4.0
> Reporter: Pradeep Kamath
>
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.