Pradeep Kamath commented on PIG-1014:

Here are the options we have in my mind:
1) status Quo which is confusing - so not quite an option
2) We disallow anything other than counting a single column with COUNT - So 
COUNT(A), COUNT(A.*), COUNT(A.($0,$1)) all would throw an error saying COUNT 
does not support this - I this case would COUNT_STAR be the suggestion to 
users? It would seem so for COUNT(A) and COUNT(A.*) not so sure for 
3) Same as 2) but instead of throwing an error, the parser replaces COUNT with 
COUNT_STAR - here again we need to decide what we should do with 
4) Change COUNT to look at all fields in the bag and only if all fields are 
null not count that towards the result - In this case, COUNT(A) and 
COUNT_STAR(A) will not give same results but we could document this.

> Pig should convert COUNT(relation) to COUNT_STAR(relation) so that all 
> records are counted without considering nullness of the fields in the records
> ----------------------------------------------------------------------------------------------------------------------------------------------------
>                 Key: PIG-1014
>                 URL: https://issues.apache.org/jira/browse/PIG-1014
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.4.0
>            Reporter: Pradeep Kamath

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

Reply via email to