[ 
https://issues.apache.org/jira/browse/PIG-2353?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13173431#comment-13173431
 ] 

Ashutosh Chauhan commented on PIG-2353:
---------------------------------------

I have also been thinking about how to implement statistical measures (such as 
top-K, median, and quantiles) efficiently in a distributed manner that is 
amenable to the MR framework. Rank is the basis for these. I came up with an 
outline similar to yours; you have laid it out well. I think this would be very 
useful to have in Pig, and it is the kind of feature that a higher-level 
language like Pig should make available to its users. Sophisticated users will 
expect it, and it will drive adoption.
+1 for distributed implementation of RANK in Pig.
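
To illustrate why rank is a useful basis for these measures, here is a minimal 
sketch of one common two-pass scheme for ranking globally sorted, partitioned 
data: count the tuples in each partition, turn the counts into starting offsets 
with a prefix sum, then add each tuple's local position to its partition's 
offset. This is an assumption for illustration, not the approach in the attached 
patch. The function names are hypothetical, and plain Python lists stand in for 
the sorted partitions; nothing here is Pig API.

```python
from itertools import accumulate


def partition_counts(partitions):
    """Pass 1: each partition reports how many tuples it holds."""
    return [len(part) for part in partitions]


def partition_offsets(counts):
    """Exclusive prefix sum: number of tuples that precede each partition."""
    return [0] + list(accumulate(counts))[:-1]


def assign_ranks(partitions):
    """Pass 2: rank = partition offset + local position (1-based, no gaps).

    Assumes the partitions are already globally sorted, i.e. every tuple in
    partition i sorts before every tuple in partition i + 1.
    """
    offsets = partition_offsets(partition_counts(partitions))
    return [
        [(offset + i + 1, tup) for i, tup in enumerate(part)]
        for offset, part in zip(offsets, partitions)
    ]


def select_by_rank(ranked_partitions, k):
    """Return the tuple whose rank is k, or None if no such rank exists."""
    for part in ranked_partitions:
        for rank, tup in part:
            if rank == k:
                return tup
    return None


def lower_median(ranked_partitions):
    """Median as a rank lookup: the tuple at rank (n + 1) // 2."""
    total = sum(len(part) for part in ranked_partitions)
    return select_by_rank(ranked_partitions, (total + 1) // 2)
```

With ranks in place, the median and other quantiles reduce to looking up a 
specific rank, and top-K reduces to keeping the tuples with rank at most K.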
                
> RANK function like in SQL
> -------------------------
>
>                 Key: PIG-2353
>                 URL: https://issues.apache.org/jira/browse/PIG-2353
>             Project: Pig
>          Issue Type: New Feature
>            Reporter: Gianmarco De Francisci Morales
>         Attachments: PIG2353.patch
>
>
> Implement a function that given a (sorted) bag adds to each tuple a unique, 
> increasing identifier without gaps, like what RANK does for SQL.
