[ https://issues.apache.org/jira/browse/PIG-2353?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13173431#comment-13173431 ]
Ashutosh Chauhan commented on PIG-2353:
---------------------------------------
I was also thinking about the problem of implementing statistical measures
(such as top-K, median, and quantiles) efficiently in a distributed manner
that is amenable to the MR framework. Rank is a basis for them. I came up with
an outline similar to yours; you have laid it out well. I think this would be
pretty useful to have in Pig, and these are the kind of features that a
higher-level language like Pig should make available to its users.
Sophisticated users will expect this, and it will drive adoption.
+1 for distributed implementation of RANK in Pig.
> RANK function like in SQL
> -------------------------
>
> Key: PIG-2353
> URL: https://issues.apache.org/jira/browse/PIG-2353
> Project: Pig
> Issue Type: New Feature
> Reporter: Gianmarco De Francisci Morales
> Attachments: PIG2353.patch
>
>
> Implement a function that given a (sorted) bag adds to each tuple a unique,
> increasing identifier without gaps, like what RANK does for SQL.
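
To make the requested behavior concrete, here is a minimal sketch of one common way to assign unique, gapless identifiers over sorted data that is split across partitions. It is not taken from PIG2353.patch and may differ from the outline discussed in the issue. The function name rank_partitions and the list-of-lists input are hypothetical stand-ins for the sorted output splits of an MR job. Since the identifiers are unique and gapless, this behaves more like SQL's ROW_NUMBER than RANK, which repeats values on ties.

```python
from itertools import accumulate


def rank_partitions(partitions):
    """Prepend a gapless, 1-based id to every tuple.

    `partitions` (hypothetical input shape) is a list of lists of tuples,
    already globally sorted: every tuple in partition k precedes every
    tuple in partition k + 1.
    """
    # Pass 1: count tuples per partition (one small number per split).
    counts = [len(part) for part in partitions]
    # Exclusive prefix sum: number of tuples that precede each partition.
    offsets = [0] + list(accumulate(counts))[:-1]
    # Pass 2: each partition numbers its tuples locally and adds its offset.
    return [
        [(offset + i,) + t for i, t in enumerate(part, start=1)]
        for part, offset in zip(partitions, offsets)
    ]


parts = [[("a",), ("b",)], [], [("c",)], [("d",), ("e",)]]
ranked = rank_partitions(parts)
```

Only the per-partition counts need to be gathered in one place, so the second pass can run independently on each partition.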