Hi Ruslan -- no need to write your own UDF. There is a built-in function TOP() which will extract for you the top N tuples of a relation, where N is a configurable parameter. Take a look at:
http://pig.apache.org/docs/r0.9.0/func.html#topx Norbert On Thu, Sep 8, 2011 at 9:13 AM, Ruslan Al-Fakikh <[email protected]> wrote: > Hey guys, > > How can I LIMIT a relation by percentage? > What I need is to sort a relation by a numeric column and then take > top 5% of tuples. > As far as I understand I cannot use an expression in the LIMIT > operator. Do I have to write my own UDF? What type of UDF should I use > then? > > -- > Best Regards, > Ruslan Al-Fakikh >
