Hi, I have a large dataset consisting mainly of URLs and their HTML contents. Given a regular expression, I want to get 'x' URLs that match the pattern. I have written a UDF that filters URLs by regular expression. Is there a way in a Pig script to limit the number of results to 'x'? ('x' is a configurable value.)
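To make the question concrete, here is a minimal sketch of the kind of script in question, assuming Pig's built-in LIMIT operator and command-line parameter substitution are suitable here. The parameter names ($input, $output, $regex, $x), the tab-separated two-column layout, and the use of the built-in MATCHES operator in place of the custom UDF are all assumptions for illustration, not the actual script.

```pig
-- Sketch only: $input, $output, $regex and $x are hypothetical parameters
-- supplied on the command line; the schema below is assumed.
pages = LOAD '$input' USING PigStorage('\t') AS (url:chararray, html:chararray);

-- Stand-in for the custom UDF: MATCHES is Pig's built-in regex operator
-- and requires the pattern to match the entire URL string.
matched = FILTER pages BY url MATCHES '$regex';

-- Keep at most $x matching rows. Without a preceding ORDER BY,
-- which rows are kept is not guaranteed.
limited = LIMIT matched $x;

STORE limited INTO '$output';
```

Under these assumptions the script could be invoked as `pig -param x=100 -param regex='.*example.*' -param input=pages.tsv -param output=out script.pig`, so 'x' stays configurable without editing the script.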
Thanks, Souri
