Hi Chris,

For simple analysis, the built-in functions that ship with Pig (see the Pig Latin and cookbook docs) should be sufficient. UDFs only come into the picture if you need something customized, and even then most of your needs can be met directly from the piggybank.
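To give a rough sense of scale in case you do end up needing something custom: the Java involved can be quite small. Below is a minimal sketch of the kind of parsing logic such a function might wrap. It is plain Java with no Pig dependency, so you can try it on its own. The class name SlowLogLine and the method queryTime are made up for illustration, and the header format it matches ("# Query_time: ...") is an assumption about what your slow query log lines look like. In an actual UDF, logic like this would typically sit inside the exec method of a class extending Pig's EvalFunc.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Pig or the piggybank.
public class SlowLogLine {
    // Assumed header format (check against your own logs):
    //   # Query_time: 2.5  Lock_time: 0.1  Rows_sent: 1  Rows_examined: 1000
    private static final Pattern QUERY_TIME =
        Pattern.compile("^#\\s*Query_time:\\s*([0-9]+(?:\\.[0-9]+)?)");

    /** Returns the query time in seconds, or null if the line is not a Query_time header. */
    public static Double queryTime(String line) {
        if (line == null) {
            return null;
        }
        Matcher m = QUERY_TIME.matcher(line);
        return m.find() ? Double.valueOf(m.group(1)) : null;
    }

    public static void main(String[] args) {
        // A header line yields its query time; any other line yields null.
        System.out.println(queryTime("# Query_time: 2.5  Lock_time: 0.1  Rows_sent: 1  Rows_examined: 1000"));
        System.out.println(queryTime("SELECT * FROM users;"));
    }
}
```

Returning null for lines that do not match keeps the function usable on every line of the log, so non-header lines can simply be filtered out afterwards.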
If you need to write one, please look into http://hadoop.apache.org/pig/docs/r0.5.0/udf.html. It has some sample UDFs and usage.

Cheers,
/R

On 1/11/10 11:59 PM, "Chris Hartjes" <[email protected]> wrote:

> My apologies if this is the wrong mailing list to ask this question.
>
> I've started playing around with Pig and Hadoop, with the intention of using it to do some analysis of a collection of MySQL slow query log files. I am not a Java programmer (been using PHP for a very long time, dabbled in other languages as required), so I am slightly intimidated by the documentation in Pig for writing your own UDFs.
>
> If anyone has done anything like this, I would appreciate some tips and some pointers on how to approach it. Sure, I could hunker down and learn to use some CLI tools for analyzing the slow query log, but then I couldn't use Pig and Hadoop. ;)
>
> --
> Chris Hartjes
