My apologies if this is the wrong mailing list for this question.

I've started playing around with Pig and Hadoop, with the intention of using them to analyze a collection of MySQL slow query log files. I am not a Java programmer (I've been using PHP for a very long time and have dabbled in other languages as required), so I am slightly intimidated by the Pig documentation on writing your own UDFs.
If anyone has done anything like this, I would appreciate some tips and some pointers on how to approach it. Sure, I could hunker down and learn to use some CLI tools for analyzing the slow query log, but then I couldn't use Pig and Hadoop. ;)

-- Chris Hartjes
