Chris Hartjes wrote:
My apologies if this is the wrong mailing list to ask this question.  I've
started playing around with Pig and Hadoop, with the intention of using it
to do some analysis of a collection of MySQL slow query log files.  I am not
a Java programmer (been using PHP for a very long time, dabbled in other
languages as required) so I am slightly intimidated by the documentation in
Pig for writing your own UDFs.

If anyone has done anything like this, I would appreciate some tips and some
pointers on how to approach it.  Sure, I could hunker down and learn to use
some CLI tools for analyzing the slow query log, but then I couldn't use Pig
and Hadoop. ;)


Until you are comfortable with Pig UDFs, you could use streaming and work in whatever language you are comfortable with. There may be some things you cannot do that way (load/store, for example), but pretty much everything else should be possible.
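
To make the streaming suggestion concrete, here is a rough sketch of the kind of script Pig could stream log lines through. It is in Python purely for illustration; the same idea works in PHP or anything else that reads stdin and writes stdout. The function name, the output column order, and the exact header layout matched by the regex are my assumptions (the slow query log format varies between MySQL versions), so treat it as a starting point and not a tested parser for your logs.

```python
#!/usr/bin/env python
"""Turn MySQL slow query log lines on stdin into tab-separated records."""
import re
import sys

# Assumed header layout; adjust for the MySQL version that wrote the log.
HEADER = re.compile(
    r"^# Query_time:\s*(?P<query_time>[\d.]+)\s+"
    r"Lock_time:\s*(?P<lock_time>[\d.]+)\s+"
    r"Rows_sent:\s*(?P<rows_sent>\d+)\s+"
    r"Rows_examined:\s*(?P<rows_examined>\d+)"
)


def parse_slow_log(lines):
    """Yield (query_time, lock_time, rows_sent, rows_examined, sql) tuples."""
    stats = None
    sql_parts = []
    for raw in lines:
        line = raw.strip()
        if line.startswith("#"):
            # Any comment line ends the previous entry, if one is pending.
            if stats and sql_parts:
                yield stats + (" ".join(sql_parts),)
            match = HEADER.match(line)
            stats = match.groups() if match else None
            sql_parts = []
        elif stats is not None and line and not line.startswith("SET timestamp="):
            # Collapse whitespace so the SQL never contains a tab or newline.
            sql_parts.append(" ".join(line.split()))
    if stats and sql_parts:
        yield stats + (" ".join(sql_parts),)


if __name__ == "__main__":
    for row in parse_slow_log(sys.stdin):
        print("\t".join(row))
```

From Pig, a script like this would be hooked in with the STREAM ... THROUGH operator, after which the tab-separated fields can be grouped and aggregated with ordinary Pig Latin. One caveat: a log entry spans several lines, so an entry that straddles an input split boundary may be cut in two; this sketch does not handle that case.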


Regards,
Mridul
