Johannes Rußek
Wed, 17 Mar 2010 15:51:57 -0700
Hi Dmitriy! On 17.03.2010 18:46, Dmitriy Ryaboy wrote:
The recipe on the cloudera blog works. It's made a little extra complex for educational purposes -- you should just put the geoip db on all the nodes so that you don't have to package it up and ship every time.
I just want to clarify, you do mean this: http://www.cloudera.com/blog/2009/06/analyzing-apache-logs-with-pig/ right?
Unfortunately, that recipe uses streaming to do the lookup. However, I found this: http://www.data-miners.com/blog/2009/12/hadoop-and-mapreduce-what-country-is-ip.html which seems interesting. Is getting this into a UDF just a matter of wrapping it?
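To make the question concrete, here is a rough sketch of the kind of lookup I would expect to wrap. It is not taken from either blog post: the class name IpCountryLookup, its methods, and the sample range are all made up for illustration, and it assumes the data is a list of IPv4 ranges with a country code each. It uses only the JDK, so it runs without Pig or a GeoIP database.

```java
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper, not from the Cloudera or data-miners posts.
public class IpCountryLookup {

    // End of a range plus the country code assigned to it.
    private static final class Range {
        final long end;
        final String country;

        Range(long end, String country) {
            this.end = end;
            this.country = country;
        }
    }

    // Keyed by the numeric start address of each range.
    private final TreeMap<Long, Range> ranges = new TreeMap<Long, Range>();

    // Converts a dotted-quad IPv4 address into its 32-bit value held in a long.
    static long toLong(String ip) {
        String[] parts = ip.trim().split("\\.");
        if (parts.length != 4) {
            throw new IllegalArgumentException("not an IPv4 address: " + ip);
        }
        long value = 0;
        for (String part : parts) {
            int octet = Integer.parseInt(part);
            if (octet < 0 || octet > 255) {
                throw new IllegalArgumentException("bad octet in: " + ip);
            }
            value = (value << 8) | octet;
        }
        return value;
    }

    // Registers an inclusive range; ranges are assumed not to overlap.
    public void addRange(String first, String last, String country) {
        ranges.put(toLong(first), new Range(toLong(last), country));
    }

    // Returns the country code for the address, or null if no range covers it.
    public String lookup(String ip) {
        long address = toLong(ip);
        Map.Entry<Long, Range> entry = ranges.floorEntry(address);
        if (entry == null || address > entry.getValue().end) {
            return null;
        }
        return entry.getValue().country;
    }

    public static void main(String[] args) {
        IpCountryLookup lookup = new IpCountryLookup();
        // Made-up sample range; real data would come from the GeoIP file.
        lookup.addRange("192.0.2.0", "192.0.2.255", "XX");
        System.out.println(lookup.lookup("192.0.2.17"));
    }
}
```

If I understand Pig UDFs correctly, the wrapping would then be a class extending EvalFunc<String> whose exec(Tuple) pulls the IP string out of the tuple and calls lookup(), with the range table loaded once from a file that is already present on every node.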
Thanks! Johannes