Pig has some clear advantages over raw MapReduce code, but IMHO the most
important is the learning curve. That said, if you are just loading data, you
probably don't want to translate it into Pig, well, maybe just for the fun of
it (: But if you are planning to do other operations such as joining or
grouping, it would be a lot simpler to do them in Pig.

Take a look at this; it will help you understand the bigger picture:
http://www.slideshare.net/hadoop/practical-problem-solving-with-apache-hadoop-pig

Renato M.


If you already have it as a Hadoop job, why would you want to port it to Pig?

2010/11/15 Gerrit van Vuuren <[email protected]>

> Is this a bot?
>
> And if not: yes, you can use Pig, although I advise you to reuse what has
> already been developed and not rewrite UDFs if they already exist :)
>
>
> ----- Original Message -----
> From: Cornelio Iñigo <[email protected]>
> To: [email protected] <[email protected]>
> Sent: Mon Nov 15 20:48:35 2010
> Subject: pig needed?
>
> Hi
>
> My name is Cornelio Iñigo and I'm a developer just getting started with
> Hadoop and Pig.
> I have a question about developing an application in Pig. I already have my
> program in Hadoop; it reads just one column from a dataset (a CSV file)
> and processes that data with some functions (such as language analysis and
> content analysis).
> Note that in processing the file I don't use FILTER, COUNT, or any
> built-in function of Pig, so I think all the functions would have to be
> User Defined Functions.
>
> So is it a good idea (does it make sense) to develop this program in Pig?
>
> Thanks in advance
> --
> *Cornelio*
>
