This is very much useful guys . And informative too . Now i am clear Syed Abdul kather send from Samsung S3 On Jul 31, 2012 11:11 PM, "Manoj Babu" <manoj...@gmail.com> wrote:
> Thanks Abhishek. > > Cheers! > Manoj. > > > > On Tue, Jul 31, 2012 at 10:43 PM, Abhishek Shivkumar < > abhisheksgum...@gmail.com> wrote: > >> Hi Manoj, >> >> Pig is basically a data-flow language used to perform high-level >> simple operations such as summarizations and basic analysis on top of the >> data residing on HDFS. It uses a language called Pig-Latin. It gives your >> HDFS a datawarehouse kind of perspective, and lets you do a data analysis >> job by writing simple scripts. >> >> Pig Latin is easy to learn and one necessarily doesn't need to know >> mapreduce to write and run Pig Latin. It is important to note that once you >> write the Pig scripts, when they are run, internally they generate >> mapreduce jobs to run the scripts. So, eventually, you are using mapreduce >> internally. >> >> On the other hand, you use mapreduce to perform a job that is not as >> simple to be written using a script in pig Latin. for this, you will need >> to design the mapreduce job by deciding how many reducers do you need, >> designing the combiner, partitioner and grouping class for various >> performance issues. >> >> Of course it is easy to run jobs using pig scripts, but it may not be >> possible to write everything in Pig. >> >> Hope it is fine. >> >> Thank you! >> >> With Regards, >> Abhishek S >> >> >> >> On Tue, Jul 31, 2012 at 10:37 PM, Manoj Babu <manoj...@gmail.com> wrote: >> >>> Hi, >>> >>> It would be great if any of you compare Pig and Hadoop map reduce. When >>> we should go for Hadoop or Pig? >>> I love to program using java but peoples were arguing that can be >>> easily achieved in ping with very few lines of code even my boss too... >>> I am a fresh developer for Hadoop. Could kindly provide the pros and >>> cons? >>> >>> Cheers! >>> Manoj. >>> >>> >>> >> >