Which version of hadoop do you have? The latest hadoop at this moment
is 0.18.1 I believe. The pig patch
(https://issues.apache.org/jira/browse/PIG-253) to integrate with hadoop
0.18 was only checked in Sept 18. If you downloaded pig 0.1.0, it works
with hadoop 0.17.x. Or, if you directly downloaded the source code
before 9/18, it will work with hadoop 0.17.x.
Alan.
Latha wrote:
Hi All
Am a student trying to integrate PIG and Hadoop technologies to build a
custom application as a part of my MS project.
Am trying out a simple scenario where I have setup a single node hadoop
cluster and trying to execute the pig script "script1-*hadoop*.*pig" *mentioned
in the pig tutorial.
Am hitting several issues like "Failed to create data storage "etc.
Had posted same to the groups already.
http://www.nabble.com/Integration-of-pig-and-hadoop-fails-with-%22Failed-to-create-DataStorage%22-error.-td18931962.html
Could you please suggest me the proper steps to integrate pig and hadoop.
Right now, am following the below ones.
1) Have downloded latest source for hadoop and PIG
2) Compiled hadoop and started single node cluster
3) Compiled PIG and replaced the hadoop class files with the new ones from
step 2 in the pig.jar
4) executing the pig script by setting HADOOPSITEPATH
Please let me if the above steps needs are incorrect (or) should i use any
specific pig and hadoop versions? We are stuck up with the errors. Request
you to pls help in resolving the same.
Thankyou
Srilatha