Hello, Check the HADOOP_CONF_DIR and/or PIG_CLASSPATH environment variables. It might be that you are looking at incorrect HDFS location.
Regards, Amit ________________________________ From: lulynn_2008 <[email protected]> To: user <[email protected]> Sent: Thursday, August 1, 2013 11:02 PM Subject: During running "store" command, output data file part-m-00000 is missing Hi All, I am using following test case with mr1+hdfs2, the mapreduce job succeed but there is no output data file "part-m-00000" is generated. Following is the detail of the test case and my current investigation. I want to trace this issue, please give your suggestions. Like which classes or functions I should pay attention to during debugging. Thanks~ cat $PIG_HOME/bin/test/student lynn,28,3 ff,22,4 chen,27,5 John,20,4 Mary,25,4 Bill,30,5 Joe,40,4 Run into pig grunt via command "$PIG_HOME/bin/pig": grunt> copyFromLocal $PIG_HOME/pig/bin/test/student /user/pig/student grunt> A = load 'student' using PigStorage(',') as (name:chararray, age:int, gpa:float); grunt> B = foreach A generate name; grunt> store B into 'result'; The correct output folder "result" stored at hdfs should be like following: hadoop fs -ls /user/pig/result Found 3 items -rw-r--r-- 2 pig pig 0 2013-07-30 00:52 /user/pig/result/_SUCCESS drwxr-xr-x - pig pig 0 2013-07-30 00:52 /user/pig/result/_logs -rw-r--r-- 2 pig pig 23 2013-07-30 00:52 /user/pig/part-m-00000 But in this test case, there is no output data(part-m-00000) stored at hdfs,: grunt> fs -ls /user/pig/result Found 2 items -rw-r--r-- 1 pig pig 0 2013-07-30 01:37 /user/pig/result/_SUCCESS drwx------ - pig pig 0 2013-07-30 01:37 /user/pig/result/_logs During running the test case, I can see the output data can be generated at hdfs: "/user/pig/result/_temporary/_attempt_201308010000_0008_m_000000_0/part-m-00000". This "_temporary" file will be deleted at the end of this job. But file "part-m-00000" is not saved as "/user/biadmin/tmpuser0/part-m-00000" in hdfs via rename command.
