Hi again! :) I am trying to run Pig on a local machine, but I want it to connect to a remote cluster. I can't make it use my settings - whatever I do, I get this: ----- $ pig -x mapreduce 10/10/16 22:17:43 INFO pig.Main: Logging error messages to: /home/pigtest/conf/pig_1287260263699.log 2010-10-16 22:17:43,896 [main] INFO org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to hadoop file system at: file:/// grunt> -----
I have copied the hadoop settings files (/etc/hadoop/conf/*) from the remote cluster's namenode to /home/pigtest/conf/ and exported PIG_CLASSPATH, PIGDIR, HADOOP_CLASSPATH,... I have also tried changing /etc/pig/conf/pig.configuration (even wrote there some free text so it would at least give me an error message) - nothing. It still connects to file:/// and is still doesn't display a message about a jobtracker: ----- $ export HADOOPDIR=/etc/hadoop/conf $ export PIG_PATH=/etc/pig/conf $ export PIG_CLASSPATH=$HADOOPDIR $ export PIG_HADOOP_VERSION=0.20.2 $ export PIG_HOME="/usr/lib/pig" $ export PIG_CONF_DIR="/etc/pig/" $ export PIG_LOG_DIR="/var/log/pig" $ pig -x mapreduce 10/10/16 22:32:34 INFO pig.Main: Logging error messages to: /home/pigtest/conf/pig_1287261154272.log 2010-10-16 22:32:34,471 [main] INFO org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to hadoop file system at: file:/// grunt> ----- I am guessing I am doing something fundamentally wrong. How do I change the Pig's settings? More info: using Cloudera package hadoop-pig from CDH3b3 (0.7.0+16-1~lenny- cdh3b3). I would appreciate some pointers. Kind regards, Anze
