Hi again! :)

I am trying to run Pig on a local machine, but I want it to connect to a 
remote cluster. I can't make it use my settings - whatever I do, I get this:
-----
$ pig -x mapreduce
10/10/16 22:17:43 INFO pig.Main: Logging error messages to: 
/home/pigtest/conf/pig_1287260263699.log
2010-10-16 22:17:43,896 [main] INFO  
org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to 
hadoop file system at: file:///
grunt>
-----

I have copied the hadoop settings files (/etc/hadoop/conf/*) from the remote 
cluster's namenode to /home/pigtest/conf/ and exported PIG_CLASSPATH, PIGDIR, 
HADOOP_CLASSPATH,... I have also tried changing 
/etc/pig/conf/pig.configuration (even wrote there some free text so it would 
at least give me an error message) - nothing. It still connects to file:/// 
and is still doesn't display a message about a jobtracker:
-----
$ export HADOOPDIR=/etc/hadoop/conf
$ export PIG_PATH=/etc/pig/conf
$ export PIG_CLASSPATH=$HADOOPDIR
$ export PIG_HADOOP_VERSION=0.20.2
$ export PIG_HOME="/usr/lib/pig"
$ export PIG_CONF_DIR="/etc/pig/"
$ export PIG_LOG_DIR="/var/log/pig"
$ pig -x mapreduce
10/10/16 22:32:34 INFO pig.Main: Logging error messages to: 
/home/pigtest/conf/pig_1287261154272.log
2010-10-16 22:32:34,471 [main] INFO  
org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to 
hadoop file system at: file:///
grunt> 
-----

I am guessing I am doing something fundamentally wrong. How do I change the 
Pig's settings? 

More info: using Cloudera package hadoop-pig from CDH3b3 (0.7.0+16-1~lenny-
cdh3b3). I would appreciate some pointers. 

Kind regards,

Anze

Reply via email to