pig-user  

Re: connecting to cluster with bin/pig

Gert Pfeifer
Tue, 01 Jul 2008 11:01:31 -0700

Hi,
exectype is not touched, so it defaults to mapreduce (correct me if I am wrong)

My mistake was to put the hadoop-site.xml file itself on the class path instead of the directory that contains it. Thank you very much!

Now I found this:
Protocol org.apache.hadoop.dfs.ClientProtocol version mismatch. (client = 29, server = 23)

Is there a compatibility mode, or do I have to update the server?

Gert

Alan Gates wrote:
Are you setting exectype to 'mapreduce'? It looks like you have it set to 'local'. Also, make sure the class path includes the directory that contains your hadoop-site.xml, not the file itself.
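
The directory-versus-file point can be sketched in shell. A JVM class path entry has to be a directory or a jar/zip archive, so an entry that names hadoop-site.xml directly is not searched when the file is looked up as a class path resource. This is only an illustrative sketch: the helper name conf_entry and the variables PIG_JAR and CONF_PATH are hypothetical, and the path is the placeholder from the original question. The script prints the command instead of running it, so it works without a Pig installation.

```shell
#!/bin/sh
# conf_entry is a hypothetical helper, not part of Pig or Hadoop.
# Given a path, it returns a usable class path entry: if the path names
# an .xml file, the containing directory is returned instead.
conf_entry() {
  case "$1" in
    *.xml) dirname "$1" ;;
    *)     printf '%s\n' "$1" ;;
  esac
}

# Placeholder values taken from the question; adjust to your installation.
PIG_JAR="pig.jar"
CONF_PATH="../../path/to/hadoop-site.xml"

# Build the class path from the jar plus the configuration directory.
PIG_CLASSPATH="${PIG_JAR}:$(conf_entry "$CONF_PATH")"

# Print the command rather than executing it.
echo "java -cp ${PIG_CLASSPATH} org.apache.pig.Main"
```

With the placeholder values, the class path ends in the directory ../../path/to rather than in the XML file.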

If neither of those resolves your issue, please post your whole pig.properties file.

Alan.

Gert Pfeifer wrote:
Hi,
I am trying to start pig for the first time, so here is a beginner's
question.

How do I tell the bin/pig shell script where the cluster can be found?
I used the conf/pig.properties as follows:

# clustername, name of the hadoop jobtracker. If no port is defined port
# 50020 will be used.
cluster=<ip address of the job tracker>

Then I get a message:
[main] INFO org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to hadoop file system at: file:///

Then I get the grunt shell on the local file system, which is not quite
what I wanted.

I also tried this:
java -cp pig.jar:../../path/to/hadoop-site.xml  org.apache.pig.Main

But I saw the same result. So how do I connect to the name node and the
job tracker? I guess I need both, don't I?

Thanks for any hints,
Gert