[
https://issues.apache.org/jira/browse/PIG-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13831260#comment-13831260
]
Rekha Joshi commented on PIG-2459:
----------------------------------
With consensus on PIG-3034, penny no longer exists with pig.
> Can't run penny in MapReduce mode
> ---------------------------------
>
> Key: PIG-2459
> URL: https://issues.apache.org/jira/browse/PIG-2459
> Project: Pig
> Issue Type: Bug
> Affects Versions: 0.9.1
> Environment: Testing in a cluster with 8 node
> Ubuntu 11.10 64bit
> java 1.6
> Hadoop 0.20.203
> Reporter: Andy He
> Priority: Minor
> Labels: newbie
>
> Hi folks,
> I have problems to run penny in my clusters.
> First of all, the hadoop cluster is fine, and the pig program performs well.
> Yet, when I'm trying to run the penny tool in the MapReduce Mode with the
> command:
> _java -cp
> pig-0.9.1/contrib/penny/java/penny.jar:pig-0.9.1/pig-0.9.1.jar:$HADOOP_CONF_DIR
> org.apache.pig.penny.apps.ds.Main test.pig_
> I get the following errors:
> ===========
> INFO executionengine.HExecutionEngine: Connecting to hadoop file system at:
> hdfs://lotr4.comp.polyu.edu.hk:9000
> Exception in thread "main" java.lang.RuntimeException: Failed to create
> DataStorage
> at
> org.apache.pig.backend.hadoop.datastorage.HDataStorage.init(HDataStorage.java:75)
> at
> org.apache.pig.backend.hadoop.datastorage.HDataStorage.<init>(HDataStorage.java:58)
> at
> org.apache.pig.backend.hadoop.executionengine.HExecutionEngine.init(HExecutionEngine.java:203)
> at
> org.apache.pig.backend.hadoop.executionengine.HExecutionEngine.init(HExecutionEngine.java:118)
> at org.apache.pig.impl.PigContext.connect(PigContext.java:185)
> at org.apache.pig.PigServer.<init>(PigServer.java:244)
> at org.apache.pig.PigServer.<init>(PigServer.java:229)
> at org.apache.pig.tools.ToolsPigServer.<init>(ToolsPigServer.java:70)
> at org.apache.pig.penny.ParsedPigScript.<init>(ParsedPigScript.java:82)
> at org.apache.pig.penny.PennyServer.parse(PennyServer.java:44)
> at org.apache.pig.penny.apps.ds.Main.main(Main.java:36)
> Caused by: java.io.IOException: Call to
> lotr4.comp.polyu.edu.hk/158.132.10.162:9000 failed on local exception:
> java.io.EOFException
> at org.apache.hadoop.ipc.Client.wrapException(Client.java:775)
> at org.apache.hadoop.ipc.Client.call(Client.java:743)
> at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:220)
> at $Proxy0.getProtocolVersion(Unknown Source)
> at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:359)
> at
> org.apache.hadoop.hdfs.DFSClient.createRPCNamenode(DFSClient.java:106)
> at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:207)
> at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:170)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:82)
> at
> org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:1378)
> at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:66)
> at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:1390)
> at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:196)
> at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:95)
> at
> org.apache.pig.backend.hadoop.datastorage.HDataStorage.init(HDataStorage.java:72)
> ... 10 more
> Caused by: java.io.EOFException
> at java.io.DataInputStream.readInt(DataInputStream.java:375)
> at
> org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:501)
> at org.apache.hadoop.ipc.Client$Connection.run(Client.java:446)
> ===========
> The test.pig script just loads a file and then stores it back:
> -----------
> data = LOAD 'input/student';
> STORE data INTO 'output';
> -----------
> Is it the problem that I miss some environmental variables for penny? Hope
> someone can help.
> Thanks
--
This message was sent by Atlassian JIRA
(v6.1#6144)