Hi,
I am trying to run a Pig job on an EC2 cluster that reads HAR data from S3,
and I am getting the following error.
Any ideas on what could be going wrong?
Error before Pig is launched
----------------------------
ERROR 2999: Unexpected internal error. Failed to create DataStorage
java.lang.RuntimeException: Failed to create DataStorage
    at org.apache.pig.backend.hadoop.datastorage.HDataStorage.init(HDataStorage.java:75)
    at org.apache.pig.backend.hadoop.datastorage.HDataStorage.<init>(HDataStorage.java:58)
    at org.apache.pig.backend.hadoop.executionengine.HExecutionEngine.init(HExecutionEngine.java:214)
    at org.apache.pig.backend.hadoop.executionengine.HExecutionEngine.init(HExecutionEngine.java:134)
    at org.apache.pig.impl.PigContext.connect(PigContext.java:183)
    at org.apache.pig.PigServer.<init>(PigServer.java:226)
    at org.apache.pig.PigServer.<init>(PigServer.java:215)
    at org.apache.pig.tools.grunt.Grunt.<init>(Grunt.java:55)
    at org.apache.pig.Main.run(Main.java:492)
    at org.apache.pig.Main.main(Main.java:107)
Caused by: java.io.IOException: Call to ip-10-148-63-198.us-west-1.compute.internal:9000 failed on local exception: java.net.SocketException: Malformed reply from SOCKS server
    at org.apache.hadoop.ipc.Client.wrapException(Client.java:1142)
    at org.apache.hadoop.ipc.Client.call(Client.java:1110)
    at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:226)
    at $Proxy0.getProtocolVersion(Unknown Source)
    at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:398)
    at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:384)
    at org.apache.hadoop.hdfs.DFSClient.createRPCNamenode(DFSClient.java:111)
    at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:213)
    at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:180)
    at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:89)
    at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:1514)
    at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:67)
    at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:1548)
    at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:1530)
    at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:228)
    at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:111)
    at org.apache.pig.backend.hadoop.datastorage.HDataStorage.init(HDataStorage.java:72)
    ... 9 more
Caused by: java.net.SocketException: Malformed reply from SOCKS server
    at java.net.SocksSocketImpl.readSocksReply(SocksSocketImpl.java:90)
    at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:472)
    at java.net.Socket.connect(Socket.java:529)
    at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:406)
    at org.apache.hadoop.ipc.Client$Connection.setupConnection(Client.java:425)
    at org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:532)
    at org.apache.hadoop.ipc.Client$Connection.access$2300(Client.java:210)
    at org.apache.hadoop.ipc.Client.getConnection(Client.java:1247)
    at org.apache.hadoop.ipc.Client.call(Client.java:1078)
Thanks,
Gayatri