Hi
I'm following a short tutorial from
http://blog.whitepages.com/2011/10/27/hbase-storage-and-pig/
I have a running HBase cluster and Hadoop cluster.
Steps I've performed so far:
- prepared a sample input file and put it on HDFS
- created a table in HBase
- created a script file with the following contents:
    raw_data = LOAD 'sample_data.csv' USING PigStorage( ',' ) AS (
        listing_id: chararray,
        fname: chararray,
        lname: chararray );
    STORE raw_data INTO 'hbase://sample_names' USING
        org.apache.pig.backend.hadoop.hbase.HBaseStorage (
        'info:fname info:lname');
When I run the script, the whole thing hangs. It gets as far as printing
HadoopJobId: job_201203071602_0001
More information at:
http://hadoop1:50030/jobdetails.jsp?jobid=job_201203071602_0001
and after about eight minutes (16:36 to 16:44 in the log) the job fails.
The stack trace from this run contains only this exception:
Pig Stack Trace
---------------
ERROR 2244: Job failed, hadoop does not return any error message
org.apache.pig.backend.executionengine.ExecException: ERROR 2244: Job failed, hadoop does not return any error message
    at org.apache.pig.tools.grunt.GruntParser.executeBatch(GruntParser.java:139)
    at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:192)
    at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:164)
    at org.apache.pig.tools.grunt.Grunt.exec(Grunt.java:84)
    at org.apache.pig.Main.run(Main.java:561)
    at org.apache.pig.Main.main(Main.java:111)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    at java.lang.reflect.Method.invoke(Method.java:597)
    at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
================================================================================
I've attached the log from this execution (pig.log).
My environment:
- HBase 0.92.0
- Pig 0.9.2
- Hadoop 1.0.0
Am I doing something wrong?
Regards,
Marcin

pig.log:
2012-03-07 16:36:28,206 [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:host.name=hadoop1
2012-03-07 16:36:28,206 [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.version=1.6.0_26
2012-03-07 16:36:28,206 [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.vendor=Sun Microsystems Inc.
2012-03-07 16:36:28,206 [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.home=/usr/java/jdk1.6.0_26/jre
2012-03-07 16:36:28,206 [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.class.path=/usr/lib/hadoop-1.0.0/libexec/../conf:/usr/java/jdk1.6.0_26//lib/tools.jar:/usr/lib/hadoop-1.0.0/libexec/..:/usr/lib/hadoop-1.0.0/libexec/../hadoop-core-1.0.0.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/asm-3.2.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/aspectjrt-1.6.5.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/aspectjtools-1.6.5.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/commons-beanutils-1.7.0.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/commons-beanutils-core-1.8.0.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/commons-cli-1.2.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/commons-codec-1.4.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/commons-collections-3.2.1.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/commons-configuration-1.6.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/commons-daemon-1.0.1.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/commons-digester-1.8.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/commons-el-1.0.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/commons-httpclient-3.0.1.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/commons-lang-2.4.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/commons-logging-1.1.1.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/commons-logging-api-1.0.4.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/commons-math-2.1.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/commons-net-1.4.1.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/core-3.1.1.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/hadoop-capacity-scheduler-1.0.0.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/hadoop-fairscheduler-1.0.0.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/hadoop-thriftfs-1.0.0.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/hsqldb-1.8.0.10.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/jackson-core-asl-1.0.1.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/jackson-mapper-asl-1.0.1.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/jasper-compiler-5.5.12.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/jasper-runtime-5.5.12.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/jdeb-0.8.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/jersey-core-1.8.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/jersey-json-1.8.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/jersey-server-1.8.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/jets3t-0.6.1.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/jetty-6.1.26.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/jetty-util-6.1.26.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/jsch-0.1.42.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/junit-4.5.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/kfs-0.2.2.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/log4j-1.2.15.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/mockito-all-1.8.5.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/oro-2.0.8.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/servlet-api-2.5-20081211.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/slf4j-api-1.4.3.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/slf4j-log4j12-1.4.3.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/xmlenc-0.52.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/jsp-2.1/jsp-2.1.jar:/usr/lib/hadoop-1.0.0/libexec/../lib/jsp-2.1/jsp-api-2.1.jar:/home/michler/hbase/conf:/usr/java/jdk1.6.0_26//lib/tools.jar:/home/michler/hbase:/home/michler/hbase/hbase-0.92.0.jar:/home/michler/hbase/hbase-0.92.0-tests.jar:/home/michler/hbase/lib/activation-1.1.jar:/home/michler/hbase/lib/asm-3.1.jar:/home/michler/hbase/lib/avro-1.5.3.jar:/home/michler/hbase/lib/avro-ipc-1.5.3.jar:/home/michler/hbase/lib/commons-beanutils-1.7.0.jar:/home/michler/hbase/lib/commons-beanutils-core-1.8.0.jar:/home/michler/hbase/lib/commons-cli-1.2.jar:/home/michler/hbase/lib/commons-codec-1.4.jar:/home/michler/hbase/lib/commons-collections-3.2.1.jar:/home/michler/hbase/lib/commons-configuration-1.6.jar:/home/michler/hbase/lib/commons-digester-1.8.jar:/home/michler/hbase/lib/commons-el-1.0.jar:/home/michler/hbase/lib/commons-httpclient-3.1.jar:/home/michler/hbase/lib/commons-lang-2.5.jar:/home/michler/hbase/lib/commons-logging-1.1.1.jar:/home/michler/hbase/lib/commons-math-2.1.jar:/home/michler/hbase/lib/commons-net-1.4.1.jar:/home/michler/hbase/lib/core-3.1.1.jar:/home/michler/hbase/lib/guava-r09.jar:/home/michler/hbase/lib/hadoop-core-1.0.0.jar:/home/michler/hbase/lib/high-scale-lib-1.1.1.jar:/home/michler/hbase/lib/httpclient-4.0.1.jar:/home/michler/hbase/lib/httpcore-4.0.1.jar:/home/michler/hbase/lib/jackson-core-asl-1.5.5.jar:/home/michler/hbase/lib/jackson-jaxrs-1.5.5.jar:/home/michler/hbase/lib/jackson-mapper-asl-1.5.5.jar:/home/michler/hbase/lib/jackson-xc-1.5.5.jar:/home/michler/hbase/lib/jamon-runtime-2.3.1.jar:/home/michler/hbase/lib/jasper-compiler-5.5.23.jar:/home/michler/hbase/lib/jasper-runtime-5.5.23.jar:/home/michler/hbase/lib/jaxb-api-2.1.jar:/home/michler/hbase/lib/jaxb-impl-2.1.12.jar:/home/michler/hbase/lib/jersey-core-1.4.jar:/home/michler/hbase/lib/jersey-json-1.4.jar:/home/michler/hbase/lib/jersey-server-1.4.jar:/home/michler/hbase/lib/jettison-1.1.jar:/home/michler/hbase/lib/jetty-6.1.26.jar:/home/michler/hbase/lib/jetty-util-6.1.26.jar:/home/michler/hbase/lib/jruby-complete-1.6.5.jar:/home/michler/hbase/lib/jsp-2.1-6.1.14.jar:/home/michler/hbase/lib/jsp-api-2.1-6.1.14.jar:/home/michler/hbase/lib/libthrift-0.7.0.jar:/home/michler/hbase/lib/log4j-1.2.16.jar:/home/michler/hbase/lib/netty-3.2.4.Final.jar:/home/michler/hbase/lib/protobuf-java-2.4.0a.jar:/home/michler/hbase/lib/servlet-api-2.5-6.1.14.jar:/home/michler/hbase/lib/servlet-api-2.5.jar:/home/michler/hbase/lib/slf4j-api-1.5.8.jar:/home/michler/hbase/lib/snappy-java-1.0.3.2.jar:/home/michler/hbase/lib/stax-api-1.0.1.jar:/home/michler/hbase/lib/velocity-1.7.jar:/home/michler/hbase/lib/xmlenc-0.52.jar:/home/michler/hbase/lib/zookeeper-3.4.2.jar:/home/michler/pig-0.9.2/bin/../conf:/usr/java/jdk1.6.0_26//lib/tools.jar:/home/michler/pig-0.9.2/bin/../lib/automaton.jar:/home/michler/pig-0.9.2/bin/../lib/jython-2.5.0.jar:/home/michler/pig-0.9.2/bin/../pig-0.9.2-withouthadoop.jar:
2012-03-07 16:36:28,206 [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.library.path=/usr/lib/hadoop-1.0.0/libexec/../lib/native/Linux-amd64-64
2012-03-07 16:36:28,206 [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.io.tmpdir=/tmp
2012-03-07 16:36:28,206 [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.compiler=<NA>
2012-03-07 16:36:28,206 [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:os.name=Linux
2012-03-07 16:36:28,206 [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:os.arch=amd64
2012-03-07 16:36:28,206 [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:os.version=2.6.18-274.18.1.el5xen
2012-03-07 16:36:28,206 [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:user.name=michler
2012-03-07 16:36:28,206 [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:user.home=/home/michler
2012-03-07 16:36:28,206 [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:user.dir=/home/michler/pig-0.9.2
2012-03-07 16:36:28,208 [main] INFO org.apache.zookeeper.ZooKeeper - Initiating client connection, connectString=hadoop3:2222,hadoop2:2222,hadoop4:2222 sessionTimeout=180000 watcher=hconnection
2012-03-07 16:36:28,230 [main-SendThread()] INFO org.apache.zookeeper.ClientCnxn - Opening socket connection to server /172.16.36.72:2222
2012-03-07 16:36:28,231 [main-SendThread(hadoop3:2222)] INFO org.apache.zookeeper.ClientCnxn - Socket connection established to hadoop3/172.16.36.72:2222, initiating session
2012-03-07 16:36:28,233 [main] INFO org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper - The identifier of this process is 29450@hadoop1
2012-03-07 16:36:28,245 [main-SendThread(hadoop3:2222)] INFO org.apache.zookeeper.ClientCnxn - Session establishment complete on server hadoop3/172.16.36.72:2222, sessionid = 0x135edccb3170002, negotiated timeout = 180000
2012-03-07 16:36:28,467 [main] INFO org.apache.hadoop.hbase.mapreduce.TableOutputFormat - Created table instance for sample_names
2012-03-07 16:36:28,564 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MRCompiler - File concatenation threshold: 100 optimistic? false
2012-03-07 16:36:28,594 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size before optimization: 1
2012-03-07 16:36:28,594 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size after optimization: 1
2012-03-07 16:36:28,671 [main] INFO org.apache.pig.tools.pigstats.ScriptState - Pig script settings are added to the job
2012-03-07 16:36:28,680 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3
2012-03-07 16:36:28,682 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - creating jar file Job5695153650838249666.jar
2012-03-07 16:36:34,406 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - jar file Job5695153650838249666.jar created
2012-03-07 16:36:34,461 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - Setting up single store job
2012-03-07 16:36:34,517 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 1 map-reduce job(s) waiting for submission.
2012-03-07 16:36:35,021 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 0% complete
2012-03-07 16:36:35,594 [Thread-7] INFO org.apache.hadoop.hbase.mapreduce.TableOutputFormat - Created table instance for sample_names
2012-03-07 16:36:35,616 [Thread-7] INFO org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to process : 1
2012-03-07 16:36:35,616 [Thread-7] INFO org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 1
2012-03-07 16:36:35,626 [Thread-7] INFO org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths (combined) to process : 1
2012-03-07 16:36:36,473 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - HadoopJobId: job_201203071602_0001
2012-03-07 16:36:36,473 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - More information at: http://hadoop1:50030/jobdetails.jsp?jobid=job_201203071602_0001
2012-03-07 16:44:36,429 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - job job_201203071602_0001 has failed! Stop running all dependent jobs
2012-03-07 16:44:36,429 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 100% complete
2012-03-07 16:44:36,434 [main] ERROR org.apache.pig.tools.pigstats.PigStatsUtil - 1 map reduce job(s) failed!
2012-03-07 16:44:36,436 [main] INFO org.apache.pig.tools.pigstats.SimplePigStats - Script Statistics:
HadoopVersion  PigVersion  UserId   StartedAt            FinishedAt           Features
1.0.0          0.9.2       michler  2012-03-07 16:36:28  2012-03-07 16:44:36  UNKNOWN
Failed!
Failed Jobs:
JobId Alias Feature Message Outputs
job_201203071602_0001 raw_data MAP_ONLY Message: Job failed! Error - JobCleanup Task Failure, Task: task_201203071602_0001_m_000001 hbase://sample_names,
Input(s):
Failed to read data from "/user/michler/in.csv"
Output(s):
Failed to produce result in "hbase://sample_names"
Counters:
Total records written : 0
Total bytes written : 0
Spillable Memory Manager spill count : 0
Total bags proactively spilled: 0
Total records proactively spilled: 0
Job DAG:
job_201203071602_0001
2012-03-07 16:44:36,436 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Failed!
2012-03-07 16:44:36,457 [main] ERROR org.apache.pig.tools.grunt.GruntParser - ERROR 2244: Job failed, hadoop does not return any error message
Details at logfile: /home/michler/pig-0.9.2/pig_1331134586580.log
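
For anyone skimming a log like the one above, the few lines that matter (the job ID, the failed task and the ERROR messages) can be pulled out with a short script. This is only a rough sketch: the function name `summarize_pig_log` and the regular expressions are assumptions based on the line formats visible in this particular log, not anything provided by Pig or Hadoop.

```python
import re

# Patterns are assumptions derived from the line formats seen in pig.log above.
JOB_ID = re.compile(r"HadoopJobId: (job_\d+_\d+)")
TASK_ID = re.compile(r"Task: (task_\d+_\d+_[mr]_\d+)")
ERROR_LINE = re.compile(r"\]\s+ERROR\s+(\S+)\s+-\s+(.*)")


def summarize_pig_log(text):
    """Collect the job id, failed task ids and ERROR messages from a Pig log."""
    summary = {"job_id": None, "failed_tasks": [], "errors": []}
    for line in text.splitlines():
        job = JOB_ID.search(line)
        if job and summary["job_id"] is None:
            summary["job_id"] = job.group(1)
        # A "Task: task_..." fragment names the task that made the job fail.
        summary["failed_tasks"].extend(TASK_ID.findall(line))
        err = ERROR_LINE.search(line)
        if err:
            summary["errors"].append(err.group(2).strip())
    return summary


# Three lines copied from the log above, used as sample input.
SAMPLE = """\
2012-03-07 16:36:36,473 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - HadoopJobId: job_201203071602_0001
job_201203071602_0001 raw_data MAP_ONLY Message: Job failed! Error - JobCleanup Task Failure, Task: task_201203071602_0001_m_000001 hbase://sample_names,
2012-03-07 16:44:36,457 [main] ERROR org.apache.pig.tools.grunt.GruntParser - ERROR 2244: Job failed, hadoop does not return any error message
"""

if __name__ == "__main__":
    for key, value in summarize_pig_log(SAMPLE).items():
        print(key, "=", value)
```

The task ID it reports is the one to look up in the JobTracker web UI linked in the log, since the Pig client itself only sees ERROR 2244.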