Nick Dimiduk created HCATALOG-623:
-------------------------------------

             Summary: Understanding how to use the HBase bulk import feature
                 Key: HCATALOG-623
                 URL: https://issues.apache.org/jira/browse/HCATALOG-623
             Project: HCatalog
          Issue Type: Documentation
          Components: hbase
    Affects Versions: 0.5
            Reporter: Nick Dimiduk


I'm working through use of the HBaseBulkOutputFormat and I'm getting stuck. I 
have a simple example that replicates the [ImportTsv 
example|http://hbase.apache.org/book/ops_mgt.html#importtsv] from the HBase 
documentation. The end result is the ImportSequenceFile job failing due to jars 
missing from its classpath. Presumably I've not configured something correctly. 
In this example I'm using Pig.

Here's the error message and also the command files and commands I use to run 
them.

{noformat}
$ hadoop fs -put simple.tsv /tmp/
$ HCAT_CLASSPATH=$(hbase classpath) hcat -f simple.ddl
$ PIG_CLASSPATH=$(hbase classpath) pig -v -useHCatalog simple.bulkload.pig
{noformat}

Error message:

{noformat}
2013-02-19 19:55:30,354 WARN 
org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil: Could not find jar for 
class class org.apache.zookeeper.ZooKeeper in order to ship it to the cluster.
2013-02-19 19:55:30,355 WARN 
org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil: Could not find jar for 
class class org.apache.hadoop.hbase.client.HTable in order to ship it to the 
cluster.
2013-02-19 19:55:30,357 WARN 
org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil: Could not find jar for 
class class org.apache.hadoop.hive.ql.metadata.HiveException in order to ship 
it to the cluster.
2013-02-19 19:55:30,358 WARN 
org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil: Could not find jar for 
class class org.apache.hcatalog.mapreduce.HCatOutputFormat in order to ship it 
to the cluster.
2013-02-19 19:55:30,359 WARN 
org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil: Could not find jar for 
class class org.apache.hcatalog.hbase.HBaseHCatStorageHandler in order to ship 
it to the cluster.
2013-02-19 19:55:30,360 WARN 
org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil: Could not find jar for 
class class org.apache.hadoop.hive.hbase.HBaseSerDe in order to ship it to the 
cluster.
2013-02-19 19:55:30,361 WARN 
org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil: Could not find jar for 
class class org.apache.hadoop.hive.metastore.api.Table in order to ship it to 
the cluster.
2013-02-19 19:55:30,363 WARN 
org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil: Could not find jar for 
class interface org.apache.thrift.TBase in order to ship it to the cluster.
2013-02-19 19:55:30,364 WARN 
org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil: Could not find jar for 
class class org.apache.hadoop.hbase.util.Bytes in order to ship it to the 
cluster.
2013-02-19 19:55:30,365 WARN 
org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil: Could not find jar for 
class class com.facebook.fb303.FacebookBase in order to ship it to the cluster.
2013-02-19 19:55:30,366 WARN 
org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil: Could not find jar for 
class class com.google.common.util.concurrent.ThreadFactoryBuilder in order to 
ship it to the cluster.
{noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to