Hi all.
I have one question regarding bulk load.
How to load data with table empty column values in few rows using bulk load
tool ?
I tried the following simple example in HBase 0.94.11 and Hadoop-2, with table
having three columns and second column value is empty in few rows using bulk
load tool.
Ø Data in file is in below format
row0,value1,value0
row1,,value1
row2,value3,value2
row3,,value3
row4,value5,value4
row5,,value5
row6,value7,value6
row7,,value7
row8,value9,value8
Ø When I execute the command
hadoop jar <HBASE_HOME>/hbase-0.94.11-security.jar importtsv
-Dimporttsv.skip.bad.lines=false -Dimporttsv.separator=,
-Dimporttsv.columns=HBASE_ROW_KEY,cf1:c1,cf1:c2 -Dimporttsv.bulk.output=
/bulkdata/comma_separated _3columns comma_separated_3columns /comma_separated_
3columns.txt
I get the below Exception.
2014-04-07 11:15:01,870 INFO [main] mapreduce.Job
(Job.java:printTaskEvents(1424)) - Task Id :
attempt_1396526639698_0028_m_000000_2, Status : FAILED
Error: java.io.IOException:
org.apache.hadoop.hbase.mapreduce.ImportTsv$TsvParser$BadTsvLineException: No
delimiter
at
org.apache.hadoop.hbase.mapreduce.TsvImporterTextMapper.map(TsvImporterTextMapper.java:135)
at
org.apache.hadoop.hbase.mapreduce.TsvImporterTextMapper.map(TsvImporterTextMapper.java:33)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:145)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:763)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:339)
at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:167)
Regards,
Ashish Singhi