Hello,

Which is the fastest way to dump  the content of Hbase table to Hdfs?  is
it possible to use the hbase snapshot + Spark to do this?

now we have already use the hbase snapshot + mapreduce-v2(does not via the
Htable) to convert the HFiles to OrcFile, but we found the 'spilling map
output' occupied most of whole time.  so the spark can decrease the cost?

map task: read the hfile, and convert it to KeyValues

reduce task: merge the keyvalues of same rowkey

thanks.

-- 
*Ric Dong*

Reply via email to