BulkLoading MR tasks

abhay ratnaparkhi Mon, 08 Aug 2011 04:14:44 -0700

Hello,

I am trying to load data into an HBase table using a MapReduce job.
The input comes from another HBase table, which stores IDs (around
100,000 of them).


My job reads the IDs from that table and, for each ID, calls an API that
returns some documents and meta information.
I am using the "IdentityTableReducer" class, and the number of reduce tasks
is set to 10.

When I run the program, I see only one map task running. Do I also need to
configure the number of map tasks when running the job?
My input table has only one region. Does having multiple regions in the
input table increase the number of map tasks?
I assume the framework generates map tasks based on data locality. Will
splitting the region help in this case?
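
To make the question concrete, here is a minimal plain-Java sketch of the
behaviour I am assuming: one input split, and therefore one map task, per
region of the input table. It does not call HBase at all. The class name
RegionSplitSketch, the method splitsFor, and the example row keys are
hypothetical and only illustrate the idea; the real split logic lives in
the table input format.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

public class RegionSplitSketch {

    // Given the sorted boundary keys between regions, return one
    // [startKey, endKey) range per region. An empty string stands for
    // an open-ended start or end of the table (an assumption made for
    // this sketch).
    public static List<String[]> splitsFor(List<String> regionBoundaries) {
        List<String[]> splits = new ArrayList<>();
        String start = "";
        for (String boundary : regionBoundaries) {
            splits.add(new String[] {start, boundary});
            start = boundary;
        }
        // The last region runs from the final boundary to the end of the table.
        splits.add(new String[] {start, ""});
        return splits;
    }

    public static void main(String[] args) {
        // A table with no boundaries is a single region: one split.
        List<String> oneRegion = Collections.emptyList();
        System.out.println(splitsFor(oneRegion).size()); // prints 1

        // Three hypothetical boundary keys give four regions: four splits.
        List<String> fourRegions =
            Arrays.asList("id025000", "id050000", "id075000");
        System.out.println(splitsFor(fourRegions).size()); // prints 4
    }
}
```

If this assumption holds, a table with a single region would always give a
single map task, however many rows it contains.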

Thank You!
Abhay
