Hello, I am trying to load data in HBase table using Map Reduce task. I have input from one HBase table which has some id's stored (around 100000).
My task reads data from that table and invokes one API giving that ID as input and fetching some documents and meta information (through API). I am using "IdentityTableReducer" class and number of reducer tasks has been set to 10. When I run a program I can only see one map task running. Do I need to configure number of map tasks also while running job? My input table has only one region. Does having multiple regions for input table increase number of map tasks. I think framework is intelligent enough to generate Map tasks based on data locality. Will region split help in this case? Thank You! Abhay
