Hi,

The bulk load tool uses the TotalOrderPartitioner class to partition the map output into disjoint key ranges, based on the start keys of the regions in the table. The number of reducers created for the bulk load job equals the number of regions, so each reducer receives the data for exactly one region.
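
To illustrate the idea of range partitioning by region start keys, here is a minimal, self-contained sketch. It is not Hadoop's TotalOrderPartitioner code or any HBase API. The class name RegionPartitionSketch, its methods, and the example start keys are hypothetical, chosen only for illustration. It assumes the start keys are sorted and that the first region has an empty start key.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical sketch: maps a row key to the index of the region (and so
// the reducer) whose key range contains it. Not the real Hadoop/HBase code.
class RegionPartitionSketch {

    // Helper to turn a string into row-key bytes for the example.
    static byte[] bytes(String s) {
        return s.getBytes(StandardCharsets.UTF_8);
    }

    // Lexicographic comparison of two keys, treating bytes as unsigned.
    static int compareKeys(byte[] a, byte[] b) {
        int n = Math.min(a.length, b.length);
        for (int i = 0; i < n; i++) {
            int diff = (a[i] & 0xff) - (b[i] & 0xff);
            if (diff != 0) {
                return diff;
            }
        }
        return a.length - b.length;
    }

    // Binary search for the last start key that is <= rowKey.
    // Region i covers [startKeys[i], startKeys[i + 1]); the last region is open-ended.
    static int partitionFor(byte[] rowKey, byte[][] startKeys) {
        int lo = 0;
        int hi = startKeys.length - 1;
        while (lo < hi) {
            int mid = (lo + hi + 1) >>> 1;
            if (compareKeys(startKeys[mid], rowKey) <= 0) {
                lo = mid;
            } else {
                hi = mid - 1;
            }
        }
        return lo;
    }

    // Assumed example: a table presplit into four regions.
    static byte[][] exampleStartKeys() {
        return new byte[][] { bytes(""), bytes("g"), bytes("n"), bytes("t") };
    }

    public static void main(String[] args) {
        byte[][] startKeys = exampleStartKeys();
        for (String row : new String[] { "apple", "g", "monkey", "zebra" }) {
            System.out.println(row + " -> region " + partitionFor(bytes(row), startKeys));
        }
    }
}
```

With these assumed start keys, "apple" falls in region 0, "g" and "monkey" in region 1, and "zebra" in region 3. Since every row key maps to exactly one range, each reducer ends up with data for a single region.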

Regards,
Jyothi

-----Original Message-----
From: divye sheth [mailto:[email protected]]
Sent: 25 March 2014 16:08
To: [email protected]
Subject: Bulk Loading with Presplits

Hi,

I have a table with presplits and am writing a utility to bulk load StoreFiles into it using the doBulkLoad functionality. My question is: how does HBase handle the distribution of keys when performing a bulk load? How does it decide which key (row) goes to which partition?

Please help me understand this.

HBase version: 0.94.2

Thanks,
Divye Sheth
