[ https://issues.apache.org/jira/browse/HBASE-10017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nick Dimiduk updated HBASE-10017:
---------------------------------
Attachment: TestHRegionServerBulkLoad-more-splits.txt
TEST-org.apache.hadoop.hbase.mapreduce.IntegrationTestBulkLoad.xml.gz
Here's a change to IntegrationTestBulkLoad that makes use of the
HRegionPartitioner. With this change, the test fails on the first or second
run. Logs are attached.
> HRegionPartitioner, rows directed to last partition are wrongly mapped.
> -----------------------------------------------------------------------
>
> Key: HBASE-10017
> URL: https://issues.apache.org/jira/browse/HBASE-10017
> Project: HBase
> Issue Type: Bug
> Components: mapreduce
> Affects Versions: 0.94.6
> Reporter: Roman Nikitchenko
> Priority: Critical
> Attachments: HBASE-10017-r1544633.patch, HBASE-10017-r1544633.patch,
> TEST-org.apache.hadoop.hbase.mapreduce.IntegrationTestBulkLoad.xml.gz,
> TestHRegionServerBulkLoad-more-splits.txt,
> TestHRegionServerBulkLoad-more-splits.txt, patchSiteOutput.txt
>
>
> Inside the HRegionPartitioner class, the getPartition() method should map the
> first numPartitions regions to their partitions 1:1. However, because of the
> condition it checks, the last region is hashed instead, which can leave the
> last reducer without any data. This is considered a serious issue.
> I could reproduce this only with 16 or more regions per table. The defect was
> originally found in 0.94.6, but at least today's trunk and the 0.91 branch
> head have the same HRegionPartitioner code in this part, so they have the
> same issue.
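The mapping described above can be illustrated with a minimal, self-contained sketch. It is not the HBase source: regions are identified by a plain index rather than a start key, and the class and method names (PartitionSketch, buggyPartition, fixedPartition, hashFallback) are hypothetical. The boundary check `regionIndex >= numPartitions - 1` and the string-hash fallback are assumptions about the shape of the condition the report refers to, and the "fixed" variant is only one possible correction, not necessarily what the attached patch does.

```java
// Simplified sketch: regions are identified by index, not by start key.
final class PartitionSketch {

    // Assumed shape of the reported behaviour: the boundary check uses
    // numPartitions - 1, so the region at index numPartitions - 1 is
    // hashed instead of being mapped 1:1.
    static int buggyPartition(int regionIndex, int numPartitions) {
        if (regionIndex >= numPartitions - 1) {
            return hashFallback(regionIndex, numPartitions);
        }
        return regionIndex;
    }

    // One possible correction (an assumption, not the attached patch):
    // hash only regions that have no partition of their own.
    static int fixedPartition(int regionIndex, int numPartitions) {
        if (regionIndex >= numPartitions) {
            return hashFallback(regionIndex, numPartitions);
        }
        return regionIndex;
    }

    // Assumed fallback: hash the decimal string of the region index.
    static int hashFallback(int regionIndex, int numPartitions) {
        int hash = Integer.toString(regionIndex).hashCode() & Integer.MAX_VALUE;
        return hash % numPartitions;
    }

    public static void main(String[] args) {
        int numPartitions = 16; // 16 regions, 16 reducers
        int lastRegion = numPartitions - 1;
        System.out.println("buggy: region " + lastRegion + " -> partition "
                + buggyPartition(lastRegion, numPartitions));
        System.out.println("fixed: region " + lastRegion + " -> partition "
                + fixedPartition(lastRegion, numPartitions));
    }
}
```

Under these assumptions, with 16 regions and 16 reducers the last region (index 15) lands in partition 4, so reducer 15 receives nothing, while the corrected check keeps every region on its own partition.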
--
This message was sent by Atlassian JIRA
(v6.1#6144)