[
https://issues.apache.org/jira/browse/HBASE-18090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16167984#comment-16167984
]
Mikhail Antonov commented on HBASE-18090:
-----------------------------------------
Looks good to me! The only thing that would be good to do is probably to add
some tests for the new methods in the RegionSplitter and the SplitAlgos.
That looks actually backward compatible and self-contained enough that it can
go to branch-1 (cc [~apurtell]). I'd consider it for 1.3.2 port - I think
risk-benefit ratio is good.
[~ghelmling] [~ashu210890] you may be interested too.
> Improve TableSnapshotInputFormat to allow more multiple mappers per region
> --------------------------------------------------------------------------
>
> Key: HBASE-18090
> URL: https://issues.apache.org/jira/browse/HBASE-18090
> Project: HBase
> Issue Type: Improvement
> Components: mapreduce
> Affects Versions: 1.4.0
> Reporter: Mikhail Antonov
> Assignee: xinxin fan
> Attachments: HBASE-18090-branch-1.3-v1.patch,
> HBASE-18090-branch-1.3-v2.patch, HBASE-18090-V3-master.patch
>
>
> TableSnapshotInputFormat runs one map task per region in the table snapshot.
> This places unnecessary restriction that the region layout of the original
> table needs to take the processing resources available to MR job into
> consideration. Allowing to run multiple mappers per region (assuming
> reasonably even key distribution) would be useful.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)