[ https://issues.apache.org/jira/browse/HBASE-20056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16380639#comment-16380639 ]
Ted Yu edited comment on HBASE-20056 at 2/28/18 4:38 PM:
---------------------------------------------------------
Thanks for the patch, Yechao
Thanks for the review, Chiaping.
was (Author: [email protected]):
Thanks for the patch, Yehao
Thanks for the review, Chiaping.
> Performance optimization on MultiTableInputFormatBase#getSplits()
> ------------------------------------------------------------------
>
> Key: HBASE-20056
> URL: https://issues.apache.org/jira/browse/HBASE-20056
> Project: HBase
> Issue Type: Bug
> Components: hbase, mapreduce
> Affects Versions: 1.0.1, 1.3.1, 1.2.6
> Reporter: ShivaKumar SS
> Assignee: Yechao Chen
> Priority: Minor
> Labels: hbase, mapreduce, performance
> Fix For: 1.2.7
>
> Attachments: HBASE-20056.branch-1.2.patch
>
>
> Currently this method iterates over the list of Scan objects to get splits,
> and on each iteration it opens an HConnection and closes it, which is heavy.
> It can be optimized so that a single HConnection is used to compute the
> splits for all the Scan objects.
> This optimization will help reduce the launch time of the MR job.
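
The idea in the description can be illustrated with a minimal sketch. It does not use the HBase API and is not the attached patch: FakeConnection, splitsFor, and both method names are hypothetical stand-ins, and scans are represented as plain strings. It only contrasts opening one connection per scan with sharing a single connection across all scans.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class SharedConnectionSketch {

    // Hypothetical stand-in for an expensive connection (NOT the HBase API).
    // It counts how many times a connection has been opened.
    static final class FakeConnection implements AutoCloseable {
        static int opened = 0;

        FakeConnection() {
            opened++;
        }

        // Pretend every scan yields two splits.
        List<String> splitsFor(String scan) {
            List<String> out = new ArrayList<>();
            out.add(scan + "#split0");
            out.add(scan + "#split1");
            return out;
        }

        @Override
        public void close() {
            // Nothing to release in this sketch.
        }
    }

    // Pattern described as "heavy": a new connection is opened and closed per scan.
    static List<String> splitsPerScanConnection(List<String> scans) {
        List<String> splits = new ArrayList<>();
        for (String scan : scans) {
            try (FakeConnection conn = new FakeConnection()) {
                splits.addAll(conn.splitsFor(scan));
            }
        }
        return splits;
    }

    // Proposed pattern: one connection is opened once and reused for every scan.
    static List<String> splitsSharedConnection(List<String> scans) {
        List<String> splits = new ArrayList<>();
        try (FakeConnection conn = new FakeConnection()) {
            for (String scan : scans) {
                splits.addAll(conn.splitsFor(scan));
            }
        }
        return splits;
    }

    public static void main(String[] args) {
        List<String> scans = Arrays.asList("scanA", "scanB", "scanC");

        FakeConnection.opened = 0;
        splitsPerScanConnection(scans);
        System.out.println("per-scan connections opened: " + FakeConnection.opened);

        FakeConnection.opened = 0;
        splitsSharedConnection(scans);
        System.out.println("shared connections opened: " + FakeConnection.opened);
    }
}
```

Both methods return the same splits in the same order; only the number of connections opened differs, which is where the job launch time would be saved if connection setup dominates.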
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)