[
https://issues.apache.org/jira/browse/HBASE-21285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16696592#comment-16696592
]
Lavinia-Stefania Sirbu commented on HBASE-21285:
------------------------------------------------
[[email protected]] In my first approach, I used the getters because I assumed
that anyone implementing a size-based TableSnapshotInputFormat would need
access to all the members in order to create new input split objects.
I have attached a new patch (004) that takes a different approach: the
additional size-based splitting is now done inside the getSplits method.
Could you take a look? Thank you!
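
To illustrate the idea only (this is not the code in the patch), here is a
minimal, self-contained sketch of size-based splitting. It assumes the region
sizes are already known and uses hypothetical names (SizeBasedSplitSketch,
Split, splitsForRegion, targetSplitBytes) that do not exist in HBase. Each
region is cut into ceil(size / target) splits, and every split carries a
non-zero length.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch; none of these names come from HBase or from the patch.
class SizeBasedSplitSketch {

    // Stand-in for an input split: region name, slice index, length in bytes.
    static final class Split {
        final String region;
        final int index;
        final long length;

        Split(String region, int index, long length) {
            this.region = region;
            this.index = index;
            this.length = length;
        }
    }

    // Number of splits for one region: ceil(regionSize / target), at least 1.
    static int splitsForRegion(long regionSizeBytes, long targetSplitBytes) {
        if (targetSplitBytes <= 0) {
            throw new IllegalArgumentException("targetSplitBytes must be positive");
        }
        if (regionSizeBytes <= 0) {
            return 1; // an empty region still gets one split
        }
        return (int) ((regionSizeBytes + targetSplitBytes - 1) / targetSplitBytes);
    }

    // Builds the splits for all regions and assigns each one a length.
    static List<Split> getSplits(String[] regions, long[] sizes, long targetSplitBytes) {
        List<Split> out = new ArrayList<>();
        for (int i = 0; i < regions.length; i++) {
            long size = Math.max(0L, sizes[i]);
            int n = splitsForRegion(size, targetSplitBytes);
            long base = size / n;
            long remainder = size % n;
            for (int j = 0; j < n; j++) {
                // Spread the remainder over the first splits so lengths sum to size.
                long length = base + (j < remainder ? 1 : 0);
                out.add(new Split(regions[i], j, length));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<Split> splits = getSplits(
            new String[] {"region-a", "region-b"}, new long[] {250L, 40L}, 100L);
        for (Split s : splits) {
            System.out.println(s.region + " #" + s.index + " length=" + s.length);
        }
    }
}
```

With a fixed number of splits per region, a large and a small region would
be divided into the same number of pieces; here the count follows the region
size instead.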
> Enhanced TableSnapshotInputFormat to allow a size based splitting
> -----------------------------------------------------------------
>
> Key: HBASE-21285
> URL: https://issues.apache.org/jira/browse/HBASE-21285
> Project: HBase
> Issue Type: Improvement
> Components: snapshots
> Affects Versions: 1.4.0
> Reporter: Lavinia-Stefania Sirbu
> Assignee: Lavinia-Stefania Sirbu
> Priority: Minor
> Attachments: HBASE-21285.branch-1.4.001.patch,
> HBASE-21285.branch-1.4.002.patch, HBASE-21285.branch-1.4.003.patch,
> HBASE-21285.branch-1.4.004.patch, HBASE-21285.master.001.patch
>
>
> Currently, all the splits generated from a snapshot have length 0. We do
> have a configuration for the number of splits per region, but it applies
> to every region alike and is not very helpful when region sizes differ
> widely. The change must be made in TableSnapshotInputFormatImpl, where the
> split length must be computed.