[ 
https://issues.apache.org/jira/browse/HBASE-21285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16696592#comment-16696592
 ] 

Lavinia-Stefania Sirbu commented on HBASE-21285:
------------------------------------------------

[[email protected]] In my first approach, I used the getters because my idea 
was that if someone wants to do a size based TableSnapshotInputFormat, they 
need access to all the members, so they can create new input split objects.

I have added a new patch (004) with a new approach where the additional 
splitting based on size is done inside the getSplits method. Can you take a 
look? Thank you!

> Enhanced TableSnapshotInputFormat to allow a size based splitting
> -----------------------------------------------------------------
>
>                 Key: HBASE-21285
>                 URL: https://issues.apache.org/jira/browse/HBASE-21285
>             Project: HBase
>          Issue Type: Improvement
>          Components: snapshots
>    Affects Versions: 1.4.0
>            Reporter: Lavinia-Stefania Sirbu
>            Assignee: Lavinia-Stefania Sirbu
>            Priority: Minor
>         Attachments: HBASE-21285.branch-1.4.001.patch, 
> HBASE-21285.branch-1.4.002.patch, HBASE-21285.branch-1.4.003.patch, 
> HBASE-21285.branch-1.4.004.patch, HBASE-21285.master.001.patch
>
>
> Currently, all the splits generated by a snapshot are having length 0. Right 
> now, we have a configuration for the number of splits per region, but it's a 
> general one and not very helpful when the sizes for regions are really 
> different. The modification must be done in TableSnapshotInputFormatImpl 
> where the length must be computed.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to