[ 
https://issues.apache.org/jira/browse/HBASE-1861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12792031#action_12792031
 ] 

stack commented on HBASE-1861:
------------------------------

Not to my knowledge.   Thinking on it, this case is a little tougher than the 
single family case. 

1. In single family case, we just write single files and read the file metadata 
to create region (We extract from the file its start and end rows and use these 
conjuring the region description).  In the multiple family case, somehow you'll 
have to tie all files in a region together -- perhaps in metadata or with a 
file suffix or prefix.  I was thinking that you'd keep a running tab on the 
size of the file in each family and then as soon as any one file went over the 
region maximum file size limit, you'd rotate all files.
2. The loadtables.rb script would need to change to read across all files in a 
region to find the least first row and the maximum last row by looking at all 
file metadatas. 

If you want to discuss this issue more, put up some questions and I'll have a 
stab at them.  Thanks.

> Multi-Family support for bulk upload tools (HFileOutputFormat / loadtable.rb)
> -----------------------------------------------------------------------------
>
>                 Key: HBASE-1861
>                 URL: https://issues.apache.org/jira/browse/HBASE-1861
>             Project: Hadoop HBase
>          Issue Type: Improvement
>          Components: mapreduce
>    Affects Versions: 0.20.0
>            Reporter: Jonathan Gray
>             Fix For: 0.21.0
>
>
> Add multi-family support to bulk upload tools from HBASE-48.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to