[ 
https://issues.apache.org/jira/browse/HBASE-3329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12970328#action_12970328
 ] 

ryan rawson commented on HBASE-3329:
------------------------------------

When we split hlog we can just stream and do a straightforward split
into multiple output files.

To turn those hlogs into hfiles we will need to _sort_, and typically
this is done in memory.  There is a bunch of code about memory
accounting in the regionserver to make this not fail.

Wouldn't we just be centralizing and possibly bottlenecking the sort
of the hlog to 1 machine when previously it was done by all machines?


> HLog splitting after RS/cluster death should directly create HFiles
> -------------------------------------------------------------------
>
>                 Key: HBASE-3329
>                 URL: https://issues.apache.org/jira/browse/HBASE-3329
>             Project: HBase
>          Issue Type: Bug
>          Components: regionserver
>            Reporter: Karthik Ranganathan
>
> After a RS dies or the cluster goes down and we are recovering, we first 
> split HLogs into the logs for the regions. Then the region servers that host 
> the regions replay the logs and open the regions.
> This can be made more efficient by directly creating HFiles from the HLogs 
> (instead of producing a split HLogs file).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to