[
https://issues.apache.org/jira/browse/HBASE-3329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12970328#action_12970328
]
ryan rawson commented on HBASE-3329:
------------------------------------
When we split hlog we can just stream and do a straightforward split
into multiple output files.
To turn those hlogs into hfiles we will need to _sort_, and typically
this is done in memory. There is a bunch of code about memory
accounting in the regionserver to make this not fail.
Wouldn't we just be centralizing and possibly bottlenecking the sort
of the hlog to 1 machine when previously it was done by all machines?
> HLog splitting after RS/cluster death should directly create HFiles
> -------------------------------------------------------------------
>
> Key: HBASE-3329
> URL: https://issues.apache.org/jira/browse/HBASE-3329
> Project: HBase
> Issue Type: Bug
> Components: regionserver
> Reporter: Karthik Ranganathan
>
> After a RS dies or the cluster goes down and we are recovering, we first
> split HLogs into the logs for the regions. Then the region servers that host
> the regions replay the logs and open the regions.
> This can be made more efficient by directly creating HFiles from the HLogs
> (instead of producing a split HLogs file).
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.