[ 
https://issues.apache.org/jira/browse/HBASE-7006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13632172#comment-13632172
 ] 

Jeffrey Zhong commented on HBASE-7006:
--------------------------------------

{quote}
 it sounds we trade disk io to network io
{quote}
No, we cut both disk io and network ios relating to recovered.edits files 
creations & deletions.

Currently we replay the wal to the destination region server while in old way 
the destination RS reads recovered edits from underlying hdfs. In terms of 
network io, they're same because the old way still need read recovered edits 
file across wire. The difference is that in distributed replay wal edits are 
pushed to the destination RS while the old way is pulling edits from recovered 
edits(which are intermediate files). 

In summary, the IOs related to recovered.edits files are all gone without any 
extra IOs. I think this question is common and I'll include this in the write 
up.

{quote}
Suppose a region server failed again in the middle, does a split worker need to 
split the WAL again? This means a WAL may be read/split multiple times
{quote}
We handle sequential RS failures like a new RS failure and replay its WALs left 
behind.  We may read a WAL multiple times in sequential failures but not replay 
multiple times if edits are flushed.  

{quote}
In the attached performance testing, do we have a breakdown on how many time it 
spends on reading the log file, writing to the recovered edits file? How did 
you measure the log splitting time?
{quote}
I don't have the break down since reading and writing happen at the same time. 
In normal cases, writing finish several secs after reading is done. We have 
metrics in splitlogmanager which measures the total splitting time and that's 
what I used in the testing. 

The latest combined patch is attached in 7837.

                
> [MTTR] Study distributed log splitting to see how we can make it faster
> -----------------------------------------------------------------------
>
>                 Key: HBASE-7006
>                 URL: https://issues.apache.org/jira/browse/HBASE-7006
>             Project: HBase
>          Issue Type: Bug
>          Components: MTTR
>            Reporter: stack
>            Assignee: Jeffrey Zhong
>            Priority: Critical
>             Fix For: 0.95.1
>
>         Attachments: LogSplitting Comparison.pdf, 
> ProposaltoimprovelogsplittingprocessregardingtoHBASE-7006.pdf
>
>
> Just saw interesting issue where a cluster went down  hard and 30 nodes had 
> 1700 WALs to replay.  Replay took almost an hour.  It looks like it could run 
> faster that much of the time is spent zk'ing and nn'ing.
> Putting in 0.96 so it gets a look at least.  Can always punt.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to