[ 
https://issues.apache.org/jira/browse/HBASE-2922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12900903#action_12900903
 ] 

HBase Review Board commented on HBASE-2922:
-------------------------------------------

Message from: "Jean-Daniel Cryans" <[email protected]>

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
http://review.cloudera.org/r/694/
-----------------------------------------------------------

(Updated 2010-08-20 15:51:17.366257)


Review request for hbase.


Changes
-------

New patch, I decided to dig even more into this problem as I'm thinking that 
what happens in the updateLock should be as quick as possible. The difference 
with the previous patch is that the new HLog is created before we take the 
updateLock, as well as all the preparation surrounding that. Then I tested on a 
randomWrite 4 + 6 incrementing threads on a single RS that was instrumented to 
sysout the time it took to go through the sync block in rollWriter.

Without the patch:
Test duration: 406835ms
Total sync block duration: 56048ms
Percentage of the time updates were blocked: 13.7%
Average time in sync block: 610ms
Median time in sync block: 37ms

With the patch:
Test duration: 383503ms
Total sync block duration: 14072ms
Percentage of the time updates were blocked: 3.6%
Average time in sync block: 142ms
Median time in sync block: 3ms

Comments:
I was surprised by how closing a HLog could take a wildly varying time. In both 
cases I saw multi-seconds to close a writer (as you can tell by the averages 
compared to medians). In any case, my patch makes it much faster and less 
blocking. Your mileage may vary.


Summary
-------

Simply moves the LogActionListeners and outputfiles handling out of the 
updateLock synchronization.


This addresses bug HBASE-2922.
    http://issues.apache.org/jira/browse/HBASE-2922


Diffs (updated)
-----

  /trunk/src/main/java/org/apache/hadoop/hbase/regionserver/wal/HLog.java 
987355 

Diff: http://review.cloudera.org/r/694/diff


Testing
-------

Unit tests and some PEs.


Thanks,

Jean-Daniel




> HLog cleanup is done under the updateLock, major slowdown
> ---------------------------------------------------------
>
>                 Key: HBASE-2922
>                 URL: https://issues.apache.org/jira/browse/HBASE-2922
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 0.20.6, 0.89.20100621
>            Reporter: Jean-Daniel Cryans
>
> Something I've seen quite often in our production environment:
> {quote}
> 2010-08-16 16:17:27,104 INFO org.apache.hadoop.hbase.regionserver.HLog: 
> removing old hlog file 
> /hbase/.logs/rs22,60020,1280909840873/hlog.dat.1282000385321 whose highest 
> sequence/edit id is 64837079950
> 2010-08-16 16:17:27,286 INFO org.apache.hadoop.hbase.regionserver.HLog: 
> removing old hlog file 
> /hbase/.logs/rs22,60020,1280909840873/hlog.dat.1282000392770 whose highest 
> sequence/edit id is 64837088260
> 2010-08-16 16:17:27,452 INFO org.apache.hadoop.hbase.regionserver.HLog: 
> removing old hlog file 
> /hbase/.logs/rs22,60020,1280909840873/hlog.dat.1282000399300 whose highest 
> sequence/edit id is 64837096566
> 2010-08-16 16:17:27,635 INFO org.apache.hadoop.hbase.regionserver.HLog: 
> removing old hlog file 
> /hbase/.logs/rs22,60020,1280909840873/hlog.dat.1282000406997 whose highest 
> sequence/edit id is 64837104865
> 2010-08-16 16:17:27,827 INFO org.apache.hadoop.hbase.regionserver.HLog: 
> removing old hlog file 
> /hbase/.logs/rs22,60020,1280909840873/hlog.dat.1282000413803 whose highest 
> sequence/edit id is 64837113153
> 2010-08-16 16:17:27,993 INFO org.apache.hadoop.hbase.regionserver.HLog: 
> removing old hlog file 
> /hbase/.logs/rs22,60020,1280909840873/hlog.dat.1282000421709 whose highest 
> sequence/edit id is 64837121467
> 2010-08-16 16:17:28,160 INFO org.apache.hadoop.hbase.regionserver.HLog: 
> removing old hlog file 
> /hbase/.logs/rs22,60020,1280909840873/hlog.dat.1282000427333 whose highest 
> sequence/edit id is 64837129775
> 2010-08-16 16:17:28,432 INFO org.apache.hadoop.hbase.regionserver.HLog: 
> removing old hlog file 
> /hbase/.logs/rs22,60020,1280909840873/hlog.dat.1282000434365 whose highest 
> sequence/edit id is 64837138074
> 2010-08-16 16:17:28,518 INFO org.apache.hadoop.hbase.regionserver.HLog: 
> removing old hlog file 
> /hbase/.logs/rs22,60020,1280909840873/hlog.dat.1282000440347 whose highest 
> sequence/edit id is 64837146376
> 2010-08-16 16:17:28,612 WARN org.apache.hadoop.hbase.regionserver.HLog: IPC 
> Server handler 39 on 60020 took 1801ms appending an edit to hlog; editcount=0
> 2010-08-16 16:17:28,615 WARN org.apache.hadoop.hbase.regionserver.HLog: IPC 
> Server handler 37 on 60020 took 1804ms appending an edit to hlog; editcount=1
> 2010-08-16 16:17:28,615 WARN org.apache.hadoop.hbase.regionserver.HLog: IPC 
> Server handler 25 on 60020 took 1805ms appending an edit to hlog; editcount=2
> ...
> 2010-08-16 16:17:28,619 WARN org.apache.hadoop.hbase.regionserver.HLog: IPC 
> Server handler 41 on 60020 took 1875ms appending an edit to hlog; editcount=50
> 2010-08-16 16:17:28,619 WARN org.apache.hadoop.hbase.regionserver.HLog: IPC 
> Server handler 24 on 60020 took 1876ms appending an edit to hlog; editcount=51
> 2010-08-16 16:17:28,619 WARN org.apache.hadoop.hbase.regionserver.HLog: IPC 
> Server handler 48 on 60020 took 1881ms appending an edit to hlog; editcount=54
> {quote}
> And looking at HLog.rollWriter, we roll then cleanup those unused hlog files 
> under updateLock, which blocks all the appenders (as shown). We should only 
> do the first part under that lock

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to