[ https://issues.apache.org/jira/browse/HBASE-15265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15184500#comment-15184500 ]
Duo Zhang commented on HBASE-15265: ----------------------------------- Run WALPE with 5 threads {noformat} ./bin/hbase org.apache.hadoop.hbase.wal.WALPerformanceEvaluation -threads 5 {noformat} master {noformat} -- Histograms ------------------------------------------------------------------ org.apache.hadoop.hbase.wal.WALPerformanceEvaluation.latencyHistogram.nanos count = 5000000 min = 452200 max = 1834676 mean = 723087.74 stddev = 125994.96 median = 703103.00 75% <= 745173.00 95% <= 920767.00 98% <= 1148129.00 99% <= 1288697.00 99.9% <= 1554315.00 org.apache.hadoop.hbase.wal.WALPerformanceEvaluation.syncCountHistogram.countPerSync count = 4571938 min = 1 max = 2 mean = 1.06 stddev = 0.23 median = 1.00 75% <= 1.00 95% <= 2.00 98% <= 2.00 99% <= 2.00 99.9% <= 2.00 org.apache.hadoop.hbase.wal.WALPerformanceEvaluation.syncHistogram.nanos-between-syncs count = 4571938 min = 415874 max = 1571407 mean = 693891.09 stddev = 107159.47 median = 673660.00 75% <= 717641.00 95% <= 896732.00 98% <= 992794.00 99% <= 1098338.00 99.9% <= 1518093.00 -- Meters ---------------------------------------------------------------------- org.apache.hadoop.hbase.wal.WALPerformanceEvaluation.appendMeter.bytes count = 2785000000 mean rate = 3354500.98 events/second 1-minute rate = 3613345.23 events/second 5-minute rate = 3254224.23 events/second 15-minute rate = 2049616.71 events/second org.apache.hadoop.hbase.wal.WALPerformanceEvaluation.syncMeter.syncs count = 4571938 mean rate = 5506.81 events/second 1-minute rate = 6090.84 events/second 5-minute rate = 5397.76 events/second 15-minute rate = 3376.47 events/second 2016-03-08 13:59:10,692 INFO [t4,r0] wal.WALPerformanceEvaluation: t4,r0 took 809.430s 1235.437ops/s 2016-03-08 13:59:10,730 INFO [t0,r0] wal.WALPerformanceEvaluation: t0,r0 took 809.469s 1235.378ops/s 2016-03-08 13:59:10,925 INFO [t3,r0] wal.WALPerformanceEvaluation: t3,r0 took 809.664s 1235.080ops/s 2016-03-08 13:59:11,164 INFO [t2,r0] wal.WALPerformanceEvaluation: t2,r0 took 809.903s 1234.716ops/s 2016-03-08 13:59:11,232 INFO [t1,r0] wal.WALPerformanceEvaluation: t1,r0 took 809.972s 1234.611ops/s 2016-03-08 13:59:11,233 INFO [main] wal.WALPerformanceEvaluation: Summary: threads=5, iterations=1000000, syncInterval=0 took 809.973s 6173.045ops/s {noformat} patched {noformat} -- Histograms ------------------------------------------------------------------ org.apache.hadoop.hbase.wal.WALPerformanceEvaluation.latencyHistogram.nanos count = 5000000 min = 67653 max = 1211514 mean = 355929.64 stddev = 100616.20 median = 338680.00 75% <= 386992.00 95% <= 466571.00 98% <= 678368.00 99% <= 767800.00 99.9% <= 1174408.00 org.apache.hadoop.hbase.wal.WALPerformanceEvaluation.syncCountHistogram.countPerSync count = 2982546 min = 0 max = 4 mean = 1.63 stddev = 0.74 median = 2.00 75% <= 2.00 95% <= 3.00 98% <= 4.00 99% <= 4.00 99.9% <= 4.00 org.apache.hadoop.hbase.wal.WALPerformanceEvaluation.syncHistogram.nanos-between-syncs count = 2982546 min = 180697 max = 57874230 mean = 375459.18 stddev = 1551439.38 median = 310104.00 75% <= 349804.00 95% <= 469386.00 98% <= 755095.00 99% <= 1064668.00 99.9% <= 1679322.00 -- Meters ---------------------------------------------------------------------- org.apache.hadoop.hbase.wal.WALPerformanceEvaluation.appendMeter.bytes count = 2785000000 mean rate = 6784702.13 events/second 1-minute rate = 6957392.69 events/second 5-minute rate = 5188375.04 events/second 15-minute rate = 2511006.30 events/second org.apache.hadoop.hbase.wal.WALPerformanceEvaluation.syncMeter.syncs count = 2982546 mean rate = 7265.86 events/second 1-minute rate = 7499.49 events/second 5-minute rate = 5574.20 events/second 15-minute rate = 2692.50 events/second 2016-03-08 14:06:59,454 INFO [t3,r0] wal.WALPerformanceEvaluation: t3,r0 took 386.169s 2589.540ops/s 2016-03-08 14:06:59,997 INFO [t2,r0] wal.WALPerformanceEvaluation: t2,r0 took 386.712s 2585.904ops/s 2016-03-08 14:07:00,037 INFO [t0,r0] wal.WALPerformanceEvaluation: t0,r0 took 386.752s 2585.636ops/s 2016-03-08 14:07:01,290 INFO [t4,r0] wal.WALPerformanceEvaluation: t4,r0 took 388.004s 2577.293ops/s 2016-03-08 14:07:01,513 INFO [t1,r0] wal.WALPerformanceEvaluation: t1,r0 took 388.227s 2575.813ops/s 2016-03-08 14:07:01,513 INFO [main] wal.WALPerformanceEvaluation: Summary: threads=5, iterations=1000000, syncInterval=0 took 388.228s 12879.030ops/s {noformat} Still 2x faster :) 12879.030ops/s vs. 6173.045ops/s And one more thing, in the 5 threads test, it prints a lot of slow sync logs. For master, the latency is usually 300~400ms, and for the patched version, it is usually 100~150ms. I do not know why there will be slow sync, maybe something related to the config of datanode. But at least, the fan out mechanism can reduce about 3x of the slow sync latency. > Implement an asynchronous FSHLog > -------------------------------- > > Key: HBASE-15265 > URL: https://issues.apache.org/jira/browse/HBASE-15265 > Project: HBase > Issue Type: Sub-task > Components: wal > Reporter: Duo Zhang > Assignee: Duo Zhang > Fix For: 2.0.0 > > Attachments: HBASE-15265-v1.patch, HBASE-15265-v2.patch, > HBASE-15265-v3.patch, HBASE-15265-v4.patch, HBASE-15265-v5.patch, > HBASE-15265-v6.patch, HBASE-15265.patch > > -- This message was sent by Atlassian JIRA (v6.3.4#6332)