This is an automated email from the ASF dual-hosted git repository.
wangdan pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/incubator-pegasus-website.git
The following commit(s) were added to refs/heads/master by this push:
new bbd28bc9 Update benchmark docs (#50)
bbd28bc9 is described below
commit bbd28bc9431ee13dffa88938d887aa447f430756
Author: Yingchun Lai <[email protected]>
AuthorDate: Fri Dec 22 15:03:18 2023 +0800
Update benchmark docs (#50)
---
_overview/en/benchmark.md | 291 +++++++++++++++++++++++++++++++++++++++++++++-
_overview/zh/benchmark.md | 78 ++++++++-----
2 files changed, 336 insertions(+), 33 deletions(-)
diff --git a/_overview/en/benchmark.md b/_overview/en/benchmark.md
index 2a04b7e7..cfd238e0 100755
--- a/_overview/en/benchmark.md
+++ b/_overview/en/benchmark.md
@@ -2,4 +2,293 @@
permalink: /overview/benchmark/
---
-TRANSLATING
+## Benchmark tools and configurations
+
+* Tests are driven by the Pegasus Java client in [YCSB](https://github.com/xiaomi/pegasus-ycsb)
+* Set `requestdistribution=zipfian`, i.e. a discrete probability distribution whose rank-frequency relation follows an inverse power law, see [Zipfian distribution](https://en.wiktionary.org/wiki/Zipfian_distribution#English)
+
+### Benchmark result explanation
+
+- Case: the test cases are read-only testing `Read`, write-only testing `Write`, and mixed read-write testing `Read & Write`
+- Threads: written in the form `a * b`, where `a` is the number of client instances (i.e., the number of YCSB instances) and `b` is the number of threads per instance (i.e., the value of the `threadcount` option in YCSB)
+- RW Ratio: read/write operation ratio, i.e., the ratio of `readproportion` to `updateproportion` or `insertproportion` in the YCSB options (an illustrative workload file is sketched after this list)
+- Duration: total testing time, in hours
+- R-QPS: read operations per second
+- R-AVG-Lat, R-P99-Lat, R-P999-Lat: average, P99 and P999 latency of read operations, in microseconds
+- W-QPS: write operations per second
+- W-AVG-Lat, W-P99-Lat, W-P999-Lat: average, P99 and P999 latency of write operations, in microseconds
+
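+A minimal YCSB workload sketch combining the options above (the values are illustrative only, not the exact settings used in the tests below; the property names are standard YCSB core-workload options):
+
+```properties
+# Hypothetical workload file for a Pegasus YCSB run -- example values only.
+# ~1KB single data size: one field of 1024 bytes.
+fieldcount=1
+fieldlength=1024
+# RW Ratio 1:1.
+readproportion=0.5
+updateproportion=0.5
+# Zipfian request distribution, as used in these benchmarks.
+requestdistribution=zipfian
+# The "b" in the "a * b" thread notation (threads per YCSB instance).
+threadcount=50
+```
+
+Such a file is passed to YCSB with `-P <workload-file>`; per-run overrides can be supplied with `-p key=value`.
+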
+## Benchmark on different versions
+
+### 2.4.0
+
+#### Test environment
+
+##### Hardware
+
+* CPU: Intel® Xeon® Silver 4210 * 2 2.20 GHz / 3.20 GHz
+* Memory: 128 GB
+* Disk: SSD 480 GB * 8
+* Network card: Bandwidth 10 Gb
+
+##### Cluster scale
+
+* Number of replica server nodes: 5
+* Partition count of the test table: 64
+
+#### Benchmark results
+
+- Single data size: 1KB
+
+| Case         | threads | Read/Write | R-QPS   | R-AVG-Lat | R-P99-Lat | W-QPS  | W-AVG-Lat | W-P99-Lat |
+|--------------|---------|------------|---------|-----------|-----------|--------|-----------|-----------|
+| Write        | 3 * 15  | 0:1        | -       | -         | -         | 56,953 | 787       | 1,786     |
+| Read         | 3 * 50  | 1:0        | 360,642 | 413       | 984       | -      | -         | -         |
+| Read & Write | 3 * 30  | 1:1        | 62,572  | 464       | 5,274     | 62,561 | 985       | 3,764     |
+| Read & Write | 3 * 15  | 1:3        | 16,844  | 372       | 3,980     | 50,527 | 762       | 1,551     |
+| Read & Write | 3 * 15  | 1:30       | 1,861   | 381       | 3,557     | 55,816 | 790       | 1,688     |
+| Read & Write | 3 * 30  | 3:1        | 140,484 | 351       | 3,277     | 46,822 | 856       | 2,044     |
+| Read & Write | 3 * 50  | 30:1       | 336,106 | 419       | 1,221     | 11,203 | 763       | 1,276     |
+
+### 2.3.0
+
+#### Test environment
+
+Same as 2.4.0
+
+#### Benchmark results
+
+| Case         | threads | Read/Write | R-QPS   | R-AVG-Lat | R-P99-Lat | W-QPS  | W-AVG-Lat | W-P99-Lat |
+|--------------|---------|------------|---------|-----------|-----------|--------|-----------|-----------|
+| Write        | 3 * 15  | 0:1        | -       | -         | -         | 42,386 | 1,060     | 6,628     |
+| Read         | 3 * 50  | 1:0        | 331,623 | 585       | 2,611     | -      | -         | -         |
+| Read & Write | 3 * 30  | 1:1        | 38,766  | 1,067     | 15,521    | 38,774 | 1,246     | 7,791     |
+| Read & Write | 3 * 15  | 1:3        | 13,140  | 819       | 11,460    | 39,428 | 863       | 4,884     |
+| Read & Write | 3 * 15  | 1:30       | 1,552   | 937       | 9,524     | 46,570 | 930       | 5,315     |
+| Read & Write | 3 * 30  | 3:1        | 93,746  | 623       | 6,389     | 31,246 | 996       | 5,543     |
+| Read & Write | 3 * 50  | 30:1       | 254,534 | 560       | 2,627     | 8,481  | 901       | 3,269     |
+
+### 1.12.3
+
+#### Test environment
+
+##### Hardware
+
+* CPU: Intel® Xeon® CPU E5-2620 v3 @ 2.40 GHz
+* Memory: 128 GB
+* Disk: SSD 480 GB * 8
+* Network card: Bandwidth 10 Gb
+
+Other aspects of the testing environment are the same as 2.4.0
+
+#### Benchmark results
+
+- Single data size: 320B
+
+| Case         | threads | Read/Write | R-QPS   | R-AVG-Lat | R-P99-Lat | W-QPS  | W-AVG-Lat | W-P99-Lat |
+|--------------|---------|------------|---------|-----------|-----------|--------|-----------|-----------|
+| Read & Write | 3 * 15  | 0:1        | -       | -         | -         | 41,068 | 728       | 3,439     |
+| Read & Write | 3 * 15  | 1:3        | 16,011  | 242       | 686       | 48,036 | 851       | 4,027     |
+| Read & Write | 3 * 30  | 30:1       | 279,818 | 295       | 873       | 9,326  | 720       | 3,355     |
+
+- Single data size: 1KB
+
+| Case         | threads | Read/Write | R-QPS   | R-AVG-Lat | R-P99-Lat | W-QPS  | W-AVG-Lat | W-P99-Lat |
+|--------------|---------|------------|---------|-----------|-----------|--------|-----------|-----------|
+| Read & Write | 3 * 15  | 0:1        | -       | -         | -         | 40,732 | 1,102     | 4,216     |
+| Read & Write | 3 * 15  | 1:3        | 14,355  | 476       | 2,855     | 38,547 | 1,016     | 4,135     |
+| Read & Write | 3 * 20  | 3:1        | 87,480  | 368       | 4,170     | 29,159 | 940       | 4,170     |
+| Read & Write | 3 * 50  | 1:0        | 312,244 | 479       | 1,178     | -      | -         | -         |
+
+### 1.12.2
+
+#### Test environment
+
+The testing environment is the same as 1.12.3
+
+#### Benchmark results
+
+- Single data size: 320B
+
+| Case         | threads | Read/Write | R-QPS   | R-AVG-Lat | R-P99-Lat | W-QPS  | W-AVG-Lat | W-P99-Lat |
+|--------------|---------|------------|---------|-----------|-----------|--------|-----------|-----------|
+| Read & Write | 3 * 15  | 0:1        | -       | -         | -         | 40,439 | 739       | 2,995     |
+| Read & Write | 3 * 15  | 1:3        | 16,022  | 309       | 759       | 48,078 | 830       | 3,995     |
+| Read & Write | 3 * 30  | 30:1       | 244,392 | 346       | 652       | 8,137  | 731       | 2,995     |
+
+### 1.11.6
+
+#### Test environment
+
+- Test interfaces: `multi_get()` and `batch_set()`
+- Each hashkey contains 3 sortkeys
+- A single `batch_set()` call sets 3 hashkeys
+- Single data size: 3KB
+- Partition count of the test table: 128
+- `rocksdb_block_cache_capacity = 40G`
+- Other aspects of the testing environment are the same as 1.12.3
+
+#### Benchmark results
+
+| Case         | threads | Read/Write | duration | Max cache hit rate | R-QPS | R-AVG-Lat | R-P99-Lat | R-P999-Lat | W-QPS | W-AVG-Lat | W-P99-Lat | W-P999-Lat |
+|--------------|---------|------------|----------|--------------------|-------|-----------|-----------|------------|-------|-----------|-----------|------------|
+| Read & Write | 3 * 15  | 20:1       | 1        | 10%                | 150K  | 263       | 808       | 12,615     | 8K    | 1,474     | 7,071     | 26,342     |
+| Read & Write | 3 * 7   | 20:1       | 2        | 17%                | 75K   | 226       | 641       | 5,331      | 4K    | 1,017     | 4,583     | 14,983     |
+
+### 1.11.1
+
+#### Test environment
+
+The testing environment is the same as 1.12.3
+
+#### Benchmark results
+
+- Single data size: 20KB * 2 replicas
+
+| Case         | threads | Read/Write | duration | R-QPS   | R-AVG-Lat | R-P99-Lat | W-QPS  | W-AVG-Lat | W-P99-Lat |
+|--------------|---------|------------|----------|---------|-----------|-----------|--------|-----------|-----------|
+| Write        | 3 * 10  | 0:1        | 0.78     | -       | -         | -         | 14,181 | 2,110     | 15,468    |
+| Read & Write | 3 * 15  | 1:3        | 0.52     | 4,024   | 5,209     | 41,247    | 12,069 | 1,780     | 14,495    |
+| Read & Write | 3 * 30  | 30:1       | 0.76     | 105,841 | 816       | 9,613     | 3,527  | 1,107     | 4,155     |
+| Read         | 6 * 100 | 1:0        | 1.04     | 162,150 | 1,868     | 6,733     | -      | -         | -         |
+
+- Single data size: 20KB * 3 replicas
+
+| Case         | threads | Read/Write | duration | R-QPS   | R-AVG-Lat | R-P99-Lat | W-QPS | W-AVG-Lat | W-P99-Lat |
+|--------------|---------|------------|----------|---------|-----------|-----------|-------|-----------|-----------|
+| Write        | 3 * 10  | 0:1        | 1.16     | -       | -         | -         | 9,603 | 3,115     | 20,468    |
+| Read & Write | 3 * 15  | 1:3        | 0.69     | 3,043   | 5,657     | 38,783    | 9,126 | 3,140     | 27,956    |
+| Read & Write | 3 * 30  | 30:1       | 0.89     | 90,135  | 937       | 13,324    | 3,002 | 1,185     | 4,816     |
+| Read         | 6 * 100 | 1:0        | 1.08     | 154,869 | 1,945     | 7,757     | -     | -         | -         |
+
+- Single data size: 10KB * 2 replicas
+
+| Case         | threads | Read/Write | duration | R-QPS   | R-AVG-Lat | R-P99-Lat | W-QPS  | W-AVG-Lat | W-P99-Lat |
+|--------------|---------|------------|----------|---------|-----------|-----------|--------|-----------|-----------|
+| Write        | 3 * 10  | 0:1        | 0.78     | -       | -         | -         | 14,181 | 2,110     | 15,468    |
+| Read & Write | 3 * 15  | 1:3        | 0.52     | 4,024   | 5,209     | 41,247    | 12,069 | 1,780     | 14,495    |
+| Read & Write | 3 * 30  | 30:1       | 0.76     | 105,841 | 816       | 9,613     | 3,527  | 1,107     | 4,155     |
+| Read         | 6 * 100 | 1:0        | 1.04     | 162,150 | 1,868     | 6,733     | -      | -         | -         |
+
+- Single data size: 10KB * 3 replicas
+
+| Case         | threads | Read/Write | duration | R-QPS   | R-AVG-Lat | R-P99-Lat | W-QPS | W-AVG-Lat | W-P99-Lat |
+|--------------|---------|------------|----------|---------|-----------|-----------|-------|-----------|-----------|
+| Write        | 3 * 10  | 0:1        | 1.16     | -       | -         | -         | 9,603 | 3,115     | 20,468    |
+| Read & Write | 3 * 15  | 1:3        | 0.69     | 3,043   | 5,657     | 38,783    | 9,126 | 3,140     | 27,956    |
+| Read & Write | 3 * 30  | 30:1       | 0.89     | 90,135  | 937       | 13,324    | 3,002 | 1,185     | 4,816     |
+| Read         | 6 * 100 | 1:0        | 1.08     | 154,869 | 1,945     | 7,757     | -     | -         | -         |
+
+### 1.11.0
+
+#### Test environment
+
+The testing environment is the same as 1.12.3
+
+#### Benchmark results
+
+- Single data size: 320B
+
+| Case         | threads  | Read/Write | duration | R-QPS     | R-AVG-Lat | R-P99-Lat | W-QPS  | W-AVG-Lat | W-P99-Lat |
+|--------------|----------|------------|----------|-----------|-----------|-----------|--------|-----------|-----------|
+| Write        | 3 * 10   | 0:1        | 1.89     | -         | -         | -         | 44,039 | 679       | 3,346     |
+| Read & Write | 3 * 15   | 1:3        | 1.24     | 16,690    | 311       | 892       | 50,076 | 791       | 4,396     |
+| Read & Write | 3 * 30   | 30:1       | 1.04     | 311,633   | 264       | 511       | 10,388 | 619       | 2,468     |
+| Read         | 6 * 100  | 1:0        | 0.17     | 978,884   | 623       | 1,671     | -      | -         | -         |
+| Read         | 12 * 100 | 1:0        | 0.28     | 1,194,394 | 1,003     | 2,933     | -      | -         | -         |
+
+## Benchmark in different scenarios
+
+Unless otherwise specified, the testing environment is as follows:
+* The testing environment is the same as 1.12.3
+* Single data size: 1KB
+* Client:
+  * Number of nodes: 3
+  * Version: 1.11.10-thrift-0.11.0-inlined-release
+* Server:
+  * Number of nodes: 5
+  * Version: 1.12.3
+  * Partition count of the test table: 64
+  * Configuration: `rocksdb_limiter_max_write_megabytes_per_sec = 500`, `rocksdb_limiter_enable_auto_tune = false` (see the configuration sketch below)
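+
+For reference, the two rate-limiter options above are replica server settings. A minimal sketch, assuming they are placed in the `[pegasus.server]` section of the server's config.ini (the section name and file name are assumptions, check your deployment's configuration):
+
+```ini
+[pegasus.server]
+# Cap RocksDB write (flush + compaction) rate at 500 MB/s; 0 means no limit.
+rocksdb_limiter_max_write_megabytes_per_sec = 500
+# Keep auto-tune off so the limit stays fixed at the value above.
+rocksdb_limiter_enable_auto_tune = false
+```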
+
+### Cluster throughput capability
+
+This test compares how latency changes in the same cluster under different levels of client request throughput.
+
+> NOTE: the RocksDB rate-limiter is not enabled
+
+
+
+
+
+As shown in the figure above, the maximum write throughput is about 43K and the maximum read throughput is about 370K.
+You can estimate the corresponding latency from the read/write throughput.
+
+### Whether to enable RocksDB rate-limiter
+
+> Testing scenario: `threads` set to 3 * 20, with an IPS of approximately 44K
+
+Pegasus uses RocksDB as its storage engine. As written data accumulates, more compaction operations are triggered, which occupy more disk IO and cause more long-tail latencies.
+This test demonstrates that enabling the RocksDB rate-limiter reduces the compaction load and thus significantly reduces the occurrence of long tails.
+
+The following figures show the IO usage and P99 write latency for three scenarios:
+- `rocksdb_limiter_max_write_megabytes_per_sec = 0`
+- `rocksdb_limiter_max_write_megabytes_per_sec = 500` and `rocksdb_limiter_enable_auto_tune = false`
+- `rocksdb_limiter_max_write_megabytes_per_sec = 500` and `rocksdb_limiter_enable_auto_tune = true`
+
+- Disk IO usage:
+
+
+
+
+- P99 latency:
+
+
+
+
+It can be observed that with the RocksDB rate-limiter enabled, disk IO utilization is reduced and the write-latency spikes are greatly alleviated.
+
+
+We obtained the following results from the YCSB benchmark:
+
+* With the rate-limiter enabled, throughput increased by about 5%
+* With both the rate-limiter and auto-tune enabled, throughput increased by about 20%
+* With the rate-limiter enabled, only the latency in extreme cases (P999/P9999) improves significantly; for most requests the improvement is not significant
+
+However, it should be noted that:
+
+The auto-tune function may trigger [write stall](https://github.com/facebook/rocksdb/wiki/Write-Stalls) issues in scenarios where a single record is large, so please carefully evaluate whether to enable auto-tune in your environment.
+
+### Cluster scale
+
+This test observes the impact of different numbers of replica servers on read and write throughput.
+
+> The testing cases are read-only and write-only
+
+
+
+
+
+As can be seen from the figure:
+* Write throughput improves more than read throughput when the cluster scales out
+* The throughput is not improved linearly when the cluster scales out
+
+Based on this test, you can reasonably estimate the throughput of a cluster at different scales.
+
+### Different partition count of table
+
+This test observes the impact of different partition counts of a table on performance.
+
+> The testing scenarios are:
+> Read only: `threads` configured as 3 * 50
+> Write only: `threads` configured as 3 * 40
+
+
+
+As can be seen from the figure:
+* Increasing partition count can improve read throughput
+* But it also reduces write throughput
+
+Therefore, please evaluate the partition count carefully according to your application requirements.
+
+In addition, if the partition count is too small, it may cause issues such as oversized single partitions and data skew across disks. In production environments, it is recommended to keep the size of a single partition under 10GB unless there are special requirements.
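+
+For reference, the partition count is fixed when a table is created. A minimal sketch using the Pegasus shell's `create` command (the table name and counts are illustrative, not the ones used in these tests; verify the exact syntax against your Pegasus shell version):
+
+```
+create test_table -p 64 -r 3
+```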
diff --git a/_overview/zh/benchmark.md b/_overview/zh/benchmark.md
index 22c9a1b5..868d5660 100755
--- a/_overview/zh/benchmark.md
+++ b/_overview/zh/benchmark.md
@@ -5,7 +5,7 @@ permalink: /overview/benchmark/
## 测试工具及配置
* 使用[YCSB](https://github.com/xiaomi/pegasus-ycsb)中的Pegasus Java Client进行测试
-* 读写请求的数据分布特征:zipfian,可以理解为遵守80/20原则的数据分布,即80%的访问都集中在20%的内容上
+* 设置`requestdistribution=zipfian`,可以理解为遵守80/20原则的数据分布,即80%的访问都集中在20%的内容上。参考[Zipfian distribution](https://en.wiktionary.org/wiki/Zipfian_distribution#English).
### 测试结果说明
- Case:分为只读测试`Read`、只写测试`Write`、读写混合测试`Read & Write`
@@ -13,9 +13,9 @@ permalink: /overview/benchmark/
- RW Ratio:读写操作比,即YCSB配置中的`readproportion`与`updateproportion`或`insertproportion`的比值
- duration:测试总时长,单位小时
- R-QPS:每秒读操作数
-- R-AVG-Lat,R-P99-Lat,R-P999-Lat:读操作的平均,P99,P999延迟,单位微秒
+- R-AVG-Lat, R-P99-Lat, R-P999-Lat:读操作的平均,P99,P999延迟,单位微秒
- W-QPS:每秒写操作数
-- W-AVG-Lat,W-P99-Lat,W-P999-Lat:写操作的平均,P99,P999延迟,单位微秒
+- W-AVG-Lat, W-P99-Lat, W-P999-Lat:写操作的平均,P99,P999延迟,单位微秒
## 各版本的性能测试
@@ -28,7 +28,7 @@ permalink: /overview/benchmark/
* CPU:Intel® Xeon® Silver 4210 * 2 2.20 GHz / 3.20 GHz
* 内存:128 GB
* 磁盘:SSD 480 GB * 8
-* 网卡:带宽 10Gb
+* 网卡:带宽 10 Gb
##### 集群规模
@@ -119,9 +119,9 @@ permalink: /overview/benchmark/
#### 测试环境
-- 测试接口:multi_get() 和 batch_set()
+- 测试接口:`multi_get()` 和 `batch_set()`
- 一个hashkey下包含3条sortkey数据
-- 单次batch_set()调用设置3个hashkey
+- 单次`batch_set()`调用设置3个hashkey
- 单条数据大小:3KB
- 测试表的Partition数:128
- rocksdb_block_cache_capacity = 40G
@@ -142,7 +142,7 @@ permalink: /overview/benchmark/
#### 测试结果
-- 单条数据大小:20KB * 2备份
+- 单条数据大小:20KB * 2副本
| Case | threads | Read/Write | duration | R-QPS | R-AVG-Lat | R-P99-Lat | W-QPS | W-AVG-Lat | W-P99-Lat |
|--------------|---------|------------|----------|--------|-----------|-----------|-------|-----------|-----------|
@@ -151,7 +151,7 @@ permalink: /overview/benchmark/
| Read & Write | 3 * 30 | 30:1 | 1.25 | 64,358 | 1,330 | 13,975 | 2,145 | 1,699 | 6,467 |
| Read | 6 * 100 | 1:0 | 0.91 | 30,491 | 3,274 | 12,167 | - | - | - |
-- 单条数据大小:20KB * 3备份
+- 单条数据大小:20KB * 3副本
| Case | threads | Read/Write | duration | R-QPS | R-AVG-Lat | R-P99-Lat | W-QPS | W-AVG-Lat | W-P99-Lat |
|--------------|---------|------------|----------|--------|-----------|-----------|-------|-----------|-----------|
@@ -161,7 +161,7 @@ permalink: /overview/benchmark/
| Read | 6 * 100 | 1:0 | 0.91 | 25,456 | 3,923 | 15,679 | - | - | - |
-- 单条数据大小:10KB * 2备份
+- 单条数据大小:10KB * 2副本
| Case | threads | Read/Write | duration | R-QPS | R-AVG-Lat | R-P99-Lat | W-QPS | W-AVG-Lat | W-P99-Lat |
|--------------|---------|------------|----------|---------|-----------|-----------|--------|-----------|-----------|
@@ -170,7 +170,7 @@ permalink: /overview/benchmark/
| Read & Write | 3 * 30 | 30:1 | 0.76 | 105,841 | 816 | 9613 | 3,527 | 1,107 | 4,155 |
| Read | 6 * 100 | 1:0 | 1.04 | 162,150 | 1,868 | 6733 | - | - | - |
-- 单条数据大小:10KB * 3备份
+- 单条数据大小:10KB * 3副本
| Case | threads | Read/Write | duration | R-QPS | R-AVG-Lat | R-P99-Lat | W-QPS | W-AVG-Lat | W-P99-Lat |
|--------------|---------|------------|----------|---------|-----------|-----------|-------|-----------|-----------|
@@ -200,16 +200,22 @@ permalink: /overview/benchmark/
## 不同场景下的性能测试
-如无特殊说明:
+如无特殊说明,测试环境如下:
* 测试环境同 1.12.3
* 单条数据大小:1KB
-* 客户端:节点数:3,版本号:1.11.10-thrift-0.11.0-inlined-release
-* 服务端:节点数:5,版本号:1.12.3,表分片数:64,开启rocksdb限速:max_rate=500MB, auto_tune=true
+* 客户端:
+ * 节点数:3
+ * 版本号:1.11.10-thrift-0.11.0-inlined-release
+* 服务端:
+ * 节点数:5
+ * 版本号:1.12.3
+ * 表分片数:64
+ * 配置: `rocksdb_limiter_max_write_megabytes_per_sec = 500`, `rocksdb_limiter_enable_auto_tune = false`
-### 不同的客户端线程数
+### 集群吞吐能力
-该项测试旨在对比在默认配置下,不同线程(仅读和仅写)对QPS和延迟的影响。
+该项测试旨在对比在同一集群在不同client请求吞吐下的延迟变化。
> 注意:未开启RocksDB限速
@@ -217,15 +223,18 @@ permalink: /overview/benchmark/

-由上图可以看到,在该测试场景下,写最大QPS≈43K,读最大QPS≈370K,你也可以根据QPS的大小合理估算对应的延迟。
+由上图可知,写最大QPS大约为43K,读最大QPS大约370K,你可以根据吞吐估算对应的延迟。
### 是否开启RocksDB限速
-> 测试场景为:测试`threads`配置为:3 * 20,QPS大约为44K
+> 测试场景为:测试`threads`配置为:3 * 20,IPS大约为44K
Pegasus底层采用RocksDB做存储引擎,当数据写入增多,会触发更多的compaction操作,占用更多的磁盘IO,出现更多的毛刺现象。该项测试展示了开启RocksDB的限速后,可以降低compaction负载,从而显著的降低毛刺现象。
-下图分别展示了无限速、500MB/s限速、500MB/s限速同时开启auto-tune功能,三种场景的IO使用率和写P99延迟情况:
+下图分别展示了三种场景的IO使用率和写P99延迟情况:
+- `rocksdb_limiter_max_write_megabytes_per_sec = 0`
+- `rocksdb_limiter_max_write_megabytes_per_sec = 500` and `rocksdb_limiter_enable_auto_tune = false`
+- `rocksdb_limiter_max_write_megabytes_per_sec = 500` and `rocksdb_limiter_enable_auto_tune = true`
- 磁盘IO占用:

@@ -237,22 +246,22 @@ Pegasus底层采用RocksDB做存储引擎,当数据写入增多,会触发更


-可以发现,磁盘IO使用率得到降低,相应的写延迟的毛刺现象也被大大缓解。
+可以发现,开启RocksDB限速后,磁盘IO使用率得到降低,写延迟的毛刺现象也被大大缓解。
-我们从YCSB的测试结果:

-也可以看到:
+我们从YCSB的测试结果也可以看到:
-* 开启限速,开启限速并开启auto-tune后,QPS吞吐分别约提升了5%,20%
-* 开启限速后,仅对极端情况下的延迟(P999/P9999)有显著改善作用,对于大部分请求来说,改善并不明显
+* 开启限速后,吞吐提升了约5%
+* 开启限速并开启auto-tune后,吞吐提升了约20%
+* 开启限速后,仅对极端情况下的延迟(P999/P9999)有显著改善作用,但对于大部分请求来说,改善并不明显
但是**需要注意**的是:
-auto-tune功能在单条数据较大的场景下可能会引发[write stall](https://github.com/facebook/rocksdb/wiki/Write-Stalls),请合理评估是否开启auto-tune。
+auto-tune功能在单条数据较大的场景下可能会引发[write stall](https://github.com/facebook/rocksdb/wiki/Write-Stalls),请合理评估是否在你的环境中开启auto-tune。
-### 不同的replica server数量
+### 集群规模
-该项测试旨在观察,不同机器数量对读写性能的影响。
+该项测试旨在观察,不同replica server数量对读写吞吐的影响。
> 测试场景为:测试`Case`为只读和只写
@@ -262,14 +271,14 @@ auto-tune功能在单条数据较大的场景下可能会引发[write stall](htt
由图中可以看到:
-* 扩容对写性能的提升要优于读性能的提升
-* 扩容带来的性能提升并不是线性增加的
+* 扩容对写吞吐的提升要优于读吞吐的提升
+* 扩容带来的吞吐提升并不是线性的
-你可以根据该项测试合理估计不同机器下所能承载的请求负载
+你可以根据该项测试估计不同集群规模所能承载的吞吐量
### 不同的表分片数
-该项测试旨在观察,表的不同分片对性能的影响。
+该项测试旨在观察,表的不同分片对吞吐的影响。
> 测试场景为:
> 仅读:`threads`配置为:3 * 50
@@ -277,5 +286,10 @@ auto-tune功能在单条数据较大的场景下可能会引发[write stall](htt

-由图中可以看到,增加分片可以提高读性能,但是降低了写性能,所以请合理评估你的业务需求。
-除此之外,若分片数过小,可能会导致单分片过大,磁盘分布不均的问题,在实际的线上业务中,如无特别需求,建议单分片维持在10GB以内。
+由图中可以看到:
+* 增加分片可以提高读吞吐
+* 但是降低了写吞吐
+
+所以请根据你的业务需求评估表分片数。
+
+除此之外,若分片数过小,可能会导致单分片过大,磁盘分布倾斜等问题。在生产环境中,如无特别需求,建议单分片大小保持在10GB以内。
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]