kevinw66 commented on PR #58615:
URL: https://github.com/apache/doris/pull/58615#issuecomment-3600510777
Benchmark results on a Huawei kC2 instance:
Original:
```
Running ./output/be/lib/benchmark_test
Run on (16 X 200 MHz CPU s)
CPU Caches:
L1 Data 64 KiB (x16)
L1 Instruction 64 KiB (x16)
L2 Unified 512 KiB (x16)
L3 Unified 32768 KiB (x1)
Load Average: 1.55, 8.69, 12.21
------------------------------------------------------------------------------------------------------------
Benchmark                                               Time        CPU Iterations UserCounters...
------------------------------------------------------------------------------------------------------------
BM_Bits_CountZeroNum/16/repeats:5_mean               5.44 ns    5.44 ns          5 bytes_per_second=2.74127G/s
BM_Bits_CountZeroNum/16/repeats:5_median             5.44 ns    5.44 ns          5 bytes_per_second=2.7413G/s
BM_Bits_CountZeroNum/16/repeats:5_stddev            0.000 ns   0.000 ns          5 bytes_per_second=213.303k/s
BM_Bits_CountZeroNum/16/repeats:5_cv                 0.01  %    0.01  %          5 bytes_per_second=0.01%
BM_Bits_CountZeroNum/32/repeats:5_mean               10.3 ns    10.3 ns          5 bytes_per_second=2.8914G/s
BM_Bits_CountZeroNum/32/repeats:5_median             10.3 ns    10.3 ns          5 bytes_per_second=2.89148G/s
BM_Bits_CountZeroNum/32/repeats:5_stddev            0.001 ns   0.001 ns          5 bytes_per_second=236.223k/s
BM_Bits_CountZeroNum/32/repeats:5_cv                 0.01  %    0.01  %          5 bytes_per_second=0.01%
BM_Bits_CountZeroNum/64/repeats:5_mean               17.3 ns    17.3 ns          5 bytes_per_second=3.4399G/s
BM_Bits_CountZeroNum/64/repeats:5_median             17.3 ns    17.3 ns          5 bytes_per_second=3.44002G/s
BM_Bits_CountZeroNum/64/repeats:5_stddev            0.002 ns   0.002 ns          5 bytes_per_second=420.005k/s
BM_Bits_CountZeroNum/64/repeats:5_cv                 0.01  %    0.01  %          5 bytes_per_second=0.01%
BM_Bits_CountZeroNum/256/repeats:5_mean              59.2 ns    59.2 ns          5 bytes_per_second=4.03016G/s
BM_Bits_CountZeroNum/256/repeats:5_median            59.2 ns    59.2 ns          5 bytes_per_second=4.03013G/s
BM_Bits_CountZeroNum/256/repeats:5_stddev           0.003 ns   0.003 ns          5 bytes_per_second=214.616k/s
BM_Bits_CountZeroNum/256/repeats:5_cv                0.01  %    0.01  %          5 bytes_per_second=0.01%
BM_Bits_CountZeroNum/1024/repeats:5_mean              227 ns     226 ns          5 bytes_per_second=4.2107G/s
BM_Bits_CountZeroNum/1024/repeats:5_median            227 ns     226 ns          5 bytes_per_second=4.21062G/s
BM_Bits_CountZeroNum/1024/repeats:5_stddev          0.014 ns   0.014 ns          5 bytes_per_second=264.002k/s
BM_Bits_CountZeroNum/1024/repeats:5_cv               0.01  %    0.01  %          5 bytes_per_second=0.01%
BM_Bits_CountZeroNumNullMap/16/repeats:5_mean        3.40 ns    3.40 ns          5 bytes_per_second=4.38851G/s
BM_Bits_CountZeroNumNullMap/16/repeats:5_median      3.39 ns    3.39 ns          5 bytes_per_second=4.39511G/s
BM_Bits_CountZeroNumNullMap/16/repeats:5_stddev     0.011 ns   0.011 ns          5 bytes_per_second=15.1472M/s
BM_Bits_CountZeroNumNullMap/16/repeats:5_cv          0.34  %    0.34  %          5 bytes_per_second=0.34%
BM_Bits_CountZeroNumNullMap/32/repeats:5_mean        7.29 ns    7.29 ns          5 bytes_per_second=4.09028G/s
BM_Bits_CountZeroNumNullMap/32/repeats:5_median      7.29 ns    7.29 ns          5 bytes_per_second=4.08916G/s
BM_Bits_CountZeroNumNullMap/32/repeats:5_stddev     0.005 ns   0.005 ns          5 bytes_per_second=2.76443M/s
BM_Bits_CountZeroNumNullMap/32/repeats:5_cv          0.07  %    0.07  %          5 bytes_per_second=0.07%
BM_Bits_CountZeroNumNullMap/64/repeats:5_mean        10.4 ns    10.4 ns          5 bytes_per_second=5.7434G/s
BM_Bits_CountZeroNumNullMap/64/repeats:5_median      10.4 ns    10.4 ns          5 bytes_per_second=5.74335G/s
BM_Bits_CountZeroNumNullMap/64/repeats:5_stddev     0.001 ns   0.001 ns          5 bytes_per_second=668.068k/s
BM_Bits_CountZeroNumNullMap/64/repeats:5_cv          0.01  %    0.01  %          5 bytes_per_second=0.01%
BM_Bits_CountZeroNumNullMap/256/repeats:5_mean       29.9 ns    29.9 ns          5 bytes_per_second=7.98586G/s
BM_Bits_CountZeroNumNullMap/256/repeats:5_median     29.9 ns    29.9 ns          5 bytes_per_second=7.98601G/s
BM_Bits_CountZeroNumNullMap/256/repeats:5_stddev    0.006 ns   0.006 ns          5 bytes_per_second=1.59699M/s
BM_Bits_CountZeroNumNullMap/256/repeats:5_cv         0.02  %    0.02  %          5 bytes_per_second=0.02%
BM_Bits_CountZeroNumNullMap/1024/repeats:5_mean       107 ns     107 ns          5 bytes_per_second=8.89231G/s
BM_Bits_CountZeroNumNullMap/1024/repeats:5_median     107 ns     107 ns          5 bytes_per_second=8.8917G/s
BM_Bits_CountZeroNumNullMap/1024/repeats:5_stddev   0.016 ns   0.016 ns          5 bytes_per_second=1.34061M/s
BM_Bits_CountZeroNumNullMap/1024/repeats:5_cv        0.01  %    0.01  %          5 bytes_per_second=0.01%
```
Neon:
```
Running ./output/be/lib/benchmark_test
Run on (16 X 200 MHz CPU s)
CPU Caches:
L1 Data 64 KiB (x16)
L1 Instruction 64 KiB (x16)
L2 Unified 512 KiB (x16)
L3 Unified 32768 KiB (x1)
Load Average: 0.08, 0.98, 3.99
------------------------------------------------------------------------------------------------------------
Benchmark                                               Time        CPU Iterations UserCounters...
------------------------------------------------------------------------------------------------------------
BM_Bits_CountZeroNum/16/repeats:5_mean               5.44 ns    5.44 ns          5 bytes_per_second=2.74146G/s
BM_Bits_CountZeroNum/16/repeats:5_median             5.44 ns    5.44 ns          5 bytes_per_second=2.74143G/s
BM_Bits_CountZeroNum/16/repeats:5_stddev            0.001 ns   0.001 ns          5 bytes_per_second=337.724k/s
BM_Bits_CountZeroNum/16/repeats:5_cv                 0.01  %    0.01  %          5 bytes_per_second=0.01%
BM_Bits_CountZeroNum/32/repeats:5_mean               10.3 ns    10.3 ns          5 bytes_per_second=2.89053G/s
BM_Bits_CountZeroNum/32/repeats:5_median             10.3 ns    10.3 ns          5 bytes_per_second=2.89058G/s
BM_Bits_CountZeroNum/32/repeats:5_stddev            0.000 ns   0.000 ns          5 bytes_per_second=107.791k/s
BM_Bits_CountZeroNum/32/repeats:5_cv                 0.00  %    0.00  %          5 bytes_per_second=0.00%
BM_Bits_CountZeroNum/64/repeats:5_mean               1.47 ns    1.47 ns          5 bytes_per_second=40.5144G/s
BM_Bits_CountZeroNum/64/repeats:5_median             1.47 ns    1.47 ns          5 bytes_per_second=40.5145G/s
BM_Bits_CountZeroNum/64/repeats:5_stddev            0.000 ns   0.000 ns          5 bytes_per_second=3.31223M/s
BM_Bits_CountZeroNum/64/repeats:5_cv                 0.01  %    0.01  %          5 bytes_per_second=0.01%
BM_Bits_CountZeroNum/256/repeats:5_mean              4.71 ns    4.71 ns          5 bytes_per_second=50.6679G/s
BM_Bits_CountZeroNum/256/repeats:5_median            4.65 ns    4.65 ns          5 bytes_per_second=51.2558G/s
BM_Bits_CountZeroNum/256/repeats:5_stddev           0.128 ns   0.128 ns          5 bytes_per_second=1.32569G/s
BM_Bits_CountZeroNum/256/repeats:5_cv                2.71  %    2.71  %          5 bytes_per_second=2.62%
BM_Bits_CountZeroNum/1024/repeats:5_mean             18.4 ns    18.4 ns          5 bytes_per_second=51.7001G/s
BM_Bits_CountZeroNum/1024/repeats:5_median           18.4 ns    18.4 ns          5 bytes_per_second=51.9428G/s
BM_Bits_CountZeroNum/1024/repeats:5_stddev          0.201 ns   0.201 ns          5 bytes_per_second=569.203M/s
BM_Bits_CountZeroNum/1024/repeats:5_cv               1.09  %    1.09  %          5 bytes_per_second=1.08%
BM_Bits_CountZeroNumNullMap/16/repeats:5_mean        3.39 ns    3.39 ns          5 bytes_per_second=4.3946G/s
BM_Bits_CountZeroNumNullMap/16/repeats:5_median      3.39 ns    3.39 ns          5 bytes_per_second=4.39535G/s
BM_Bits_CountZeroNumNullMap/16/repeats:5_stddev     0.002 ns   0.002 ns          5 bytes_per_second=2.13898M/s
BM_Bits_CountZeroNumNullMap/16/repeats:5_cv          0.05  %    0.05  %          5 bytes_per_second=0.05%
BM_Bits_CountZeroNumNullMap/32/repeats:5_mean        7.28 ns    7.28 ns          5 bytes_per_second=4.09179G/s
BM_Bits_CountZeroNumNullMap/32/repeats:5_median      7.29 ns    7.29 ns          5 bytes_per_second=4.0897G/s
BM_Bits_CountZeroNumNullMap/32/repeats:5_stddev     0.007 ns   0.007 ns          5 bytes_per_second=3.75246M/s
BM_Bits_CountZeroNumNullMap/32/repeats:5_cv          0.09  %    0.09  %          5 bytes_per_second=0.09%
BM_Bits_CountZeroNumNullMap/64/repeats:5_mean        2.65 ns    2.65 ns          5 bytes_per_second=22.46G/s
BM_Bits_CountZeroNumNullMap/64/repeats:5_median      2.65 ns    2.65 ns          5 bytes_per_second=22.4601G/s
BM_Bits_CountZeroNumNullMap/64/repeats:5_stddev     0.000 ns   0.000 ns          5 bytes_per_second=733.167k/s
BM_Bits_CountZeroNumNullMap/64/repeats:5_cv          0.00  %    0.00  %          5 bytes_per_second=0.00%
BM_Bits_CountZeroNumNullMap/256/repeats:5_mean       8.94 ns    8.94 ns          5 bytes_per_second=26.6831G/s
BM_Bits_CountZeroNumNullMap/256/repeats:5_median     8.89 ns    8.89 ns          5 bytes_per_second=26.8247G/s
BM_Bits_CountZeroNumNullMap/256/repeats:5_stddev    0.109 ns   0.109 ns          5 bytes_per_second=327.742M/s
BM_Bits_CountZeroNumNullMap/256/repeats:5_cv         1.22  %    1.22  %          5 bytes_per_second=1.20%
BM_Bits_CountZeroNumNullMap/1024/repeats:5_mean      34.1 ns    34.1 ns          5 bytes_per_second=27.9668G/s
BM_Bits_CountZeroNumNullMap/1024/repeats:5_median    33.9 ns    33.9 ns          5 bytes_per_second=28.1313G/s
BM_Bits_CountZeroNumNullMap/1024/repeats:5_stddev   0.462 ns   0.462 ns          5 bytes_per_second=380.851M/s
BM_Bits_CountZeroNumNullMap/1024/repeats:5_cv        1.35  %    1.35  %          5 bytes_per_second=1.33%
```
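To compare the two runs, the mean times can be turned into speedup ratios. The snippet below is only a sketch for reading the numbers: the values are copied from the `_mean` rows above, while the names `MEAN_NS`, `speedup`, and `SPEEDUPS` are illustrative and not part of the PR. The two runs were taken under different load averages (1.55 vs 0.08), so the ratios should be read as approximate.

```python
# Mean wall-clock times in ns, copied from the *_mean rows of the two runs.
# Keys are (benchmark name, input size); values are (original_ns, neon_ns).
# The names used here are illustrative and do not come from the PR.
MEAN_NS = {
    ("BM_Bits_CountZeroNum", 16): (5.44, 5.44),
    ("BM_Bits_CountZeroNum", 32): (10.3, 10.3),
    ("BM_Bits_CountZeroNum", 64): (17.3, 1.47),
    ("BM_Bits_CountZeroNum", 256): (59.2, 4.71),
    ("BM_Bits_CountZeroNum", 1024): (227.0, 18.4),
    ("BM_Bits_CountZeroNumNullMap", 16): (3.40, 3.39),
    ("BM_Bits_CountZeroNumNullMap", 32): (7.29, 7.28),
    ("BM_Bits_CountZeroNumNullMap", 64): (10.4, 2.65),
    ("BM_Bits_CountZeroNumNullMap", 256): (29.9, 8.94),
    ("BM_Bits_CountZeroNumNullMap", 1024): (107.0, 34.1),
}


def speedup(original_ns, neon_ns):
    """Return how many times faster the Neon run is than the original run."""
    return original_ns / neon_ns


# Speedup per (benchmark, size), in the same order as the benchmark output.
SPEEDUPS = {key: speedup(*times) for key, times in MEAN_NS.items()}

if __name__ == "__main__":
    for (name, size), ratio in SPEEDUPS.items():
        print(f"{name}/{size}: {ratio:.2f}x")
```

Read this way, sizes 16 and 32 are essentially unchanged, while from size 64 upward `BM_Bits_CountZeroNum` is roughly 12x faster and `BM_Bits_CountZeroNumNullMap` roughly 3x to 4x faster.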
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]