cyb70289 commented on pull request #8671:
URL: https://github.com/apache/arrow/pull/8671#issuecomment-727702783
Tested on `Xeon(R) Gold 5218 CPU @ 2.30GHz` (skylake).
gcc-7.5 has big improvement.
```
benchmark baseline
contender change %
7 GreaterArrayScalarInt64/32768/1 584.172m items/sec 1.455b
items/sec 149.105 {'null_percent': 100.0}
16 GreaterArrayScalarInt64/32768/0 585.510m items/sec 1.457b
items/sec 148.920 {'null_percent': 0.0}
5 GreaterArrayScalarInt64/32768/2 583.826m items/sec 1.451b
items/sec 148.611 {'null_percent': 50.0}
6 GreaterArrayScalarInt64/32768/10 583.846m items/sec 1.449b
items/sec 148.178 {'null_percent': 10.0}
2 GreaterArrayScalarInt64/32768/10000 584.545m items/sec 1.448b
items/sec 147.702 {'null_percent': 0.01}
15 GreaterArrayScalarInt64/32768/100 585.013m items/sec 1.446b
items/sec 147.107 {'null_percent': 1.0}
0 GreaterArrayArrayInt64/32768/0 396.250m items/sec 921.516m
items/sec 132.559 {'null_percent': 0.0}
8 GreaterArrayArrayInt64/32768/1 395.204m items/sec 917.636m
items/sec 132.193 {'null_percent': 100.0}
19 GreaterArrayArrayInt64/32768/100 394.544m items/sec 913.442m
items/sec 131.518 {'null_percent': 1.0}
20 GreaterArrayArrayInt64/32768/10000 392.995m items/sec 908.788m
items/sec 131.247 {'null_percent': 0.01}
14 GreaterArrayArrayInt64/32768/2 394.104m items/sec 909.476m
items/sec 130.771 {'null_percent': 50.0}
11 GreaterArrayArrayInt64/32768/10 394.966m items/sec 898.748m
items/sec 127.551 {'null_percent': 10.0}
10 GreaterArrayScalarString/32768/1 884.277m items/sec 956.126m
items/sec 8.125 {'null_percent': 100.0}
1 GreaterArrayArrayString/32768/1 305.843m items/sec 302.862m
items/sec -0.975 {'null_percent': 100.0}
21 GreaterArrayScalarString/32768/2 278.355m items/sec 273.676m
items/sec -1.681 {'null_percent': 50.0}
22 GreaterArrayArrayString/32768/2 140.984m items/sec 137.974m
items/sec -2.135 {'null_percent': 50.0}
12 GreaterArrayArrayString/32768/100 103.022m items/sec 98.360m
items/sec -4.525 {'null_percent': 1.0}
4 GreaterArrayArrayString/32768/10 104.032m items/sec 99.117m
items/sec -4.724 {'null_percent': 10.0}
13 GreaterArrayArrayString/32768/0 103.551m items/sec 98.652m
items/sec -4.731 {'null_percent': 0.0}
3 GreaterArrayArrayString/32768/10000 103.255m items/sec 97.749m
items/sec -5.332 {'null_percent': 0.01}
23 GreaterArrayScalarString/32768/10000 964.903m items/sec 912.461m
items/sec -5.435 {'null_percent': 0.01}
18 GreaterArrayScalarString/32768/100 924.999m items/sec 874.072m
items/sec -5.506 {'null_percent': 1.0}
17 GreaterArrayScalarString/32768/0 973.155m items/sec 917.459m
items/sec -5.723 {'null_percent': 0.0}
9 GreaterArrayScalarString/32768/10 701.792m items/sec 630.090m
items/sec -10.217 {'null_percent': 10.0}
```
clang-9 no benefit.
```
benchmark baseline
contender change %
3 GreaterArrayScalarInt64/32768/1 2.863b items/sec 2.891b
items/sec 0.977 {'null_percent': 100.0}
12 GreaterArrayScalarInt64/32768/10 2.845b items/sec 2.860b
items/sec 0.548 {'null_percent': 10.0}
9 GreaterArrayArrayInt64/32768/1 1.959b items/sec 1.967b
items/sec 0.381 {'null_percent': 100.0}
15 GreaterArrayArrayString/32768/0 118.703m items/sec 118.738m
items/sec 0.030 {'null_percent': 0.0}
20 GreaterArrayArrayString/32768/10000 118.455m items/sec 118.476m
items/sec 0.018 {'null_percent': 0.01}
8 GreaterArrayScalarInt64/32768/2 2.873b items/sec 2.870b
items/sec -0.089 {'null_percent': 50.0}
14 GreaterArrayArrayInt64/32768/2 1.943b items/sec 1.941b
items/sec -0.130 {'null_percent': 50.0}
6 GreaterArrayArrayInt64/32768/0 1.980b items/sec 1.977b
items/sec -0.145 {'null_percent': 0.0}
7 GreaterArrayArrayString/32768/100 119.003m items/sec 118.708m
items/sec -0.248 {'null_percent': 1.0}
17 GreaterArrayArrayInt64/32768/100 1.961b items/sec 1.947b
items/sec -0.683 {'null_percent': 1.0}
16 GreaterArrayScalarInt64/32768/0 2.901b items/sec 2.881b
items/sec -0.701 {'null_percent': 0.0}
5 GreaterArrayArrayInt64/32768/10 1.961b items/sec 1.945b
items/sec -0.834 {'null_percent': 10.0}
21 GreaterArrayArrayInt64/32768/10000 1.967b items/sec 1.939b
items/sec -1.399 {'null_percent': 0.01}
1 GreaterArrayScalarInt64/32768/100 2.854b items/sec 2.813b
items/sec -1.430 {'null_percent': 1.0}
2 GreaterArrayScalarInt64/32768/10000 2.869b items/sec 2.806b
items/sec -2.201 {'null_percent': 0.01}
23 GreaterArrayScalarString/32768/2 231.294m items/sec 223.778m
items/sec -3.250 {'null_percent': 50.0}
22 GreaterArrayArrayString/32768/10 124.178m items/sec 119.522m
items/sec -3.750 {'null_percent': 10.0}
4 GreaterArrayArrayString/32768/2 163.400m items/sec 139.325m
items/sec -14.734 {'null_percent': 50.0}
0 GreaterArrayArrayString/32768/1 815.032m items/sec 688.068m
items/sec -15.578 {'null_percent': 100.0}
13 GreaterArrayScalarString/32768/1 1.100b items/sec 858.483m
items/sec -21.922 {'null_percent': 100.0}
11 GreaterArrayScalarString/32768/10 655.258m items/sec 490.804m
items/sec -25.098 {'null_percent': 10.0}
18 GreaterArrayScalarString/32768/100 976.617m items/sec 626.209m
items/sec -35.880 {'null_percent': 1.0}
10 GreaterArrayScalarString/32768/0 1.039b items/sec 650.698m
items/sec -37.376 {'null_percent': 0.0}
19 GreaterArrayScalarString/32768/10000 1.033b items/sec 646.860m
items/sec -37.397 {'null_percent': 0.01}
```
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]