cyb70289 commented on pull request #8671: URL: https://github.com/apache/arrow/pull/8671#issuecomment-727702783
Tested on `Xeon(R) Gold 5218 CPU @ 2.30GHz` (skylake). gcc-7.5 has big improvement. ``` benchmark baseline contender change % 7 GreaterArrayScalarInt64/32768/1 584.172m items/sec 1.455b items/sec 149.105 {'null_percent': 100.0} 16 GreaterArrayScalarInt64/32768/0 585.510m items/sec 1.457b items/sec 148.920 {'null_percent': 0.0} 5 GreaterArrayScalarInt64/32768/2 583.826m items/sec 1.451b items/sec 148.611 {'null_percent': 50.0} 6 GreaterArrayScalarInt64/32768/10 583.846m items/sec 1.449b items/sec 148.178 {'null_percent': 10.0} 2 GreaterArrayScalarInt64/32768/10000 584.545m items/sec 1.448b items/sec 147.702 {'null_percent': 0.01} 15 GreaterArrayScalarInt64/32768/100 585.013m items/sec 1.446b items/sec 147.107 {'null_percent': 1.0} 0 GreaterArrayArrayInt64/32768/0 396.250m items/sec 921.516m items/sec 132.559 {'null_percent': 0.0} 8 GreaterArrayArrayInt64/32768/1 395.204m items/sec 917.636m items/sec 132.193 {'null_percent': 100.0} 19 GreaterArrayArrayInt64/32768/100 394.544m items/sec 913.442m items/sec 131.518 {'null_percent': 1.0} 20 GreaterArrayArrayInt64/32768/10000 392.995m items/sec 908.788m items/sec 131.247 {'null_percent': 0.01} 14 GreaterArrayArrayInt64/32768/2 394.104m items/sec 909.476m items/sec 130.771 {'null_percent': 50.0} 11 GreaterArrayArrayInt64/32768/10 394.966m items/sec 898.748m items/sec 127.551 {'null_percent': 10.0} 10 GreaterArrayScalarString/32768/1 884.277m items/sec 956.126m items/sec 8.125 {'null_percent': 100.0} 1 GreaterArrayArrayString/32768/1 305.843m items/sec 302.862m items/sec -0.975 {'null_percent': 100.0} 21 GreaterArrayScalarString/32768/2 278.355m items/sec 273.676m items/sec -1.681 {'null_percent': 50.0} 22 GreaterArrayArrayString/32768/2 140.984m items/sec 137.974m items/sec -2.135 {'null_percent': 50.0} 12 GreaterArrayArrayString/32768/100 103.022m items/sec 98.360m items/sec -4.525 {'null_percent': 1.0} 4 GreaterArrayArrayString/32768/10 104.032m items/sec 99.117m items/sec -4.724 {'null_percent': 10.0} 13 GreaterArrayArrayString/32768/0 103.551m items/sec 98.652m items/sec -4.731 {'null_percent': 0.0} 3 GreaterArrayArrayString/32768/10000 103.255m items/sec 97.749m items/sec -5.332 {'null_percent': 0.01} 23 GreaterArrayScalarString/32768/10000 964.903m items/sec 912.461m items/sec -5.435 {'null_percent': 0.01} 18 GreaterArrayScalarString/32768/100 924.999m items/sec 874.072m items/sec -5.506 {'null_percent': 1.0} 17 GreaterArrayScalarString/32768/0 973.155m items/sec 917.459m items/sec -5.723 {'null_percent': 0.0} 9 GreaterArrayScalarString/32768/10 701.792m items/sec 630.090m items/sec -10.217 {'null_percent': 10.0} ``` clang-9 no benefit. ``` benchmark baseline contender change % 3 GreaterArrayScalarInt64/32768/1 2.863b items/sec 2.891b items/sec 0.977 {'null_percent': 100.0} 12 GreaterArrayScalarInt64/32768/10 2.845b items/sec 2.860b items/sec 0.548 {'null_percent': 10.0} 9 GreaterArrayArrayInt64/32768/1 1.959b items/sec 1.967b items/sec 0.381 {'null_percent': 100.0} 15 GreaterArrayArrayString/32768/0 118.703m items/sec 118.738m items/sec 0.030 {'null_percent': 0.0} 20 GreaterArrayArrayString/32768/10000 118.455m items/sec 118.476m items/sec 0.018 {'null_percent': 0.01} 8 GreaterArrayScalarInt64/32768/2 2.873b items/sec 2.870b items/sec -0.089 {'null_percent': 50.0} 14 GreaterArrayArrayInt64/32768/2 1.943b items/sec 1.941b items/sec -0.130 {'null_percent': 50.0} 6 GreaterArrayArrayInt64/32768/0 1.980b items/sec 1.977b items/sec -0.145 {'null_percent': 0.0} 7 GreaterArrayArrayString/32768/100 119.003m items/sec 118.708m items/sec -0.248 {'null_percent': 1.0} 17 GreaterArrayArrayInt64/32768/100 1.961b items/sec 1.947b items/sec -0.683 {'null_percent': 1.0} 16 GreaterArrayScalarInt64/32768/0 2.901b items/sec 2.881b items/sec -0.701 {'null_percent': 0.0} 5 GreaterArrayArrayInt64/32768/10 1.961b items/sec 1.945b items/sec -0.834 {'null_percent': 10.0} 21 GreaterArrayArrayInt64/32768/10000 1.967b items/sec 1.939b items/sec -1.399 {'null_percent': 0.01} 1 GreaterArrayScalarInt64/32768/100 2.854b items/sec 2.813b items/sec -1.430 {'null_percent': 1.0} 2 GreaterArrayScalarInt64/32768/10000 2.869b items/sec 2.806b items/sec -2.201 {'null_percent': 0.01} 23 GreaterArrayScalarString/32768/2 231.294m items/sec 223.778m items/sec -3.250 {'null_percent': 50.0} 22 GreaterArrayArrayString/32768/10 124.178m items/sec 119.522m items/sec -3.750 {'null_percent': 10.0} 4 GreaterArrayArrayString/32768/2 163.400m items/sec 139.325m items/sec -14.734 {'null_percent': 50.0} 0 GreaterArrayArrayString/32768/1 815.032m items/sec 688.068m items/sec -15.578 {'null_percent': 100.0} 13 GreaterArrayScalarString/32768/1 1.100b items/sec 858.483m items/sec -21.922 {'null_percent': 100.0} 11 GreaterArrayScalarString/32768/10 655.258m items/sec 490.804m items/sec -25.098 {'null_percent': 10.0} 18 GreaterArrayScalarString/32768/100 976.617m items/sec 626.209m items/sec -35.880 {'null_percent': 1.0} 10 GreaterArrayScalarString/32768/0 1.039b items/sec 650.698m items/sec -37.376 {'null_percent': 0.0} 19 GreaterArrayScalarString/32768/10000 1.033b items/sec 646.860m items/sec -37.397 {'null_percent': 0.01} ``` ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org