cyb70289 commented on pull request #8671:
URL: https://github.com/apache/arrow/pull/8671#issuecomment-727702783


   Tested on `Xeon(R) Gold 5218 CPU @ 2.30GHz` (skylake).
   
   gcc-7.5 has big improvement.
   ```
                                  benchmark            baseline           
contender  change %
   7        GreaterArrayScalarInt64/32768/1  584.172m items/sec    1.455b 
items/sec   149.105  {'null_percent': 100.0}
   16       GreaterArrayScalarInt64/32768/0  585.510m items/sec    1.457b 
items/sec   148.920  {'null_percent': 0.0}
   5        GreaterArrayScalarInt64/32768/2  583.826m items/sec    1.451b 
items/sec   148.611  {'null_percent': 50.0}
   6       GreaterArrayScalarInt64/32768/10  583.846m items/sec    1.449b 
items/sec   148.178  {'null_percent': 10.0}
   2    GreaterArrayScalarInt64/32768/10000  584.545m items/sec    1.448b 
items/sec   147.702  {'null_percent': 0.01}
   15     GreaterArrayScalarInt64/32768/100  585.013m items/sec    1.446b 
items/sec   147.107  {'null_percent': 1.0}
   0         GreaterArrayArrayInt64/32768/0  396.250m items/sec  921.516m 
items/sec   132.559  {'null_percent': 0.0}
   8         GreaterArrayArrayInt64/32768/1  395.204m items/sec  917.636m 
items/sec   132.193  {'null_percent': 100.0}
   19      GreaterArrayArrayInt64/32768/100  394.544m items/sec  913.442m 
items/sec   131.518  {'null_percent': 1.0}
   20    GreaterArrayArrayInt64/32768/10000  392.995m items/sec  908.788m 
items/sec   131.247  {'null_percent': 0.01}
   14        GreaterArrayArrayInt64/32768/2  394.104m items/sec  909.476m 
items/sec   130.771  {'null_percent': 50.0}
   11       GreaterArrayArrayInt64/32768/10  394.966m items/sec  898.748m 
items/sec   127.551  {'null_percent': 10.0}
   10      GreaterArrayScalarString/32768/1  884.277m items/sec  956.126m 
items/sec     8.125  {'null_percent': 100.0}
   1        GreaterArrayArrayString/32768/1  305.843m items/sec  302.862m 
items/sec    -0.975  {'null_percent': 100.0}
   21      GreaterArrayScalarString/32768/2  278.355m items/sec  273.676m 
items/sec    -1.681  {'null_percent': 50.0}
   22       GreaterArrayArrayString/32768/2  140.984m items/sec  137.974m 
items/sec    -2.135  {'null_percent': 50.0}
   12     GreaterArrayArrayString/32768/100  103.022m items/sec   98.360m 
items/sec    -4.525  {'null_percent': 1.0}
   4       GreaterArrayArrayString/32768/10  104.032m items/sec   99.117m 
items/sec    -4.724  {'null_percent': 10.0}
   13       GreaterArrayArrayString/32768/0  103.551m items/sec   98.652m 
items/sec    -4.731  {'null_percent': 0.0}
   3    GreaterArrayArrayString/32768/10000  103.255m items/sec   97.749m 
items/sec    -5.332  {'null_percent': 0.01}
   23  GreaterArrayScalarString/32768/10000  964.903m items/sec  912.461m 
items/sec    -5.435  {'null_percent': 0.01}
   18    GreaterArrayScalarString/32768/100  924.999m items/sec  874.072m 
items/sec    -5.506  {'null_percent': 1.0}
   17      GreaterArrayScalarString/32768/0  973.155m items/sec  917.459m 
items/sec    -5.723  {'null_percent': 0.0}
   9      GreaterArrayScalarString/32768/10  701.792m items/sec  630.090m 
items/sec   -10.217  {'null_percent': 10.0}
   ```
   
   clang-9 no benefit.
   ```
                                  benchmark            baseline           
contender  change %
   3        GreaterArrayScalarInt64/32768/1    2.863b items/sec    2.891b 
items/sec     0.977  {'null_percent': 100.0}
   12      GreaterArrayScalarInt64/32768/10    2.845b items/sec    2.860b 
items/sec     0.548  {'null_percent': 10.0}
   9         GreaterArrayArrayInt64/32768/1    1.959b items/sec    1.967b 
items/sec     0.381  {'null_percent': 100.0}
   15       GreaterArrayArrayString/32768/0  118.703m items/sec  118.738m 
items/sec     0.030  {'null_percent': 0.0}
   20   GreaterArrayArrayString/32768/10000  118.455m items/sec  118.476m 
items/sec     0.018  {'null_percent': 0.01}
   8        GreaterArrayScalarInt64/32768/2    2.873b items/sec    2.870b 
items/sec    -0.089  {'null_percent': 50.0}
   14        GreaterArrayArrayInt64/32768/2    1.943b items/sec    1.941b 
items/sec    -0.130  {'null_percent': 50.0}
   6         GreaterArrayArrayInt64/32768/0    1.980b items/sec    1.977b 
items/sec    -0.145  {'null_percent': 0.0}
   7      GreaterArrayArrayString/32768/100  119.003m items/sec  118.708m 
items/sec    -0.248  {'null_percent': 1.0}
   17      GreaterArrayArrayInt64/32768/100    1.961b items/sec    1.947b 
items/sec    -0.683  {'null_percent': 1.0}
   16       GreaterArrayScalarInt64/32768/0    2.901b items/sec    2.881b 
items/sec    -0.701  {'null_percent': 0.0}
   5        GreaterArrayArrayInt64/32768/10    1.961b items/sec    1.945b 
items/sec    -0.834  {'null_percent': 10.0}
   21    GreaterArrayArrayInt64/32768/10000    1.967b items/sec    1.939b 
items/sec    -1.399  {'null_percent': 0.01}
   1      GreaterArrayScalarInt64/32768/100    2.854b items/sec    2.813b 
items/sec    -1.430  {'null_percent': 1.0}
   2    GreaterArrayScalarInt64/32768/10000    2.869b items/sec    2.806b 
items/sec    -2.201  {'null_percent': 0.01}
   23      GreaterArrayScalarString/32768/2  231.294m items/sec  223.778m 
items/sec    -3.250  {'null_percent': 50.0}
   22      GreaterArrayArrayString/32768/10  124.178m items/sec  119.522m 
items/sec    -3.750  {'null_percent': 10.0}
   4        GreaterArrayArrayString/32768/2  163.400m items/sec  139.325m 
items/sec   -14.734  {'null_percent': 50.0}
   0        GreaterArrayArrayString/32768/1  815.032m items/sec  688.068m 
items/sec   -15.578  {'null_percent': 100.0}
   13      GreaterArrayScalarString/32768/1    1.100b items/sec  858.483m 
items/sec   -21.922  {'null_percent': 100.0}
   11     GreaterArrayScalarString/32768/10  655.258m items/sec  490.804m 
items/sec   -25.098  {'null_percent': 10.0}
   18    GreaterArrayScalarString/32768/100  976.617m items/sec  626.209m 
items/sec   -35.880  {'null_percent': 1.0}
   10      GreaterArrayScalarString/32768/0    1.039b items/sec  650.698m 
items/sec   -37.376  {'null_percent': 0.0}
   19  GreaterArrayScalarString/32768/10000    1.033b items/sec  646.860m 
items/sec   -37.397  {'null_percent': 0.01}
   ```


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Reply via email to