pitrou commented on pull request #8466: URL: https://github.com/apache/arrow/pull/8466#issuecomment-713661111
Results on an AMD Zen 2 CPU:

```
VarianceKernelInt32/1048576/10000     140 us    140 us    5030 bytes_per_second=6.98658G/s null_percent=0.01 size=1048.58k
VarianceKernelInt32/1048576/100       216 us    216 us    3267 bytes_per_second=4.5294G/s null_percent=1 size=1048.58k
VarianceKernelInt32/1048576/10        397 us    397 us    1763 bytes_per_second=2.45765G/s null_percent=10 size=1048.58k
VarianceKernelInt32/1048576/2         974 us    974 us     718 bytes_per_second=1026.87M/s null_percent=50 size=1048.58k
VarianceKernelInt32/1048576/1       0.816 us  0.816 us  844145 bytes_per_second=1.1684T/s null_percent=100 size=1048.58k
VarianceKernelInt32/1048576/0         130 us    130 us    5414 bytes_per_second=7.51569G/s null_percent=0 size=1048.58k
VarianceKernelInt64/1048576/10000     135 us    135 us    5174 bytes_per_second=7.22877G/s null_percent=0.01 size=1048.58k
VarianceKernelInt64/1048576/100       260 us    260 us    2682 bytes_per_second=3.7503G/s null_percent=1 size=1048.58k
VarianceKernelInt64/1048576/10        440 us    440 us    1591 bytes_per_second=2.21931G/s null_percent=10 size=1048.58k
VarianceKernelInt64/1048576/2         884 us    884 us     783 bytes_per_second=1.10507G/s null_percent=50 size=1048.58k
VarianceKernelInt64/1048576/1       0.821 us  0.821 us  840316 bytes_per_second=1.16182T/s null_percent=100 size=1048.58k
VarianceKernelInt64/1048576/0         123 us    123 us    5620 bytes_per_second=7.94262G/s null_percent=0 size=1048.58k
VarianceKernelFloat/1048576/10000     366 us    366 us    1909 bytes_per_second=2.66576G/s null_percent=0.01 size=1048.58k
VarianceKernelFloat/1048576/100       751 us    751 us     909 bytes_per_second=1.3003G/s null_percent=1 size=1048.58k
VarianceKernelFloat/1048576/10       1097 us   1097 us     637 bytes_per_second=911.712M/s null_percent=10 size=1048.58k
VarianceKernelFloat/1048576/2        1803 us   1802 us     387 bytes_per_second=554.854M/s null_percent=50 size=1048.58k
VarianceKernelFloat/1048576/1       0.817 us  0.817 us  838993 bytes_per_second=1.1679T/s null_percent=100 size=1048.58k
VarianceKernelFloat/1048576/0         346 us    346 us    2021 bytes_per_second=2.82409G/s null_percent=0 size=1048.58k
VarianceKernelDouble/1048576/10000    184 us    184 us    3751 bytes_per_second=5.30153G/s null_percent=0.01 size=1048.58k
VarianceKernelDouble/1048576/100      372 us    372 us    1869 bytes_per_second=2.62218G/s null_percent=1 size=1048.58k
VarianceKernelDouble/1048576/10       549 us    549 us    1249 bytes_per_second=1.77993G/s null_percent=10 size=1048.58k
VarianceKernelDouble/1048576/2        909 us    909 us     741 bytes_per_second=1099.92M/s null_percent=50 size=1048.58k
VarianceKernelDouble/1048576/1      0.831 us  0.831 us  831173 bytes_per_second=1.14779T/s null_percent=100 size=1048.58k
VarianceKernelDouble/1048576/0        174 us    174 us    4050 bytes_per_second=5.62431G/s null_percent=0 size=1048.58k
```

I'm curious why Int64 would be faster than Double. Aren't they using the same algorithm? (And Int64 goes through an additional int-to-float conversion for each value.)
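
For readers following the question about the per-value int-to-float conversion, here is a minimal sketch of where that conversion would sit. It is not the kernel under review: it assumes a two-pass algorithm (mean first, then the sum of squared deviations), uses `None` to stand in for null slots, and the name `variance_skip_nulls` is hypothetical.

```python
from typing import Optional, Sequence, Union

Number = Union[int, float]


def variance_skip_nulls(values: Sequence[Optional[Number]],
                        ddof: int = 0) -> Optional[float]:
    """Two-pass variance over a nullable sequence (illustrative only).

    Assumptions, not taken from the PR: nulls are represented as None,
    and every valid value is converted to float before any arithmetic.
    """
    # Drop nulls and convert each remaining value to float. For integer
    # input this is the extra per-value conversion the comment refers to.
    valid = [float(v) for v in values if v is not None]

    n = len(valid)
    if n <= ddof:
        # All-null (or too few values): nothing to compute.
        return None

    # Pass 1: mean of the valid values.
    mean = sum(valid) / n

    # Pass 2: sum of squared deviations from the mean.
    m2 = sum((v - mean) ** 2 for v in valid)

    return m2 / (n - ddof)
```

With integer input such as `variance_skip_nulls([1, 2, 3, 4])` every element passes through `float(v)`, whereas float input is already in the target type. On that basis alone one would expect the integer path to do at least as much work, which is what makes the Int64 vs. Double timings above surprising.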
