kazuyukitanimura commented on a change in pull request #34611:
URL: https://github.com/apache/spark/pull/34611#discussion_r751886848
##########
File path: sql/core/benchmarks/DataSourceReadBenchmark-jdk11-results.txt
##########
@@ -1,252 +1,275 @@
+================================================================================================
+SQL Single Boolean Column Scan
+================================================================================================
+
+OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1020-azure
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
+SQL Single BOOLEAN Column Scan: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
+------------------------------------------------------------------------------------------------------------------------
+SQL CSV 9748 9907
225 1.6 619.7 1.0X
+SQL Json 8466 8468
3 1.9 538.2 1.2X
+SQL Parquet Vectorized 124 149
21 127.2 7.9 78.8X
+SQL Parquet MR 2057 2071
20 7.6 130.8 4.7X
+SQL ORC Vectorized 183 232
40 86.1 11.6 53.3X
+SQL ORC MR 1517 1546
41 10.4 96.4 6.4X
+
+OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1020-azure
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
+Parquet Reader Single BOOLEAN Column Scan: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
+-------------------------------------------------------------------------------------------------------------------------
+ParquetReader Vectorized 100 107
13 157.1 6.4 1.0X
+ParquetReader Vectorized -> Row 52 54
3 303.1 3.3 1.9X
+
+
================================================================================================
SQL Single Numeric Column Scan
================================================================================================
-OpenJDK 64-Bit Server VM 11.0.10+9-LTS on Linux 5.4.0-1043-azure
-Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
+OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1020-azure
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
SQL Single TINYINT Column Scan: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
-SQL CSV 13405 13422
24 1.2 852.3 1.0X
-SQL Json 10723 10788
92 1.5 681.7 1.3X
-SQL Parquet Vectorized 164 217
50 95.9 10.4 81.8X
-SQL Parquet MR 2349 2440
129 6.7 149.3 5.7X
-SQL ORC Vectorized 312 346
23 50.4 19.8 43.0X
-SQL ORC MR 1610 1659
69 9.8 102.4 8.3X
-
-OpenJDK 64-Bit Server VM 11.0.10+9-LTS on Linux 5.4.0-1043-azure
-Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
+SQL CSV 11664 11685
30 1.3 741.6 1.0X
+SQL Json 9144 9154
14 1.7 581.3 1.3X
+SQL Parquet Vectorized 136 152
24 115.7 8.6 85.8X
+SQL Parquet MR 2157 2172
22 7.3 137.1 5.4X
+SQL ORC Vectorized 212 251
30 74.0 13.5 54.9X
+SQL ORC MR 1626 1628
3 9.7 103.4 7.2X
+
+OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1020-azure
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
Parquet Reader Single TINYINT Column Scan: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
-------------------------------------------------------------------------------------------------------------------------
-ParquetReader Vectorized 187 209
20 84.3 11.9 1.0X
-ParquetReader Vectorized -> Row 89 95
5 177.6 5.6 2.1X
+ParquetReader Vectorized 183 192
10 85.8 11.7 1.0X
+ParquetReader Vectorized -> Row 93 97
9 169.9 5.9 2.0X
-OpenJDK 64-Bit Server VM 11.0.10+9-LTS on Linux 5.4.0-1043-azure
-Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
+OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1020-azure
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
SQL Single SMALLINT Column Scan: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
-SQL CSV 14214 14549
474 1.1 903.7 1.0X
-SQL Json 11866 11934
95 1.3 754.4 1.2X
-SQL Parquet Vectorized 294 342
53 53.6 18.7 48.4X
-SQL Parquet MR 2929 3004
107 5.4 186.2 4.9X
-SQL ORC Vectorized 312 328
15 50.4 19.8 45.5X
-SQL ORC MR 2037 2097
84 7.7 129.5 7.0X
-
-OpenJDK 64-Bit Server VM 11.0.10+9-LTS on Linux 5.4.0-1043-azure
-Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
+SQL CSV 12278 12303
35 1.3 780.6 1.0X
+SQL Json 9534 9546
16 1.6 606.2 1.3X
+SQL Parquet Vectorized 167 205
32 93.9 10.6 73.3X
+SQL Parquet MR 2543 2564
30 6.2 161.7 4.8X
+SQL ORC Vectorized 217 265
32 72.6 13.8 56.7X
+SQL ORC MR 1832 1861
41 8.6 116.4 6.7X
+
+OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1020-azure
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
Parquet Reader Single SMALLINT Column Scan: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
--------------------------------------------------------------------------------------------------------------------------
-ParquetReader Vectorized 249 266
18 63.1 15.8 1.0X
-ParquetReader Vectorized -> Row 192 247
36 82.1 12.2 1.3X
+ParquetReader Vectorized 230 238
9 68.3 14.7 1.0X
+ParquetReader Vectorized -> Row 238 276
16 66.1 15.1 1.0X
Review comment:
I wouldn't worry about less than 10 (M/s) difference. The numbers easily
swing that much between attempts
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]