dongjoon-hyun commented on a change in pull request #35102:
URL: https://github.com/apache/spark/pull/35102#discussion_r778572663
##########
File path: sql/hive/benchmarks/OrcReadBenchmark-jdk11-results.txt
##########
@@ -3,154 +3,220 @@ SQL Single Numeric Column Scan
================================================================================================
OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1022-azure
-Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
 SQL Single TINYINT Column Scan:          Best Time(ms)  Avg Time(ms)  Stdev(ms)   Rate(M/s)  Per Row(ns)  Relative
 ------------------------------------------------------------------------------------------------------------------------
-Native ORC MR                                      953          1002         69        16.5         60.6      1.0X
-Native ORC Vectorized                              164           228         55        95.7         10.5      5.8X
-Hive built-in ORC                                 1433          1464         44        11.0         91.1      0.7X
+Native ORC MR                                      928           976         51        16.9         59.0      1.0X
+Native ORC Vectorized                              257           342         70        61.1         16.4      3.6X
+Hive built-in ORC                                 1201          1233         45        13.1         76.4      0.8X
OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1022-azure
-Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
 SQL Single SMALLINT Column Scan:         Best Time(ms)  Avg Time(ms)  Stdev(ms)   Rate(M/s)  Per Row(ns)  Relative
 ------------------------------------------------------------------------------------------------------------------------
-Native ORC MR                                     1011          1016          7        15.6         64.3      1.0X
-Native ORC Vectorized                              204           257         51        77.3         12.9      5.0X
-Hive built-in ORC                                 1500          1580        112        10.5         95.4      0.7X
+Native ORC MR                                      884           893         13        17.8         56.2      1.0X
+Native ORC Vectorized                              222           305         73        70.8         14.1      4.0X
+Hive built-in ORC                                 1211          1270         83        13.0         77.0      0.7X
OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1022-azure
-Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
 SQL Single INT Column Scan:              Best Time(ms)  Avg Time(ms)  Stdev(ms)   Rate(M/s)  Per Row(ns)  Relative
 ------------------------------------------------------------------------------------------------------------------------
-Native ORC MR                                     1139          1189         71        13.8         72.4      1.0X
-Native ORC Vectorized                              209           289         61        75.1         13.3      5.4X
-Hive built-in ORC                                 1625          1704        113         9.7        103.3      0.7X
+Native ORC MR                                      923           964         44        17.0         58.7      1.0X
+Native ORC Vectorized                              186           297         55        84.8         11.8      5.0X
+Hive built-in ORC                                 1347          1355         11        11.7         85.6      0.7X
OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1022-azure
-Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
 SQL Single BIGINT Column Scan:           Best Time(ms)  Avg Time(ms)  Stdev(ms)   Rate(M/s)  Per Row(ns)  Relative
 ------------------------------------------------------------------------------------------------------------------------
-Native ORC MR                                     1224          1224          1        12.9         77.8      1.0X
-Native ORC Vectorized                              388           415         20        40.5         24.7      3.2X
-Hive built-in ORC                                 1802          1834         45         8.7        114.6      0.7X
+Native ORC MR                                     1032          1096         91        15.2         65.6      1.0X
+Native ORC Vectorized                              336           367         44        46.8         21.4      3.1X
+Hive built-in ORC                                 1381          1392         16        11.4         87.8      0.7X
OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1022-azure
-Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
 SQL Single FLOAT Column Scan:            Best Time(ms)  Avg Time(ms)  Stdev(ms)   Rate(M/s)  Per Row(ns)  Relative
 ------------------------------------------------------------------------------------------------------------------------
-Native ORC MR                                     1196          1248         74        13.1         76.1      1.0X
-Native ORC Vectorized                              279           357         78        56.4         17.7      4.3X
-Hive built-in ORC                                 1742          1764         31         9.0        110.8      0.7X
+Native ORC MR                                     1032          1045         19        15.2         65.6      1.0X
+Native ORC Vectorized                              340           362         15        46.2         21.6      3.0X
+Hive built-in ORC                                 1447          1478         45        10.9         92.0      0.7X
OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1022-azure
-Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
 SQL Single DOUBLE Column Scan:           Best Time(ms)  Avg Time(ms)  Stdev(ms)   Rate(M/s)  Per Row(ns)  Relative
 ------------------------------------------------------------------------------------------------------------------------
-Native ORC MR                                     1252          1254          2        12.6         79.6      1.0X
-Native ORC Vectorized                              442           479         26        35.6         28.1      2.8X
-Hive built-in ORC                                 1768          1782         20         8.9        112.4      0.7X
+Native ORC MR                                     1055          1057          2        14.9         67.1      1.0X
+Native ORC Vectorized                              353           378         19        44.6         22.4      3.0X
+Hive built-in ORC                                 1401          1414         18        11.2         89.1      0.8X
================================================================================================
Int and String Scan
================================================================================================
OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1022-azure
-Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
 Int and String Scan:                     Best Time(ms)  Avg Time(ms)  Stdev(ms)   Rate(M/s)  Per Row(ns)  Relative
 ------------------------------------------------------------------------------------------------------------------------
-Native ORC MR                                     2501          2561         85         4.2        238.5      1.0X
-Native ORC Vectorized                             1350          1448        138         7.8        128.8      1.9X
-Hive built-in ORC                                 3032          3080         67         3.5        289.2      0.8X
+Native ORC MR                                     2009          2082        104         5.2        191.6      1.0X
+Native ORC Vectorized                             1300          1337         52         8.1        124.0      1.5X
+Hive built-in ORC                                 2423          2437         20         4.3        231.0      0.8X
================================================================================================
Partitioned Table Scan
================================================================================================
OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1022-azure
-Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
 Partitioned Table:                       Best Time(ms)  Avg Time(ms)  Stdev(ms)   Rate(M/s)  Per Row(ns)  Relative
 ------------------------------------------------------------------------------------------------------------------------
-Data column - Native ORC MR                       1969          2024         76         8.0        125.2      1.0X
-Data column - Native ORC Vectorized                448           520         57        35.1         28.5      4.4X
-Data column - Hive built-in ORC                   2335          2407        102         6.7        148.4      0.8X
-Partition column - Native ORC MR                   959           975         16        16.4         61.0      2.1X
-Partition column - Native ORC Vectorized            71           105         26       220.9          4.5     27.7X
-Partition column - Hive built-in ORC              1391          1415         33        11.3         88.5      1.4X
-Both columns - Native ORC MR                      1657          1735        111         9.5        105.3      1.2X
-Both columns - Native ORC Vectorized               330           434         87        47.7         21.0      6.0X
-Both columns - Hive built-in ORC                  1898          2079        257         8.3        120.7      1.0X
+Data column - Native ORC MR                       1068          1156        124        14.7         67.9      1.0X
Review comment:
Although this is a known limitation of GitHub Actions results, this
twice-as-fast result always misleads me. :)
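
For reference, a minimal sketch of the arithmetic behind the "twice as fast" impression, using Best Time(ms) values copied from the diff above. The dictionary keys and the `speedup` helper are hypothetical names chosen for illustration. Attributing the gap to hardware is an assumption based only on the CPU model line differing between the two runs (E5-2673 v4 vs. Platinum 8272CL); the numbers alone cannot separate hardware effects from code changes.

```python
# Best Time (ms) values copied from the diff above: (old run, new run).
# The keys are illustrative labels, not identifiers from Spark.
best_ms = {
    "TINYINT - Native ORC MR": (953, 928),
    "TINYINT - Native ORC Vectorized": (164, 257),
    "Partitioned - Data column - Native ORC MR": (1969, 1068),
}


def speedup(old_ms: float, new_ms: float) -> float:
    """Return how many times faster the new run is (values > 1 mean faster)."""
    return old_ms / new_ms


# Old-vs-new ratio per row, rounded to two decimals for readability.
ratios = {name: round(speedup(old, new), 2) for name, (old, new) in best_ms.items()}

for name, ratio in ratios.items():
    print(f"{name}: {ratio}x")
```

The row this comment is attached to comes out at roughly 1.84x, while the other sampled rows move in both directions, which is consistent with runner-to-runner variance rather than a uniform improvement.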