kazuyukitanimura commented on a change in pull request #34611:
URL: https://github.com/apache/spark/pull/34611#discussion_r751886848



##########
File path: sql/core/benchmarks/DataSourceReadBenchmark-jdk11-results.txt
##########
@@ -1,252 +1,275 @@
+================================================================================================
+SQL Single Boolean Column Scan
+================================================================================================
+
+OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1020-azure
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
+SQL Single BOOLEAN Column Scan:           Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
+------------------------------------------------------------------------------------------------------------------------
+SQL CSV                                            9748           9907         225          1.6         619.7       1.0X
+SQL Json                                           8466           8468           3          1.9         538.2       1.2X
+SQL Parquet Vectorized                              124            149          21        127.2           7.9      78.8X
+SQL Parquet MR                                     2057           2071          20          7.6         130.8       4.7X
+SQL ORC Vectorized                                  183            232          40         86.1          11.6      53.3X
+SQL ORC MR                                         1517           1546          41         10.4          96.4       6.4X
+
+OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1020-azure
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
+Parquet Reader Single BOOLEAN Column Scan:  Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
+-------------------------------------------------------------------------------------------------------------------------
+ParquetReader Vectorized                             100            107          13        157.1           6.4       1.0X
+ParquetReader Vectorized -> Row                       52             54           3        303.1           3.3       1.9X
+
+
 ================================================================================================
 SQL Single Numeric Column Scan
 ================================================================================================
 
-OpenJDK 64-Bit Server VM 11.0.10+9-LTS on Linux 5.4.0-1043-azure
-Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
+OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1020-azure
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
 SQL Single TINYINT Column Scan:           Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
 ------------------------------------------------------------------------------------------------------------------------
-SQL CSV                                           13405          13422          24          1.2         852.3       1.0X
-SQL Json                                          10723          10788          92          1.5         681.7       1.3X
-SQL Parquet Vectorized                              164            217          50         95.9          10.4      81.8X
-SQL Parquet MR                                     2349           2440         129          6.7         149.3       5.7X
-SQL ORC Vectorized                                  312            346          23         50.4          19.8      43.0X
-SQL ORC MR                                         1610           1659          69          9.8         102.4       8.3X
-
-OpenJDK 64-Bit Server VM 11.0.10+9-LTS on Linux 5.4.0-1043-azure
-Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
+SQL CSV                                           11664          11685          30          1.3         741.6       1.0X
+SQL Json                                           9144           9154          14          1.7         581.3       1.3X
+SQL Parquet Vectorized                              136            152          24        115.7           8.6      85.8X
+SQL Parquet MR                                     2157           2172          22          7.3         137.1       5.4X
+SQL ORC Vectorized                                  212            251          30         74.0          13.5      54.9X
+SQL ORC MR                                         1626           1628           3          9.7         103.4       7.2X
+
+OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1020-azure
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
 Parquet Reader Single TINYINT Column Scan:  Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
 -------------------------------------------------------------------------------------------------------------------------
-ParquetReader Vectorized                             187            209          20         84.3          11.9       1.0X
-ParquetReader Vectorized -> Row                       89             95           5        177.6           5.6       2.1X
+ParquetReader Vectorized                             183            192          10         85.8          11.7       1.0X
+ParquetReader Vectorized -> Row                       93             97           9        169.9           5.9       2.0X
 
-OpenJDK 64-Bit Server VM 11.0.10+9-LTS on Linux 5.4.0-1043-azure
-Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
+OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1020-azure
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
 SQL Single SMALLINT Column Scan:          Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
 ------------------------------------------------------------------------------------------------------------------------
-SQL CSV                                           14214          14549         474          1.1         903.7       1.0X
-SQL Json                                          11866          11934          95          1.3         754.4       1.2X
-SQL Parquet Vectorized                              294            342          53         53.6          18.7      48.4X
-SQL Parquet MR                                     2929           3004         107          5.4         186.2       4.9X
-SQL ORC Vectorized                                  312            328          15         50.4          19.8      45.5X
-SQL ORC MR                                         2037           2097          84          7.7         129.5       7.0X
-
-OpenJDK 64-Bit Server VM 11.0.10+9-LTS on Linux 5.4.0-1043-azure
-Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
+SQL CSV                                           12278          12303          35          1.3         780.6       1.0X
+SQL Json                                           9534           9546          16          1.6         606.2       1.3X
+SQL Parquet Vectorized                              167            205          32         93.9          10.6      73.3X
+SQL Parquet MR                                     2543           2564          30          6.2         161.7       4.8X
+SQL ORC Vectorized                                  217            265          32         72.6          13.8      56.7X
+SQL ORC MR                                         1832           1861          41          8.6         116.4       6.7X
+
+OpenJDK 64-Bit Server VM 11.0.13+8-LTS on Linux 5.11.0-1020-azure
+Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
 Parquet Reader Single SMALLINT Column Scan:  Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
 --------------------------------------------------------------------------------------------------------------------------
-ParquetReader Vectorized                              249            266          18         63.1          15.8       1.0X
-ParquetReader Vectorized -> Row                       192            247          36         82.1          12.2       1.3X
+ParquetReader Vectorized                              230            238           9         68.3          14.7       1.0X
+ParquetReader Vectorized -> Row                       238            276          16         66.1          15.1       1.0X

Review comment:
       I wouldn't worry about a difference of less than 10 M/s. The numbers easily swing that much between runs.
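
A minimal sketch of that rule of thumb, for illustration only. The rates are the ParquetReader `Rate(M/s)` values copied from the diff above; the helper name `outside_noise` and the fixed 10 M/s band are assumptions made here and are not part of Spark's benchmark tooling.

```python
# (old, new) Rate(M/s) pairs copied from the ParquetReader rows in the diff.
RATES = {
    "TINYINT Vectorized": (84.3, 85.8),
    "TINYINT Vectorized -> Row": (177.6, 169.9),
    "SMALLINT Vectorized": (63.1, 68.3),
    "SMALLINT Vectorized -> Row": (82.1, 66.1),
}

# Assumed noise band in M/s, taken from the rule of thumb in the comment.
NOISE_BAND = 10.0


def outside_noise(rates, threshold=NOISE_BAND):
    """Return the names whose old and new rates differ by more than `threshold`."""
    return [
        name
        for name, (old, new) in rates.items()
        if abs(new - old) > threshold
    ]


if __name__ == "__main__":
    for name in outside_noise(RATES):
        old, new = RATES[name]
        print(f"{name}: {old} -> {new} M/s")
```

A fixed band like this is crude. Rerunning the benchmark and comparing several runs is the more reliable check.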




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


