MaxGekk commented on a change in pull request #32044:
URL: https://github.com/apache/spark/pull/32044#discussion_r606672262
##########
File path: sql/core/benchmarks/WideSchemaBenchmark-results.txt
##########
@@ -2,144 +2,144 @@
parsing large select expressions
================================================================================================
-OpenJDK 64-Bit Server VM 1.8.0_232-8u232-b09-0ubuntu1~18.04.1-b09 on Linux
4.15.0-1044-aws
-Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
+OpenJDK 64-Bit Server VM 1.8.0_282-b08 on Linux 5.4.0-1043-azure
+Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
parsing large select: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
-1 select expressions 5 13
8 0.0 5370143.0 1.0X
-100 select expressions 12 16
6 0.0 11995425.0 0.4X
-2500 select expressions 211 214
4 0.0 210927791.0 0.0X
+1 select expressions 1 2
0 0.0 1296117.0 1.0X
+100 select expressions 9 11
1 0.0 8808690.0 0.1X
+2500 select expressions 422 426
5 0.0 421632363.0 0.0X
Review comment:
regression by 2 times?
##########
File path: sql/core/benchmarks/CSVBenchmark-results.txt
##########
@@ -2,66 +2,66 @@
Benchmark to measure CSV read/write performance
================================================================================================
-Java HotSpot(TM) 64-Bit Server VM 1.8.0_202-b08 on Mac OS X 10.15.7
-Intel(R) Core(TM) i9-9980HK CPU @ 2.40GHz
+OpenJDK 64-Bit Server VM 1.8.0_282-b08 on Linux 5.4.0-1043-azure
+Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Parsing quoted values: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
-One quoted string 24185 24195
10 0.0 483694.2 1.0X
+One quoted string 43757 44446
765 0.0 875148.4 1.0X
-Java HotSpot(TM) 64-Bit Server VM 1.8.0_202-b08 on Mac OS X 10.15.7
-Intel(R) Core(TM) i9-9980HK CPU @ 2.40GHz
+OpenJDK 64-Bit Server VM 1.8.0_282-b08 on Linux 5.4.0-1043-azure
+Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Wide rows with 1000 columns: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
-Select 1000 columns 61793 62388
532 0.0 61793.4 1.0X
-Select 100 columns 21958 21993
34 0.0 21957.9 2.8X
-Select one column 18215 18515
505 0.1 18215.0 3.4X
-count() 5865 6168
296 0.2 5865.1 10.5X
-Select 100 columns, one bad input field 39638 39739
124 0.0 39637.5 1.6X
-Select 100 columns, corrupt record field 47290 48133
741 0.0 47290.0 1.3X
+Select 1000 columns 96330 99161
NaN 0.0 96329.7 1.0X
+Select 100 columns 41414 42672
1556 0.0 41414.1 2.3X
+Select one column 35365 36113
662 0.0 35365.4 2.7X
+count() 18845 18867
26 0.1 18845.0 5.1X
Review comment:
regression by 2 times
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]