dongjoon-hyun commented on a change in pull request #28981:
URL: https://github.com/apache/spark/pull/28981#discussion_r449248689
##########
File path: sql/core/benchmarks/JsonBenchmark-jdk11-results.txt
##########
@@ -7,106 +7,106 @@ OpenJDK 64-Bit Server VM
11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux 4.15.0-106
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
JSON schema inferring: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
-No encoding 69219 69342
116 1.4 692.2 1.0X
-UTF-8 is set 143950 143986
55 0.7 1439.5 0.5X
+No encoding 73307 73400
141 1.4 733.1 1.0X
+UTF-8 is set 143834 143925
152 0.7 1438.3 0.5X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux
4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
count a short column: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
-No encoding 57828 57913
136 1.7 578.3 1.0X
-UTF-8 is set 83649 83711
60 1.2 836.5 0.7X
+No encoding 50894 51065
292 2.0 508.9 1.0X
+UTF-8 is set 98462 99455
1173 1.0 984.6 0.5X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux
4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
count a wide column: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
-No encoding 64560 65193
1023 0.2 6456.0 1.0X
-UTF-8 is set 102925 103174
216 0.1 10292.5 0.6X
+No encoding 64011 64969
1001 0.2 6401.1 1.0X
+UTF-8 is set 102757 102984
311 0.1 10275.7 0.6X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux
4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
select wide row: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
-No encoding 131002 132316
1160 0.0 262003.1 1.0X
-UTF-8 is set 152128 152371
332 0.0 304256.5 0.9X
+No encoding 132559 133561
1010 0.0 265117.3 1.0X
+UTF-8 is set 151458 152129
611 0.0 302915.4 0.9X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux
4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Select a subset of 10 columns: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
-Select 10 columns 19376 19514
160 0.5 1937.6 1.0X
-Select 1 column 24089 24156
58 0.4 2408.9 0.8X
+Select 10 columns 21148 21202
87 0.5 2114.8 1.0X
+Select 1 column 24701 24724
21 0.4 2470.1 0.9X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux
4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
creation of JSON parser per line: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
-Short column without encoding 8131 8219
103 1.2 813.1 1.0X
-Short column with UTF-8 13464 13508
44 0.7 1346.4 0.6X
-Wide column without encoding 108012 108598
914 0.1 10801.2 0.1X
-Wide column with UTF-8 150988 151369
412 0.1 15098.8 0.1X
+Short column without encoding 6945 6998
59 1.4 694.5 1.0X
+Short column with UTF-8 11510 11569
51 0.9 1151.0 0.6X
+Wide column without encoding 95004 95795
790 0.1 9500.4 0.1X
+Wide column with UTF-8 149223 149409
276 0.1 14922.3 0.0X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux
4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
JSON functions: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
-Text read 753 765
18 13.3 75.3 1.0X
-from_json 23182 23446
230 0.4 2318.2 0.0X
-json_tuple 31129 31304
181 0.3 3112.9 0.0X
-get_json_object 22821 23073
225 0.4 2282.1 0.0X
+Text read 649 652
3 15.4 64.9 1.0X
+from_json 22284 22393
99 0.4 2228.4 0.0X
+json_tuple 32310 32824
484 0.3 3231.0 0.0X
+get_json_object 22111 22751
568 0.5 2211.1 0.0X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux
4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Dataset of json strings: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
-Text read 3078 3101
26 16.2 61.6 1.0X
-schema inferring 30225 30434
333 1.7 604.5 0.1X
-parsing 32237 32308
63 1.6 644.7 0.1X
+Text read 2894 2903
8 17.3 57.9 1.0X
+schema inferring 26724 26785
62 1.9 534.5 0.1X
+parsing 37502 37632
131 1.3 750.0 0.1X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux
4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Json files in the per-line mode: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
-Text read 10835 10900
86 4.6 216.7 1.0X
-Schema inferring 37720 37805
110 1.3 754.4 0.3X
-Parsing without charset 35464 35538
100 1.4 709.3 0.3X
-Parsing with UTF-8 67311 67738
381 0.7 1346.2 0.2X
+Text read 10994 11010
16 4.5 219.9 1.0X
+Schema inferring 45654 45677
37 1.1 913.1 0.2X
+Parsing without charset 34476 34559
73 1.5 689.5 0.3X
+Parsing with UTF-8 56987 57002
13 0.9 1139.7 0.2X
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux
4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Write dates and timestamps: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
-Create a dataset of timestamps 2208 2222
14 4.5 220.8 1.0X
-to_json(timestamp) 14299 14570
285 0.7 1429.9 0.2X
-write timestamps to files 12955 12969
13 0.8 1295.5 0.2X
-Create a dataset of dates 2297 2323
30 4.4 229.7 1.0X
-to_json(date) 8509 8561
74 1.2 850.9 0.3X
-write dates to files 6786 6827
45 1.5 678.6 0.3X
+Create a dataset of timestamps 2150 2188
35 4.7 215.0 1.0X
+to_json(timestamp) 17874 18080
294 0.6 1787.4 0.1X
+write timestamps to files 12518 12538
34 0.8 1251.8 0.2X
+Create a dataset of dates 2298 2310
18 4.4 229.8 0.9X
+to_json(date) 11673 11703
27 0.9 1167.3 0.2X
+write dates to files 7121 7135
12 1.4 712.1 0.3X
OpenJDK 64-Bit Server VM 11.0.7+10-post-Ubuntu-2ubuntu218.04 on Linux
4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Read dates and timestamps: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
-read timestamp text from files 2598 2613
18 3.8 259.8 1.0X
-read timestamps from files 42007 42028
19 0.2 4200.7 0.1X
-infer timestamps from files 18102 18120
28 0.6 1810.2 0.1X
-read date text from files 2355 2360
5 4.2 235.5 1.1X
-read date from files 17420 17458
33 0.6 1742.0 0.1X
-timestamp strings 3099 3101
3 3.2 309.9 0.8X
-parse timestamps from Dataset[String] 48188 48215
25 0.2 4818.8 0.1X
-infer timestamps from Dataset[String] 22929 22988
102 0.4 2292.9 0.1X
-date strings 4090 4103
11 2.4 409.0 0.6X
-parse dates from Dataset[String] 24952 25068
139 0.4 2495.2 0.1X
-from_json(timestamp) 66038 66352
413 0.2 6603.8 0.0X
-from_json(date) 43755 43782
27 0.2 4375.5 0.1X
+read timestamp text from files 2616 2641
34 3.8 261.6 1.0X
+read timestamps from files 37481 37517
58 0.3 3748.1 0.1X
+infer timestamps from files 84774 84964
201 0.1 8477.4 0.0X
+read date text from files 2362 2365
3 4.2 236.2 1.1X
+read date from files 16583 16612
29 0.6 1658.3 0.2X
+timestamp strings 3927 3963
40 2.5 392.7 0.7X
+parse timestamps from Dataset[String] 52827 53004
243 0.2 5282.7 0.0X
+infer timestamps from Dataset[String] 101108 101644
769 0.1 10110.8 0.0X
Review comment:
Thanks!
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]