panbingkun commented on code in PR #40506: URL: https://github.com/apache/spark/pull/40506#discussion_r1206118575
########## sql/core/benchmarks/JsonBenchmark-jdk11-results.txt: ########## @@ -4,120 +4,121 @@ Benchmark for performance of JSON parsing Preparing data for benchmarking ... OpenJDK 64-Bit Server VM 11.0.19+7 on Linux 5.15.0-1037-azure -Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz +Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz JSON schema inferring: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative ------------------------------------------------------------------------------------------------------------------------ -No encoding 3150 3166 27 1.6 630.1 1.0X -UTF-8 is set 4572 4585 12 1.1 914.4 0.7X +No encoding 3493 3689 209 1.4 698.6 1.0X +UTF-8 is set 4954 4984 50 1.0 990.7 0.7X Preparing data for benchmarking ... OpenJDK 64-Bit Server VM 11.0.19+7 on Linux 5.15.0-1037-azure -Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz +Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz count a short column: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative ------------------------------------------------------------------------------------------------------------------------ -No encoding 2422 2475 50 2.1 484.4 1.0X -UTF-8 is set 3786 3796 14 1.3 757.2 0.6X +No encoding 2723 2771 54 1.8 544.6 1.0X +UTF-8 is set 4092 4166 99 1.2 818.4 0.7X Preparing data for benchmarking ... OpenJDK 64-Bit Server VM 11.0.19+7 on Linux 5.15.0-1037-azure -Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz +Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz count a wide column: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative ------------------------------------------------------------------------------------------------------------------------ -No encoding 5104 5170 87 0.2 5104.0 1.0X -UTF-8 is set 9229 9246 15 0.1 9228.7 0.6X +No encoding 5025 5208 175 0.2 5024.7 1.0X +UTF-8 is set 9642 9678 34 0.1 9641.7 0.5X Preparing data for benchmarking ... OpenJDK 64-Bit Server VM 11.0.19+7 on Linux 5.15.0-1037-azure -Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz +Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz select wide row: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative ------------------------------------------------------------------------------------------------------------------------ -No encoding 13977 14153 277 0.0 279538.0 1.0X -UTF-8 is set 16231 16284 70 0.0 324628.3 0.9X +No encoding 16465 22840 1784 0.0 329303.4 1.0X +UTF-8 is set 21291 21761 785 0.0 425817.2 0.8X Preparing data for benchmarking ... OpenJDK 64-Bit Server VM 11.0.19+7 on Linux 5.15.0-1037-azure -Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz +Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz Select a subset of 10 columns: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative ------------------------------------------------------------------------------------------------------------------------ -Select 10 columns 2197 2232 42 0.5 2196.7 1.0X -Select 1 column 1560 1567 9 0.6 1560.2 1.4X +Select 10 columns 2749 2881 115 0.4 2749.2 1.0X +Select 1 column 1951 2014 83 0.5 1950.9 1.4X Preparing data for benchmarking ... OpenJDK 64-Bit Server VM 11.0.19+7 on Linux 5.15.0-1037-azure -Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz +Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz creation of JSON parser per line: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative ------------------------------------------------------------------------------------------------------------------------ -Short column without encoding 688 709 18 1.5 688.3 1.0X -Short column with UTF-8 939 963 21 1.1 939.4 0.7X -Wide column without encoding 8049 8102 66 0.1 8048.7 0.1X -Wide column with UTF-8 14346 14368 28 0.1 14345.7 0.0X +Short column without encoding 773 796 21 1.3 773.0 1.0X +Short column with UTF-8 1096 1133 32 0.9 1096.1 0.7X +Wide column without encoding 8231 8389 140 0.1 8230.8 0.1X +Wide column with UTF-8 12882 13034 147 0.1 12881.9 0.1X Preparing data for benchmarking ... OpenJDK 64-Bit Server VM 11.0.19+7 on Linux 5.15.0-1037-azure -Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz +Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz JSON functions: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative ------------------------------------------------------------------------------------------------------------------------ -Text read 101 103 2 9.9 100.5 1.0X -from_json 1960 1965 6 0.5 1960.1 0.1X -json_tuple 2226 2235 13 0.4 2226.3 0.0X -get_json_object 2077 2088 12 0.5 2077.0 0.0X +Text read 99 109 9 10.1 98.6 1.0X +from_json 2766 2816 46 0.4 2766.0 0.0X +json_tuple 3064 3077 11 0.3 3063.8 0.0X +get_json_object wholestage off 2897 2917 32 0.3 2897.3 0.0X +get_json_object wholestage on 2832 2853 22 0.4 2831.6 0.0X Review Comment: Update here -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
