[GitHub] spark pull request #22501: [SPARK-25492][TEST] Refactor WideSchemaBenchmark ...

dongjoon-hyun Thu, 18 Oct 2018 12:42:26 -0700

Github user dongjoon-hyun commented on a diff in the pull request:

    https://github.com/apache/spark/pull/22501#discussion_r226439834
  
    --- Diff: sql/core/benchmarks/WideSchemaBenchmark-results.txt ---
    @@ -1,117 +1,145 @@
    -Java HotSpot(TM) 64-Bit Server VM 1.8.0_92-b14 on Mac OS X 10.11.6
    -Intel(R) Core(TM) i7-4980HQ CPU @ 2.80GHz
    
+================================================================================================
    +parsing large select expressions
    
+================================================================================================
     
    +Java HotSpot(TM) 64-Bit Server VM 1.8.0_151-b12 on Mac OS X 10.12.6
    +Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz
     parsing large select:                    Best/Avg Time(ms)    Rate(M/s)   
Per Row(ns)   Relative
     
------------------------------------------------------------------------------------------------
    -1 select expressions                             2 /    4          0.0     
2050147.0       1.0X
    -100 select expressions                           6 /    7          0.0     
6123412.0       0.3X
    -2500 select expressions                        135 /  141          0.0   
134623148.0       0.0X
    +1 select expressions                             2 /    4          0.0     
1934953.0       1.0X
    +100 select expressions                           4 /    5          0.0     
3659399.0       0.5X
    +2500 select expressions                         68 /   76          0.0    
68278937.0       0.0X
     
    -Java HotSpot(TM) 64-Bit Server VM 1.8.0_92-b14 on Mac OS X 10.11.6
    -Intel(R) Core(TM) i7-4980HQ CPU @ 2.80GHz
     
    
+================================================================================================
    +many column field read and write
    
+================================================================================================
    +
    +Java HotSpot(TM) 64-Bit Server VM 1.8.0_151-b12 on Mac OS X 10.12.6
    +Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz
     many column field r/w:                   Best/Avg Time(ms)    Rate(M/s)   
Per Row(ns)   Relative
     
------------------------------------------------------------------------------------------------
    -1 cols x 100000 rows (read in-mem)              16 /   18          6.3     
    158.6       1.0X
    -1 cols x 100000 rows (exec in-mem)              17 /   19          6.0     
    166.7       1.0X
    -1 cols x 100000 rows (read parquet)             24 /   26          4.3     
    235.1       0.7X
    -1 cols x 100000 rows (write parquet)            81 /   85          1.2     
    811.3       0.2X
    -100 cols x 1000 rows (read in-mem)              17 /   19          6.0     
    166.2       1.0X
    -100 cols x 1000 rows (exec in-mem)              25 /   27          4.0     
    249.2       0.6X
    -100 cols x 1000 rows (read parquet)             23 /   25          4.4     
    226.0       0.7X
    -100 cols x 1000 rows (write parquet)            83 /   87          1.2     
    831.0       0.2X
    -2500 cols x 40 rows (read in-mem)              132 /  137          0.8     
   1322.9       0.1X
    -2500 cols x 40 rows (exec in-mem)              326 /  330          0.3     
   3260.6       0.0X
    -2500 cols x 40 rows (read parquet)             831 /  839          0.1     
   8305.8       0.0X
    -2500 cols x 40 rows (write parquet)            237 /  245          0.4     
   2372.6       0.1X
    -
    -Java HotSpot(TM) 64-Bit Server VM 1.8.0_92-b14 on Mac OS X 10.11.6
    -Intel(R) Core(TM) i7-4980HQ CPU @ 2.80GHz
    +1 cols x 100000 rows (read in-mem)              22 /   25          4.6     
    219.4       1.0X
    +1 cols x 100000 rows (exec in-mem)              22 /   28          4.5     
    223.8       1.0X
    +1 cols x 100000 rows (read parquet)             45 /   49          2.2     
    449.6       0.5X
    +1 cols x 100000 rows (write parquet)           204 /  223          0.5     
   2044.4       0.1X
    --- End diff --
    
    This might be a little regression on Parquet writer from Spark 2.1.0 
(SPARK-17335).
    
    cc @cloud-fan and @gatorsmile , @rdblue



---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark pull request #22501: [SPARK-25492][TEST] Refactor WideSchemaBenchmark ...

Reply via email to