Re: [PR] Cache common plan properties to eliminate recursive calls in physical plan [arrow-datafusion]

via GitHub Wed, 28 Feb 2024 01:40:44 -0800


simonvandel commented on PR #9346:
URL: 
https://github.com/apache/arrow-datafusion/pull/9346#issuecomment-1968588865


   I was curious on how this affected planning performance. Here are my results 
comparing 0c46d7fa105fddc4a35a4c99e4aa2a063d967abb (parent of first commit in 
this PR) with a8fac85429f519ea5ae258a2a6425eaa9ab333c8.
   
   Seems like there are quite big regressions in physical planning. But I may 
have messed up.
   
   <details>
   
   <summary>Results</summary>
   
   ```
   mold -run cargo bench --bench sql_planner --profile release-nonlto -- 
--baseline=0c46d7fa105fddc4a35a4c99e4aa2a063d967abb
        Running benches/sql_planner.rs 
(/home/svs/code/arrow-datafusion/target/release-nonlto/deps/sql_planner-355e1e15827c5f11)
   Gnuplot not found, using plotters backend
   logical_select_one_from_700
                           time:   [576.63 µs 577.55 µs 578.47 µs]
                           change: [+1.0656% +1.7066% +2.3733%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 6 outliers among 100 measurements (6.00%)
     1 (1.00%) low severe
     3 (3.00%) high mild
     2 (2.00%) high severe
   
   physical_select_one_from_700
                           time:   [4.0361 ms 4.0488 ms 4.0629 ms]
                           change: [-1.9349% -1.3951% -0.9091%] (p = 0.00 < 
0.05)
                           Change within noise threshold.
   Found 4 outliers among 100 measurements (4.00%)
     3 (3.00%) high mild
     1 (1.00%) high severe
   
   logical_trivial_join_low_numbered_columns
                           time:   [586.18 µs 587.20 µs 588.35 µs]
                           change: [-0.4208% +1.0724% +3.2859%] (p = 0.32 > 
0.05)
                           No change in performance detected.
   Found 10 outliers among 100 measurements (10.00%)
     10 (10.00%) high severe
   
   logical_trivial_join_high_numbered_columns
                           time:   [627.61 µs 628.75 µs 629.94 µs]
                           change: [-1.5180% -1.0025% -0.5763%] (p = 0.00 < 
0.05)
                           Change within noise threshold.
   Found 5 outliers among 100 measurements (5.00%)
     4 (4.00%) high mild
     1 (1.00%) high severe
   
   Benchmarking logical_aggregate_with_join: Warming up for 3.0000 s
   Warning: Unable to complete 100 samples in 5.0s. You may wish to increase 
target time to 5.5s, enable flat sampling, or reduce sample count to 60.
   logical_aggregate_with_join
                           time:   [1.0842 ms 1.0866 ms 1.0893 ms]
                           change: [-1.5018% -0.9371% -0.4389%] (p = 0.00 < 
0.05)
                           Change within noise threshold.
   Found 7 outliers among 100 measurements (7.00%)
     4 (4.00%) high mild
     3 (3.00%) high severe
   
   physical_plan_tpch_q1   time:   [7.8999 ms 7.9398 ms 7.9805 ms]
                           change: [+1113.8% +1122.0% +1129.1%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   
   physical_plan_tpch_q2   time:   [11.743 ms 11.806 ms 11.878 ms]
                           change: [+1622.6% +1632.4% +1644.4%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 1 outliers among 100 measurements (1.00%)
     1 (1.00%) high severe
   
   physical_plan_tpch_q3   time:   [3.8676 ms 3.8778 ms 3.8886 ms]
                           change: [+625.22% +628.60% +631.62%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 13 outliers among 100 measurements (13.00%)
     9 (9.00%) high mild
     4 (4.00%) high severe
   
   physical_plan_tpch_q4   time:   [2.9474 ms 2.9549 ms 2.9635 ms]
                           change: [+442.21% +445.51% +448.46%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 7 outliers among 100 measurements (7.00%)
     4 (4.00%) high mild
     3 (3.00%) high severe
   
   physical_plan_tpch_q5   time:   [5.8592 ms 5.8881 ms 5.9182 ms]
                           change: [+903.01% +912.33% +920.23%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 5 outliers among 100 measurements (5.00%)
     5 (5.00%) high mild
   
   Benchmarking physical_plan_tpch_q6: Warming up for 3.0000 s
   Warning: Unable to complete 100 samples in 5.0s. You may wish to increase 
target time to 9.3s, enable flat sampling, or reduce sample count to 50.
   physical_plan_tpch_q6   time:   [1.8279 ms 1.8300 ms 1.8322 ms]
                           change: [+358.95% +361.52% +365.24%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 4 outliers among 100 measurements (4.00%)
     1 (1.00%) high mild
     3 (3.00%) high severe
   
   physical_plan_tpch_q7   time:   [8.2906 ms 8.3330 ms 8.3757 ms]
                           change: [+950.61% +957.43% +963.46%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   
   physical_plan_tpch_q8   time:   [11.821 ms 11.870 ms 11.920 ms]
                           change: [+1331.7% +1339.7% +1347.4%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   
   physical_plan_tpch_q9   time:   [8.8488 ms 8.8885 ms 8.9294 ms]
                           change: [+1161.1% +1169.6% +1177.0%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 1 outliers among 100 measurements (1.00%)
     1 (1.00%) high mild
   
   physical_plan_tpch_q10  time:   [6.0025 ms 6.0278 ms 6.0543 ms]
                           change: [+829.09% +837.51% +845.36%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 7 outliers among 100 measurements (7.00%)
     7 (7.00%) high mild
   
   physical_plan_tpch_q11  time:   [4.8230 ms 4.8421 ms 4.8622 ms]
                           change: [+716.64% +721.76% +726.08%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 6 outliers among 100 measurements (6.00%)
     4 (4.00%) high mild
     2 (2.00%) high severe
   
   physical_plan_tpch_q12  time:   [3.8354 ms 3.8458 ms 3.8573 ms]
                           change: [+553.06% +556.86% +560.04%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 15 outliers among 100 measurements (15.00%)
     10 (10.00%) high mild
     5 (5.00%) high severe
   
   physical_plan_tpch_q13  time:   [2.4938 ms 2.4991 ms 2.5052 ms]
                           change: [+420.96% +422.92% +424.70%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 5 outliers among 100 measurements (5.00%)
     4 (4.00%) high mild
     1 (1.00%) high severe
   
   physical_plan_tpch_q14  time:   [3.2362 ms 3.2430 ms 3.2510 ms]
                           change: [+527.89% +530.83% +533.51%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 4 outliers among 100 measurements (4.00%)
     1 (1.00%) high mild
     3 (3.00%) high severe
   
   physical_plan_tpch_q16  time:   [4.9522 ms 4.9715 ms 4.9919 ms]
                           change: [+840.97% +846.02% +850.48%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 5 outliers among 100 measurements (5.00%)
     5 (5.00%) high mild
   
   physical_plan_tpch_q17  time:   [4.7371 ms 4.7556 ms 4.7757 ms]
                           change: [+848.11% +853.56% +858.89%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 7 outliers among 100 measurements (7.00%)
     5 (5.00%) high mild
     2 (2.00%) high severe
   
   physical_plan_tpch_q18  time:   [5.4681 ms 5.5619 ms 5.6655 ms]
                           change: [+851.77% +869.08% +890.15%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 10 outliers among 100 measurements (10.00%)
     8 (8.00%) high mild
     2 (2.00%) high severe
   
   physical_plan_tpch_q19  time:   [9.6241 ms 9.6606 ms 9.6985 ms]
                           change: [+1401.0% +1409.1% +1416.4%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   
   physical_plan_tpch_q20  time:   [6.1281 ms 6.1519 ms 6.1765 ms]
                           change: [+914.60% +921.14% +926.79%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   
   physical_plan_tpch_q21  time:   [9.0428 ms 9.0806 ms 9.1191 ms]
                           change: [+991.69% +997.40% +1003.3%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 1 outliers among 100 measurements (1.00%)
     1 (1.00%) high mild
   
   physical_plan_tpch_q22  time:   [4.3348 ms 4.3436 ms 4.3534 ms]
                           change: [+532.03% +534.80% +537.16%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 8 outliers among 100 measurements (8.00%)
     5 (5.00%) high mild
     3 (3.00%) high severe
   
   physical_plan_tpch_all  time:   [12.755 ms 12.775 ms 12.796 ms]
                           change: [-0.8907% -0.6753% -0.4745%] (p = 0.00 < 
0.05)
                           Change within noise threshold.
   Found 3 outliers among 100 measurements (3.00%)
     2 (2.00%) high mild
     1 (1.00%) high severe
   ```
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Re: [PR] Cache common plan properties to eliminate recursive calls in physical plan [arrow-datafusion]

Reply via email to