Re: [PR] refactor: make dfschema wrap schemaref [arrow-datafusion]

via GitHub Thu, 21 Mar 2024 09:07:19 -0700


alamb commented on PR #9595:
URL: 
https://github.com/apache/arrow-datafusion/pull/9595#issuecomment-2012799977


   Ok, I re-ran the benchmark and indeed it shows this PT is actually 4x faster 
for queries that select large numbers of columns ❤️ 
   
   ```
   group                            dfschema                               main
   -----                            --------                               ----
   logical_plan_tpch_all            1.01     16.4±0.15ms        ? ?/sec    1.00 
    16.2±0.20ms        ? ?/sec
   logical_select_all_from_1000     1.00     25.6±0.26ms        ? ?/sec    3.86 
    98.8±0.44ms        ? ?/sec
   physical_plan_tpch_all           1.00    127.3±0.82ms        ? ?/sec    1.01 
   128.2±0.60ms        ? ?/sec
   physical_select_all_from_1000    1.00    175.1±0.53ms        ? ?/sec    4.14 
   724.3±1.56ms        ? ?/sec
   ```
   
   
   <details><summary>Benchmark scripts</summary>
   <p>
   
   ```shell
   git checkout feat/make-dfschema-wrap-schemaref
   cargo bench --bench sql_planner -- all --save-baseline dfschema
   
   MERGE_BASE=$(git merge-base HEAD apache/main)
   echo "** Comparing to ${MERGE_BASE}"
   
   git checkout ${MERGE_BASE}
   cargo bench --bench sql_planner -- all --save-baseline main
   
   critcmp main dfschema
   ```
   
   </p>
   </details> 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Re: [PR] refactor: make dfschema wrap schemaref [arrow-datafusion]

Reply via email to