alamb commented on PR #15301:
URL: https://github.com/apache/datafusion/pull/15301#issuecomment-2752597014

   I started checking this out
   
   Some timings for the modified query (no `to_timestamp` call)
   ```sql
   -- Q24 from clickbench
   SELECT * FROM 'hits_partitioned' WHERE "URL" LIKE '%google%' ORDER BY "EventTime" LIMIT 10;
   ```
   
   ```sql
   SET datafusion.execution.parquet.pushdown_filters = true;
   SET datafusion.optimizer.enable_dynamic_filter_pushdown = true;
   ```
   
   | branch/config | time |
   |--------|--------|
   | main, no config flags | 0m6.157s |
   | main, filter_pushdown | 0m2.448s |
   | this PR, no config flags | 0m5.601s | 
   | this PR, filter_pushdown and dynamic_filter | 0m1.679s | 
   
   TL;DR: I conclude this PR helps, seemingly even with no filter pushdown (6.157s on main vs 5.601s with this PR). The benefit really shows when filter pushdown is on (2.448s on main vs 1.679s with this PR plus the dynamic filter).
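
For anyone who wants the relative numbers, here is a minimal sketch that converts the timings from the table into percentage reductions. The helper names (`to_seconds`, `reduction_pct`) are made up for illustration, and it assumes the values are wall-clock times in the `XmY.YYYs` format shown in the table.

```python
import re

# Timings copied from the table in this comment.
TIMINGS = {
    "main, no config flags": "0m6.157s",
    "main, filter_pushdown": "0m2.448s",
    "this PR, no config flags": "0m5.601s",
    "this PR, filter_pushdown and dynamic_filter": "0m1.679s",
}


def to_seconds(text: str) -> float:
    """Parse a duration such as '0m6.157s' into seconds (hypothetical helper)."""
    match = re.fullmatch(r"(\d+)m(\d+(?:\.\d+)?)s", text.strip())
    if match is None:
        raise ValueError(f"unrecognized duration: {text!r}")
    return int(match.group(1)) * 60 + float(match.group(2))


def reduction_pct(before: str, after: str) -> float:
    """Percentage by which `after` is shorter than `before`."""
    b, a = to_seconds(before), to_seconds(after)
    return (b - a) / b * 100.0


# main vs this PR, both without any config flags
no_flags = reduction_pct(
    TIMINGS["main, no config flags"],
    TIMINGS["this PR, no config flags"],
)

# main with filter_pushdown vs this PR with filter_pushdown and dynamic_filter
with_pushdown = reduction_pct(
    TIMINGS["main, filter_pushdown"],
    TIMINGS["this PR, filter_pushdown and dynamic_filter"],
)

print(f"no config flags:      {no_flags:.1f}% less time")
print(f"with filter pushdown: {with_pushdown:.1f}% less time")
```

These are single runs, so the percentages are only a rough guide and not a careful benchmark.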
   
   I will start looking at the code more carefully tomorrow


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

