alamb commented on issue #11212:
URL: https://github.com/apache/datafusion/issues/11212#issuecomment-2755686534

   Thank you for bringing this up again @acking-you 
   
   > If we can optimize the specialized query you mentioned and not slowing 
down other queries, it would be nice to have it.
   
   I agree with @jayzhan211 -- if we can improve performance in your scenario 
without slowing down other queries, this is a great thing to do.
   
   > I attempted to reproduce our usage scenario using the hits dataset from 
clickbench hits:
   
   Nice! Thank you for this analysis
   
   Perhaps as a first step you could add this query (or a derivative of it) 
to the "extended" clickbench suite (which is DataFusion-specific but covers 
some important cases): 
https://github.com/apache/datafusion/tree/main/benchmarks/queries/clickbench#extended-queries
   
   The next step would be to make a PR with the optimization -- we can then use 
the query from the first step to verify that performance is improved and that 
other queries are not slowed down.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

