alamb commented on issue #11212: URL: https://github.com/apache/datafusion/issues/11212#issuecomment-2755686534
Thank you for bringing this up again @acking-you

> If we can optimize the specialized query you mentioned and not slowing down other queries, it would be nice to have it.

I agree with @jayzhan211 -- if we can improve performance in your scenario without slowing down other queries this is a great thing to do

> I attempted to reproduce our usage scenario using the hits dataset from clickbench hits:

Nice! Thank you for this analysis

Perhaps as a first step you could add this query (or perhaps a derivative one) to the "extended" clickbench suite (that is DataFusion specific but covers some important cases): https://github.com/apache/datafusion/tree/main/benchmarks/queries/clickbench#extended-queries

The next step would be to make a PR with the optimization -- we can then use the query in the first step to verify performance is improved and also verify other queries are not slowed down.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org