2010YOUY01 commented on PR #18448: URL: https://github.com/apache/datafusion/pull/18448#issuecomment-3478662746
> > What is this series of PRs implementing? Is it for query anti-pattern in [#18393 (comment)](https://github.com/apache/datafusion/pull/18393#issuecomment-3474879801), stat pruning is not working, so we're pushing down build-side dynamic filter like `key in [1,5,23]` to probe side parquet scan, and use parquet builtin bloom filter to effectively skip prunable units? > > Yes that is precisely it. I'm not sure what you mean by `query anti-pattern` though. Maybe `query anti-pattern` is not accurate, but I'm just describing in that query, stat pruning is not working due to the given parquet file is not well clustered on the pruning column. > I'll put a summary and links to the relevant PRs in [#17171 (comment)](https://github.com/apache/datafusion/issues/17171#issuecomment-3478645163). That said #17171 is already a pretty busy issue so it's buried at the bottom... I'm not sure what more I can do about that. Perhaps edit the issue top description, or open a new one? 🤔 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
