Dandandan opened a new pull request #8998: URL: https://github.com/apache/arrow/pull/8998
This adds extra statistics on the number of nulls per column. This is a step towards supporting more cost-based optimizations. The second step is adding the number of distinct values and the min and max values. With those, we can get a good estimate of filter selectivity, which supports more cases in which we could apply optimizations such as reordering joins.
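To illustrate how a per-column null count could feed a selectivity estimate, here is a minimal sketch. It is not the code in this pull request: `ColumnStatistics`, `collect_null_counts`, `is_null_selectivity`, and `is_not_null_selectivity` are hypothetical names chosen for the example, and columns are modeled as plain Python sequences in which `None` stands for a null.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass
class ColumnStatistics:
    """Hypothetical per-column statistics (an assumption, not the PR's type)."""
    # None means the null count is unknown for this column.
    null_count: Optional[int] = None


def collect_null_counts(columns: Sequence[Sequence[object]]) -> List[ColumnStatistics]:
    """Count the nulls (None values) in each column."""
    return [
        ColumnStatistics(null_count=sum(1 for value in column if value is None))
        for column in columns
    ]


def is_null_selectivity(stats: ColumnStatistics, num_rows: int) -> Optional[float]:
    """Estimate the fraction of rows that pass `col IS NULL`.

    Returns None when the estimate cannot be made (unknown null count
    or an empty input).
    """
    if stats.null_count is None or num_rows <= 0:
        return None
    return stats.null_count / num_rows


def is_not_null_selectivity(stats: ColumnStatistics, num_rows: int) -> Optional[float]:
    """Estimate the fraction of rows that pass `col IS NOT NULL`."""
    selectivity = is_null_selectivity(stats, num_rows)
    return None if selectivity is None else 1.0 - selectivity
```

With a null count of 2 over 4 rows, both estimates come out to 0.5. Returning `None` for an unknown count, rather than guessing a default, lets an optimizer skip a rewrite when it has no statistics to rely on.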
