suremarc commented on issue #8227: URL: https://github.com/apache/datafusion/issues/8227#issuecomment-2458134570
> If we have per-partition statistics, merging them will be problematic for NDV. Extrapolation techniques are not likely to work. Ok, well I suppose we can keep the existing global statistics and add a new per-partition statistics method (that defaults to returning the global statistics for each partition). That would probably be a less invasive change too. Would be happy to discuss the details more over on #10316 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
