[GitHub] [arrow-datafusion] alamb commented on issue #1972: DataFusion Optimizer framework discussion

GitBox Thu, 24 Mar 2022 13:06:48 -0700


alamb commented on issue #1972:
URL: https://github.com/apache/arrow-datafusion/issues/1972#issuecomment-1078116201



   > But this seems like such an obvious solution that I'm positive there are
   > probably a number of fundamental issues with it beyond the selectivity being
   > made into an N-dimensional distribution, where N is the number of predicates.
   
   The typical problem here is skewed data (e.g. one value accounting for 50% of
   the rows). You also have to choose between equi-width and equi-height
   histograms, both of which have tradeoffs.
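
   To make the tradeoff concrete, here is a minimal sketch in Python. It is not DataFusion code; the helper names (`equi_width_edges`, `equi_height_edges`, `bucket_counts`) and the sample data are made up for illustration. The data set has 100 rows, half of which hold the single value 7.

```python
from bisect import bisect_right


def equi_width_edges(values, buckets):
    """Bucket boundaries that split the value range into equal-sized intervals."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / buckets
    return [lo + i * width for i in range(buckets + 1)]


def equi_height_edges(values, buckets):
    """Bucket boundaries taken at evenly spaced ranks of the sorted values."""
    ordered = sorted(values)
    n = len(ordered)
    return [ordered[min(n - 1, (i * n) // buckets)] for i in range(buckets + 1)]


def bucket_counts(values, edges):
    """Count rows per bucket; the last bucket includes its upper edge."""
    counts = [0] * (len(edges) - 1)
    for v in values:
        idx = min(bisect_right(edges, v) - 1, len(counts) - 1)
        counts[idx] += 1
    return counts


# Hypothetical skewed column: the value 7 makes up 50% of the rows.
skewed = [7] * 50 + list(range(0, 100, 2))

width_edges = equi_width_edges(skewed, 4)
width_counts = bucket_counts(skewed, width_edges)
height_edges = equi_height_edges(skewed, 4)

print("equi-width edges: ", width_edges)
print("equi-width counts:", width_counts)
print("equi-height edges:", height_edges)
```

   With equi-width buckets, the first bucket holds 63 rows, and nothing in the histogram distinguishes the 50 copies of 7 from the 13 other values sharing that bucket. With equi-height buckets, the boundary 7 appears twice, so the heavy value is visible, but the boundaries have to be derived from sorted (or sampled) data and a bucket can collapse to a single point.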


