suremarc commented on PR #13293:
URL: https://github.com/apache/datafusion/pull/13293#issuecomment-2536391187

   It's been a month and I haven't seen any new proposals. IIUC the main use 
case for inexact statistics is to estimate `num_rows` and `total_byte_size` 
using estimated selectivity, which itself is "inexact". So basically we need 
point estimates for those attributes, and exact bounds for the column 
min/maxes. 
   
   Unless I'm misunderstanding, it seems like @crepererum's proposed API 
accommodates both of these use cases. Another open question is if we should try 
to unify `Interval` with `Precision`, but I think if we guard the internal 
values we will at least have the option to make this change going forward 
without breaking anything. 
   
   I am interested in getting this change in so that I can resume work on 
#13296, so I am going to start pre-emptively working on a PR with the new 
`Precision` API. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to