suremarc commented on PR #13293: URL: https://github.com/apache/datafusion/pull/13293#issuecomment-2536391187
It's been a month and I haven't seen any new proposals. IIUC the main use case for inexact statistics is to estimate `num_rows` and `total_byte_size` using estimated selectivity, which itself is "inexact". So basically we need point estimates for those attributes, and exact bounds for the column min/maxes. Unless I'm misunderstanding, it seems like @crepererum's proposed API accommodates both of these use cases. Another open question is if we should try to unify `Interval` with `Precision`, but I think if we guard the internal values we will at least have the option to make this change going forward without breaking anything. I am interested in getting this change in so that I can resume work on #13296, so I am going to start pre-emptively working on a PR with the new `Precision` API. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For additional commands, e-mail: github-h...@datafusion.apache.org