Dandandan commented on pull request #8860: URL: https://github.com/apache/arrow/pull/8860#issuecomment-742651345
@XiaokunDing what about making each statistic optional, rather than the `statistics` itself? I was looking at adding statistics for a MemTable, for example, where a `num_rows` statistic is easy to add but a `total_byte_size` implementation is (currently) not so trivial. Having `num_rows` available could, for instance, allow more efficient joins. I guess the same would hold for other statistics that could be added later on.
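To make the suggestion concrete, here is a minimal sketch of what "optional per statistic" could look like. Everything in it is an assumption for illustration: the struct layout, `MemTableSketch`, and `left_is_smaller` are hypothetical names and are not taken from the PR or from the DataFusion API. Only the field names `num_rows` and `total_byte_size` come from the comment above.

```rust
/// Hypothetical statistics struct: the struct itself is always present,
/// but each individual statistic may be unknown (`None`).
#[derive(Debug, Clone, Default, PartialEq)]
pub struct Statistics {
    /// Number of rows, if known.
    pub num_rows: Option<usize>,
    /// Total size in bytes, if known.
    pub total_byte_size: Option<usize>,
}

/// Hypothetical stand-in for an in-memory table, reduced to the
/// row count of each batch it holds.
pub struct MemTableSketch {
    batch_row_counts: Vec<usize>,
}

impl MemTableSketch {
    pub fn new(batch_row_counts: Vec<usize>) -> Self {
        Self { batch_row_counts }
    }

    /// The row count is cheap to compute; the byte size is left unknown.
    pub fn statistics(&self) -> Statistics {
        Statistics {
            num_rows: Some(self.batch_row_counts.iter().sum()),
            total_byte_size: None,
        }
    }
}

/// Hypothetical planner helper: reports whether the left input has no more
/// rows than the right one, or `None` if either row count is unknown.
pub fn left_is_smaller(left: &Statistics, right: &Statistics) -> Option<bool> {
    match (left.num_rows, right.num_rows) {
        (Some(l), Some(r)) => Some(l <= r),
        _ => None,
    }
}
```

With this shape, a source that can only report `num_rows` still returns a usable `Statistics` value, and a consumer such as a join planner can fall back to its default behaviour whenever the statistic it needs is `None`.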
