tsreaper opened a new pull request, #295: URL: https://github.com/apache/flink-table-store/pull/295
Currently we have two types of files to write: * Data files (LSM tree files), where a level 0 data file is a single file and a level >= 1 data file is a set of rolling files. Statistics for these files are needed for pruning when scanning. * Extra files (changelog files), just a list of records. No statistics are needed. However, current writers are all based on `MetricFileWriter`, which always produces statistics. We'd like to refactor the writers and group them into `SingleFileWriter` and `RollingFileWriter`. `StatsCollectingSingleFileWriter` should be a subclass of `SingleFileWriter` which additionally produces statistics, and data file writers should be a subclass of `StatsCollectingSingleFileWriter` or `RollingFileWriter` based on their level. For extra file writers, extending from `SingleFileWriter` is enough. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
