alamb opened a new pull request #380: URL: https://github.com/apache/arrow-datafusion/pull/380
# Rationale
As explained in #363, the high-level goal is to make the parquet row group pruning logic generic over any type of min/max statistics (not just parquet metadata).

# Changes:
1. Introduce a new `PruningStatistics` trait
2. Refactor `PruningPredicateBuilder` to be generic in terms of `PruningStatistics`
3. Add documentation and tests

# Notes:
I am leaving this PR in draft state, in case anyone is interested, while I:
1. merge its dependencies (see below)
2. work on a POC demonstrating the use in IOx

# Sequence:
I am trying to do this in a few small PRs to reduce the review burden. Here is how they connect together:

Planned changes:
- [x] Refactor code into a new module (https://github.com/apache/arrow-datafusion/pull/365)
- [x] Return bool rather than parquet specific output (https://github.com/apache/arrow-datafusion/pull/370)
- [ ] Add `ScalarValue::iter_to_array` (TBD PR)
- [ ] Add `PruningStatistics` Trait (this PR)

--
This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
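To illustrate the idea in the Rationale, here is a minimal sketch of pruning against a generic min/max statistics source. It is not the code in this PR: the trait methods shown (`num_containers`, `min_value`, `max_value`), the `InMemoryStats` type, and the `containers_to_scan` and `example_stats` helpers are all hypothetical, and plain `i64` values stand in for Arrow arrays to keep the example dependency-free.

```rust
use std::collections::HashMap;

/// Hypothetical, simplified stand-in for a `PruningStatistics`-style trait.
/// A "container" is any unit that can be skipped as a whole (e.g. a row group).
pub trait PruningStatistics {
    /// Number of containers described by these statistics.
    fn num_containers(&self) -> usize;
    /// Minimum value of `column` in `container`, or `None` if unknown.
    fn min_value(&self, column: &str, container: usize) -> Option<i64>;
    /// Maximum value of `column` in `container`, or `None` if unknown.
    fn max_value(&self, column: &str, container: usize) -> Option<i64>;
}

/// Statistics held in memory: column name -> one (min, max) pair per container.
pub struct InMemoryStats {
    pub ranges: HashMap<String, Vec<(i64, i64)>>,
    pub num_containers: usize,
}

impl PruningStatistics for InMemoryStats {
    fn num_containers(&self) -> usize {
        self.num_containers
    }
    fn min_value(&self, column: &str, container: usize) -> Option<i64> {
        self.ranges.get(column)?.get(container).map(|r| r.0)
    }
    fn max_value(&self, column: &str, container: usize) -> Option<i64> {
        self.ranges.get(column)?.get(container).map(|r| r.1)
    }
}

/// For the predicate `column > literal`, return one bool per container:
/// `false` means no row can match, so the container can be skipped.
/// Missing statistics are treated conservatively (the container is kept).
pub fn containers_to_scan<S: PruningStatistics>(
    stats: &S,
    column: &str,
    literal: i64,
) -> Vec<bool> {
    (0..stats.num_containers())
        .map(|i| match stats.max_value(column, i) {
            Some(max) => max > literal,
            None => true,
        })
        .collect()
}

/// Three containers with ranges [0, 5], [3, 10] and [6, 9] for column `x`.
pub fn example_stats() -> InMemoryStats {
    let mut ranges = HashMap::new();
    ranges.insert("x".to_string(), vec![(0, 5), (3, 10), (6, 9)]);
    InMemoryStats { ranges, num_containers: 3 }
}
```

Because `containers_to_scan` only depends on the trait, the same pruning logic would work for parquet metadata or any other source that can report per-container minimum and maximum values.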