adriangb opened a new pull request, #22348: URL: https://github.com/apache/datafusion/pull/22348
## Which issue does this PR close? Relates to the discussion in #22024 about the Parquet datasource crate becoming hard to navigate. Split out of #22156, which bundled several code-motion moves into one PR — this is one of three smaller, independently-reviewable PRs that replace it. ## Rationale for this change `row_group_filter.rs` had grown to ~1,900 LOC. It mixes "data we loaded from the file" with "the access-plan filter that consumes it." This PR is **pure code motion**: no behavior change and no public API change. ## What changes are included in this PR? Extracts `BloomFilterStatistics` — the loaded Split Block Bloom Filter (SBBF) data plus its `PruningStatistics` adapter — from `row_group_filter.rs` into a new `bloom_filter.rs`. This separates `BloomFilterStatistics` (data loaded from the file) from `RowGroupAccessPlanFilter` (the access-plan filter that consumes it), leaving `row_group_filter.rs` focused on the latter. `BloomFilterStatistics` is crate-internal; `row_group_filter` re-exports it (`pub(crate) use`) so the existing `crate::row_group_filter::BloomFilterStatistics` path keeps resolving for in-crate callers — this PR therefore touches no other file. ## Are these changes tested? Yes, covered by existing tests. `cargo test -p datafusion-datasource-parquet --all-features` (122 passing) and `cargo clippy -p datafusion-datasource-parquet --all-targets --all-features -- -D warnings` both pass. ## Are there any user-facing changes? No. `BloomFilterStatistics` is crate-internal; this only reorganizes files inside the crate. 🤖 Generated with [Claude Code](https://claude.com/claude-code) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
