cbb330 opened a new pull request, #49010: URL: https://github.com/apache/arrow/pull/49010
## Summary Part 2/15 of ORC predicate pushdown implementation. ⚠️ **Depends on PR #49009 being merged first** Builds on PR #49009 to add statistics-to-expression conversion: - Convert ORC stripe statistics to Arrow Expression guarantees - Support for INT32/INT64 types - Handle NULL values correctly - Build min/max range expressions: `field >= min AND field <= max` ## Changes - Add `StripeStatsAsExpression()` function - Extract min/max from IntegerColumnStatistics - Generate guarantee expressions for SimplifyWithGuarantee() ## Example For a stripe with `id` column stats min=100, max=500: - Generates expression: `(id >= 100) AND (id <= 500) OR is_null(id)` - Used to test if predicate `id > 1000` can be satisfied (no, skip stripe) **Part of stacked PR series. Review after PR #49009.** -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
