adriangb opened a new pull request, #19962:
URL: https://github.com/apache/datafusion/pull/19962
## Summary
Extract benchmarks and sqllogictest cases from #19538 for easier review.
This PR includes:
- **New Benchmark**: `parquet_struct_query.rs` - Benchmarks SQL queries on
struct columns in Parquet files
- 524,288 rows across 8 row groups
- 20 benchmark queries covering struct access, filtering, joins, and
aggregations
- Struct schema: `id` (Int32) and `s` (Struct with `id`/Int32 and
`value`/Utf8 fields)
- **SQLLogicTest**: `projection_pushdown.slt` - Tests for projection
pushdown optimization
## Changes
- Added `datafusion/core/benches/parquet_struct_query.rs`
- Updated `datafusion/core/Cargo.toml` with benchmark entry
- Added `datafusion/sqllogictest/test_files/projection_pushdown.slt`
## Test Plan
- Run benchmark: `cargo bench --profile dev --bench parquet_struct_query`
- All 20 benchmark queries execute successfully
- Parquet file generated with correct row count (524,288) and row groups (8)
🤖 Generated with [Claude Code](https://claude.com/claude-code)
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]