andygrove opened a new pull request, #3495: URL: https://github.com/apache/datafusion-comet/pull/3495
## Summary

- Fix case-insensitive column matching in the new `SparkPhysicalExprAdapterFactory` on the df52 branch
- `DefaultPhysicalExprAdapter` uses Arrow's `field_with_name`/`index_of` which are case-sensitive. When a parquet file has lowercase column names (e.g., "a") but the table schema has uppercase (e.g., "A"), the lookup fails and columns are filled with nulls
- Fix remaps physical schema field names to match logical names (case-insensitively) before passing to the default adapter, then restores original physical names in the rewritten expressions so that downstream `reassign_expr_columns` can find columns in the actual parquet stream schema

## Test plan

- [ ] Existing test "test V1 parquet native scan -- case insensitive" in `ParquetReadV1Suite` should now pass
- [ ] Existing Rust tests continue to pass
- [ ] CI passes

🤖 Generated with [Claude Code](https://claude.com/claude-code)
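For readers who want the remap-then-restore idea from the summary in concrete form, here is a minimal sketch. It is not the code in this PR: it works on plain string column names rather than Arrow schemas or DataFusion expressions, and the helper names `remap_physical_names` and `restore_physical_name` are hypothetical. It also assumes the logical schema has no two names that differ only by case; if it did, this sketch would silently pick one of them.

```rust
use std::collections::HashMap;

/// Hypothetical helper: rename each physical (file) column to the logical
/// (table) name that matches it case-insensitively, keeping the physical
/// name when nothing matches. Also returns a map from each remapped name
/// back to the original physical name.
fn remap_physical_names(
    physical: &[&str],
    logical: &[&str],
) -> (Vec<String>, HashMap<String, String>) {
    // Index logical names by lowercase form for case-insensitive lookup.
    // Assumption: no two logical names collide once lowercased.
    let by_lower: HashMap<String, &str> = logical
        .iter()
        .map(|name| (name.to_lowercase(), *name))
        .collect();

    let mut remapped = Vec::with_capacity(physical.len());
    let mut restore = HashMap::new();
    for &phys in physical {
        let name = by_lower
            .get(&phys.to_lowercase())
            .map(|l| l.to_string())
            .unwrap_or_else(|| phys.to_string());
        // Remember how to get back to the name used in the file.
        restore.insert(name.clone(), phys.to_string());
        remapped.push(name);
    }
    (remapped, restore)
}

/// Hypothetical helper: translate a column name used after rewriting back
/// to the physical name, leaving unknown names unchanged.
fn restore_physical_name(name: &str, restore: &HashMap<String, String>) -> String {
    restore
        .get(name)
        .cloned()
        .unwrap_or_else(|| name.to_string())
}
```

With physical columns `a`, `b`, `extra` and logical columns `A`, `B`, the remapped names are `A`, `B`, `extra`, so a case-sensitive lookup for `A` now succeeds. Restoring `A` afterwards yields `a`, which is the name a later step would need to find in the file's actual schema.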
