[PR] fix: [df52] handle case-insensitive column matching in PhysicalExprAdapter [datafusion-comet]

via GitHub Wed, 11 Feb 2026 16:35:00 -0800


andygrove opened a new pull request, #3495:
URL: https://github.com/apache/datafusion-comet/pull/3495


   ## Summary
   
   - Fix case-insensitive column matching in the new 
`SparkPhysicalExprAdapterFactory` on the df52 branch
   - `DefaultPhysicalExprAdapter` uses Arrow's `field_with_name`/`index_of` 
which are case-sensitive. When a parquet file has lowercase column names (e.g., 
"a") but the table schema has uppercase (e.g., "A"), the lookup fails and 
columns are filled with nulls
   - Fix remaps physical schema field names to match logical names 
(case-insensitively) before passing to the default adapter, then restores 
original physical names in the rewritten expressions so that downstream 
`reassign_expr_columns` can find columns in the actual parquet stream schema
   
   ## Test plan
   
   - [ ] Existing test "test V1 parquet native scan -- case insensitive" in 
`ParquetReadV1Suite` should now pass
   - [ ] Existing Rust tests continue to pass
   - [ ] CI passes
   
   🤖 Generated with [Claude Code](https://claude.com/claude-code)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[PR] fix: [df52] handle case-insensitive column matching in PhysicalExprAdapter [datafusion-comet]

Reply via email to