SteveLauC opened a new pull request, #8056:
URL: https://github.com/apache/arrow-datafusion/pull/8056

   ### What does this PR do
   
   1. Store the source file info in `RecordBatch.schema.metadata` so that our 
`FilterExec` can be aware of this
   
   ### Note
   
   1. This is only for `ParquetExec`, not a general implementation, so other 
formats like JSON or CSV won't have this
   2. It is **ONLY guaranteed** that the `RecordBatch`es that **directly** come 
from a `ParquetExec` will have this info, batches generated from nodes on the 
top of a `ParquetExec` node may not have this
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to