alamb commented on PR #8294: URL: https://github.com/apache/arrow-datafusion/pull/8294#issuecomment-1828170161
I spoke with @tustvold and we came up with the following plan:

1. Change this PR to take an iterator of [`ColumnMetaData`](https://docs.rs/parquet/latest/parquet/format/struct.ColumnMetaData.html) and a target arrow `DataType`, so we can consolidate the "map arrow column -> parquet column" logic (which is incorrect for `struct`s, as pointed out above).
2. (As a follow-on PR) Add a test for reading/writing struct arrays with statistics (and likely file a ticket when this turns out to be wrong). The test should also ensure that reading a scalar field that appears *after* the struct field gets the correct values.
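To illustrate why the "arrow column -> parquet column" mapping goes wrong for `struct`s, here is a minimal sketch. It assumes the usual parquet layout, where each leaf of a struct is stored as its own column, so the position of a top-level arrow field no longer matches the parquet column index once a struct precedes it. `FieldType`, `leaf_count`, and `leaf_range` are hypothetical names invented for this sketch; they are not part of the arrow, parquet, or DataFusion APIs, and lists and maps are ignored.

```rust
use std::ops::Range;

/// Hypothetical, simplified stand-in for an arrow field type.
/// Only scalars and structs are modelled in this sketch.
#[derive(Debug, Clone)]
enum FieldType {
    Scalar,
    Struct(Vec<FieldType>),
}

/// Number of parquet leaf columns that a field occupies.
fn leaf_count(field: &FieldType) -> usize {
    match field {
        FieldType::Scalar => 1,
        FieldType::Struct(children) => children.iter().map(leaf_count).sum(),
    }
}

/// Range of parquet leaf column indices covered by the top-level
/// field at `arrow_idx`, or `None` if the index is out of bounds.
fn leaf_range(schema: &[FieldType], arrow_idx: usize) -> Option<Range<usize>> {
    let field = schema.get(arrow_idx)?;
    // Every field before this one shifts the start by its leaf count.
    let start: usize = schema[..arrow_idx].iter().map(leaf_count).sum();
    Some(start..start + leaf_count(field))
}
```

For a schema `[Scalar, Struct([Scalar, Scalar]), Scalar]`, the trailing scalar is top-level field 2 but parquet leaf column 3, which is the case the proposed follow-on test is meant to cover.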
