alamb commented on PR #80: URL: https://github.com/apache/parquet-testing/pull/80#issuecomment-2863844142
I think we have figured out the issue and @Liyixin95 has provided a fix here: - https://github.com/apache/arrow-rs/pull/7482 Quoting from myself on https://github.com/apache/arrow-rs/pull/5626#issuecomment-2858975922 Ok, what is happening here is as follows: arrow-rs and arrow-cpp (and potentially polars) add a special file metadata field called "ARROW:schema" that records the desired Arrow schema. This is described in more detail here: - https://github.com/apache/arrow-rs/pull/7479 In order for the arrow-rs parquet reader to read the data as a duration it needs to interpret the contents of that metadata hint. So I suggest we close this PR and go with the fix in arrow-rs -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
