Re: [I] Cannot read Parquet files that do not specify Map keys as required [arrow-rs]

2024-04-30 Thread via GitHub
tustvold closed issue #5606: Cannot read Parquet files that do not specify Map keys as required URL: https://github.com/apache/arrow-rs/issues/5606 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

Re: [I] Cannot read Parquet files that do not specify Map keys as required [arrow-rs]

2024-04-11 Thread via GitHub
jupiter commented on issue #5606: URL: https://github.com/apache/arrow-rs/issues/5606#issuecomment-2050107030 It works when removing/reducing the check with all files I tested. I have not been able to produce any files that have invalid data to match such a schema, but I'd assume it would

Re: [I] Cannot read Parquet files that do not specify Map keys as required [arrow-rs]

2024-04-08 Thread via GitHub
tustvold commented on issue #5606: URL: https://github.com/apache/arrow-rs/issues/5606#issuecomment-2043709651 We could probably just ignore the malformed map logical type and decode such columns as a regular list of structs. This would allow the data to be read, without needing to

Re: [I] Cannot read Parquet files that do not specify Map keys as required [arrow-rs]

2024-04-08 Thread via GitHub
jupiter commented on issue #5606: URL: https://github.com/apache/arrow-rs/issues/5606#issuecomment-2043496776 It was discussed, but I don't think that was the conclusion. The creator's issue was resolved by rewriting a file. In order to operate with precious Parquet files from huge

Re: [I] Cannot read Parquet files that do not specify Map keys as required [arrow-rs]

2024-04-08 Thread via GitHub
tustvold commented on issue #5606: URL: https://github.com/apache/arrow-rs/issues/5606#issuecomment-2043422154 The conclusion of https://github.com/apache/arrow/issues/37389 appears to be that we are correct to refuse to read such malformed files, am I missing something here? -- This is

Re: [I] Cannot read Parquet files that do not specify Map keys as required [arrow-rs]

2024-04-08 Thread via GitHub
jupiter commented on issue #5606: URL: https://github.com/apache/arrow-rs/issues/5606#issuecomment-2042869108 It was hard to say whether this should be regarded as a bug or feature request. It's a bug from the perspective that we'd expect broad compatibility. -- This is an automated