agoose77 opened a new issue #10580: URL: https://github.com/apache/arrow/issues/10580
Hi all, thanks for the useful library!

I noticed that when calling `pyarrow.dataset.parquet_dataset`, the order of the files (`dataset.files`) does not match the order in which the files appear in the row group columns of `_metadata`. I'm not an Arrow expert by any means, but is this intentional? I suspect the unordered map is the culprit, but I have not recompiled to test this theory.

https://github.com/apache/arrow/blob/133b1a904bf7fc1d24343c306a2279e27d4ebe6d/cpp/src/arrow/dataset/file_parquet.cc#L870

--
This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
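To make the expectation in the report concrete, here is a minimal sketch in plain Python of what "matching the `_metadata` order" could mean. It is not Arrow's implementation. The helper names `first_seen_order` and `files_follow_metadata_order` and the file names are hypothetical. The sketch assumes you already have two lists of path strings: one from `dataset.files`, and one with the `file_path` of each row group in `_metadata`, in the order the row groups are stored.

```python
from typing import Iterable, List


def first_seen_order(paths: Iterable[str]) -> List[str]:
    """Deduplicate paths, keeping the order in which each path first appears."""
    # dict preserves insertion order, so this keeps the first-seen order.
    return list(dict.fromkeys(paths))


def files_follow_metadata_order(dataset_files: Iterable[str],
                                row_group_paths: Iterable[str]) -> bool:
    """Return True if dataset_files lists files in first-seen row-group order."""
    return list(dataset_files) == first_seen_order(row_group_paths)


# Hypothetical row-group file paths, one entry per row group in _metadata.
row_group_paths = [
    "part-2.parquet",
    "part-2.parquet",
    "part-0.parquet",
    "part-1.parquet",
]

expected = first_seen_order(row_group_paths)

# A file list in any other order (here, sorted) fails the check.
print(files_follow_metadata_order(sorted(expected), row_group_paths))
# The first-seen order passes it.
print(files_follow_metadata_order(expected, row_group_paths))
```

Grouping row groups through a hash-based container loses this first-seen order, which is consistent with the theory above, although that has not been verified against the Arrow source here.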
