agoose77 opened a new issue #10580:
URL: https://github.com/apache/arrow/issues/10580


   Hi all, thanks for the useful library!
   
   I noticed when calling `pyarrow.dataset.parquet_dataset` that the order of 
the files (`dataset.files`) does not match that of the `_metadata` row group 
columns. I'm not an Arrow expert by any means, but is this intentional?
   
   I think the unordered map is the culprit, but I have not recompiled to test 
this theory. 
https://github.com/apache/arrow/blob/133b1a904bf7fc1d24343c306a2279e27d4ebe6d/cpp/src/arrow/dataset/file_parquet.cc#L870


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to