u3Izx9ql7vW4 opened a new issue, #43239: URL: https://github.com/apache/arrow/issues/43239
### Describe the enhancement requested There have a been a number of issues (https://github.com/apache/arrow/issues/26818, https://github.com/apache/arrow/issues/39030) raised regarding Dataset not preserving ordering. This feature is present in [`pyarrow.parquet.write_table`](https://arrow.apache.org/docs/python/generated/pyarrow.parquet.write_table.html), which supports `sorting_columns`: > Specify the sort order of the data being written. The writer does not sort the data nor does it verify that the data is sorted. The sort order is written to the row group metadata, which can then be used by readers. This issue requests the ability to save Datasets such that when loaded they are ordered. ### Component(s) Python -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
