u3Izx9ql7vW4 opened a new issue, #43239:
URL: https://github.com/apache/arrow/issues/43239

   ### Describe the enhancement requested
   
   There have a been a number of issues 
(https://github.com/apache/arrow/issues/26818, 
https://github.com/apache/arrow/issues/39030) raised regarding Dataset not 
preserving ordering. 
   
   This feature is present in 
[`pyarrow.parquet.write_table`](https://arrow.apache.org/docs/python/generated/pyarrow.parquet.write_table.html),
 which supports `sorting_columns`:
   
   > Specify the sort order of the data being written. The writer does not sort 
the data nor does it verify that the data is sorted. The sort order is written 
to the row group metadata, which can then be used by readers.
   
   This issue requests the ability to save Datasets such that when loaded they 
are ordered.  
   
   ### Component(s)
   
   Python


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to