[
https://issues.apache.org/jira/browse/ARROW-9367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17173184#comment-17173184
]
Joris Van den Bossche commented on ARROW-9367:
----------------------------------------------
bq. what if one wants to sort it by multiple columns (keys)
That is not yet possible, I think (grouping on multiple columns is kind of a
groupby operation, I assume, because only where the first column has equal
values, a second column should be used)
> [Python] Sorting on pyarrow data structures ?
> ---------------------------------------------
>
> Key: ARROW-9367
> URL: https://issues.apache.org/jira/browse/ARROW-9367
> Project: Apache Arrow
> Issue Type: Wish
> Components: Python
> Reporter: Athanassios Hatzis
> Priority: Major
> Labels: sort
>
> Hi, I consider sorting a fundamental operation for any in-memory data
> structures, including those of PyArrow.
> It would be nice if pa.array, pa.table, etc had sorting methods but I did not
> find any. One has to pass sorting indices calculated from some other library,
> such as numpy, to sort them. Sorting indices could have been calculated
> directly from PyArrow. Am I missing something here ? That increases
> significantly complexity for the developer.
> Do you have any plans on implementing such a feature in the near future ?
--
This message was sent by Atlassian Jira
(v8.3.4#803005)