[
https://issues.apache.org/jira/browse/SPARK-3655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14254973#comment-14254973
]
koert kuipers commented on SPARK-3655:
--------------------------------------
Imran,
I think the groupAndSort function is easy to implement with the code in
this pullreq (which includes the custom partitioning and sorting by key +
value that you would need), but i agree with you that the rest of what you
suggest involves a bigger change. So it's probably better to create a new
jira and start with it after this one is done. We will be able to
re-use/refactor the code of this pullreq which by then hopefully has been
merged into master.
I have some more questions involving types and design, but i will send
those to you offline.
On Sat, Dec 20, 2014 at 3:03 PM, Imran Rashid (JIRA) <[email protected]>
> Support sorting of values in addition to keys (i.e. secondary sort)
> -------------------------------------------------------------------
>
> Key: SPARK-3655
> URL: https://issues.apache.org/jira/browse/SPARK-3655
> Project: Spark
> Issue Type: New Feature
> Components: Spark Core
> Affects Versions: 1.1.0, 1.2.0
> Reporter: koert kuipers
> Assignee: Koert Kuipers
> Priority: Minor
>
> Now that spark has a sort based shuffle, can we expect a secondary sort soon?
> There are some use cases where getting a sorted iterator of values per key is
> helpful.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]