[
https://issues.apache.org/jira/browse/SPARK-9744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14662267#comment-14662267
]
Jerry Z commented on SPARK-9744:
--------------------------------
I'm not sure what you mean by "Why just the Java RDD?". You also touched on
something I don't fully grasp. Since the Javadocs are light on description,
I don't know the full implications of partitioning. For the sortBy() method,
do I get a fully sorted RDD, or is the data sorted only within each partition?
Does every method involving partitions ignore neighboring values? If I join
two sets, one being the reverse of the other, does that mean only the middle
partition gets paired up?
> Add Java RDD method to map with lag and lead
> --------------------------------------------
>
> Key: SPARK-9744
> URL: https://issues.apache.org/jira/browse/SPARK-9744
> Project: Spark
> Issue Type: Wish
> Reporter: Jerry Z
> Priority: Minor
>
> To avoid zipping with index and doing numerous maps and joins, it would help
> to have a single method call, like map, that takes two additional parameters:
> (1) a list of offsets ((-) for lag, 0 for current, and (+) for lead) and
> (2) a default value. The other difference from map is that the supplied
> function takes an argument of List<T> and not just T.
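
To make the requested semantics concrete, here is a minimal plain-Java sketch over an in-memory List. It is not Spark API and it ignores partitioning entirely. The names LagLeadSketch and mapWithOffsets are hypothetical, and it assumes the default value is what fills in for offsets that fall outside the data.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.Function;

// Hypothetical illustration only: none of these names exist in Spark.
class LagLeadSketch {

    // For each element, collect the values at the given offsets
    // (negative = lag, 0 = current, positive = lead) and pass that
    // window to f. Assumption: out-of-range offsets yield defaultValue.
    static <T, R> List<R> mapWithOffsets(List<T> data,
                                         List<Integer> offsets,
                                         T defaultValue,
                                         Function<List<T>, R> f) {
        List<R> out = new ArrayList<>(data.size());
        for (int i = 0; i < data.size(); i++) {
            List<T> window = new ArrayList<>(offsets.size());
            for (int off : offsets) {
                int j = i + off;
                boolean inRange = j >= 0 && j < data.size();
                window.add(inRange ? data.get(j) : defaultValue);
            }
            out.add(f.apply(window));
        }
        return out;
    }

    public static void main(String[] args) {
        List<Integer> data = Arrays.asList(10, 20, 30, 40);
        // Sum of previous, current, and next value, with 0 at the edges.
        List<Integer> sums = mapWithOffsets(
                data, Arrays.asList(-1, 0, 1), 0,
                w -> w.get(0) + w.get(1) + w.get(2));
        System.out.println(sums); // prints [30, 60, 90, 70]
    }
}
```

On an RDD, the same result would additionally need values from neighboring partitions at each partition boundary, which is what the zip-with-index and join workaround mentioned above provides.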