[jira] [Commented] (SPARK-9744) Add Java RDD method to map with lag and lead

Jerry Z (JIRA) Fri, 07 Aug 2015 11:58:57 -0700

    [ 
https://issues.apache.org/jira/browse/SPARK-9744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14662267#comment-14662267
 ]


Jerry Z commented on SPARK-9744:
--------------------------------

I'm not sure what you mean by "Why just the Java RDD?". You also touched on 
something I don't fully understand. The Javadocs are light on description, so 
I don't know the full implications of partitioning. Does the sortBy() method 
give me a fully sorted RDD, or is the data sorted only within each partition? 
Does every method that works on partitions ignore neighboring values in 
adjacent partitions? If I join two sets, one being the reverse of the other, 
does that mean only the middle partition gets paired up?

> Add Java RDD method to map with lag and lead
> --------------------------------------------
>
>                 Key: SPARK-9744
>                 URL: https://issues.apache.org/jira/browse/SPARK-9744
>             Project: Spark
>          Issue Type: Wish
>            Reporter: Jerry Z
>            Priority: Minor
>
> To avoid zipping with index and doing numerous maps and joins, it would help 
> to have a single method call to map that takes two additional parameters: 
> (1) a list of offsets [(-) for lag, 0 for current and (+) for lead] and 
> (2) a default value. The other difference is that the map function takes an 
> argument of List<T> and not just T.
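
For illustration only, here is a minimal sketch of the semantics the description above seems to ask for, written against a plain in-memory List rather than an RDD. The name mapWithOffsets, its signature, and the choice to use the default value wherever an offset falls outside the data are all assumptions; none of this is an existing Spark API.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.Function;

public class LagLeadSketch {

    // Hypothetical helper, not a Spark API: for each position i, collect the
    // elements at i + offset for every offset, substituting defaultValue when
    // the index falls outside the list, then apply fn to that window.
    static <T, R> List<R> mapWithOffsets(List<T> input,
                                         List<Integer> offsets,
                                         T defaultValue,
                                         Function<List<T>, R> fn) {
        List<R> result = new ArrayList<>(input.size());
        for (int i = 0; i < input.size(); i++) {
            List<T> window = new ArrayList<>(offsets.size());
            for (int offset : offsets) {
                int j = i + offset;
                window.add(j >= 0 && j < input.size() ? input.get(j) : defaultValue);
            }
            result.add(fn.apply(window));
        }
        return result;
    }

    public static void main(String[] args) {
        List<Integer> data = Arrays.asList(1, 2, 3, 4);
        // Offsets -1, 0, +1: previous, current and next element, default 0.
        List<Integer> sums = mapWithOffsets(data, Arrays.asList(-1, 0, 1), 0,
                w -> w.get(0) + w.get(1) + w.get(2));
        System.out.println(sums); // prints [3, 6, 9, 7]
    }
}
```

On an RDD the same idea would presumably need elements from neighboring partitions at each partition boundary, which a purely local sketch like this does not address.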



