[jira] [Updated] (SPARK-7683) Confusing behavior of fold function of RDD in pyspark

Sean Owen (JIRA) Sun, 17 May 2015 03:16:45 -0700

     [ https://issues.apache.org/jira/browse/SPARK-7683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]


Sean Owen updated SPARK-7683:
-----------------------------
    Target Version/s: 2+

To be clear, the change being tracked here is to swap the order of the 
arguments that PySpark passes to the fold() operator so that it is consistent 
with Scala. There is no obvious way to do that without risking breaking 
existing user programs.
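
To illustrate why the order matters, here is a minimal sketch in plain Python.
It is a simplified local model of a partition-wise fold, not PySpark's actual
code; the names fold_partitions and acc_first are hypothetical, and it assumes
the behavior described in this issue, where the element is passed first and
the accumulator second inside each partition.

```python
from functools import reduce


def fold_partitions(partitions, zero_value, op, acc_first):
    """Simplified local model of a partition-wise fold (hypothetical helper).

    acc_first=True  calls op(acc, obj), the order this issue asks for.
    acc_first=False calls op(obj, acc), the order this issue reports.
    """
    def fold_one(items):
        acc = zero_value
        for obj in items:
            acc = op(acc, obj) if acc_first else op(obj, acc)
        return acc

    # Fold each partition on its own, then merge the partial results,
    # starting from zero_value again.
    partials = [fold_one(p) for p in partitions]
    return reduce(op, partials, zero_value)


def concat(x, y):
    # String concatenation is not commutative, so argument order is visible.
    return x + y


partitions = [["a", "b"], ["c", "d"]]
acc_first_result = fold_partitions(partitions, "", concat, acc_first=True)
obj_first_result = fold_partitions(partitions, "", concat, acc_first=False)
print(acc_first_result)  # abcd
print(obj_first_result)  # badc
```

With a commutative op such as addition, both orders give the same result in
this model, so only programs whose op depends on argument order would notice
the swap. Those are the programs that a change in order could break.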

> Confusing behavior of fold function of RDD in pyspark
> -----------------------------------------------------
>
>                 Key: SPARK-7683
>                 URL: https://issues.apache.org/jira/browse/SPARK-7683
>             Project: Spark
>          Issue Type: Bug
>          Components: PySpark
>    Affects Versions: 1.3.1
>            Reporter: Ai He
>            Priority: Minor
>
> This will make the "fold" function consistent with the "fold" in rdd.scala 
> and other "aggregate" functions where "acc" goes first. Otherwise, users have 
> to write a lambda function like "lambda x, y: op(y, x)" if they want to use 
> "zeroValue" to change the result type.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

