[ https://issues.apache.org/jira/browse/SPARK-16551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hyukjin Kwon updated SPARK-16551:
---------------------------------
    Labels: bulk-closed  (was: )

> Accumulator Examples should demonstrate different use case from UDAFs
> ---------------------------------------------------------------------
>
>                 Key: SPARK-16551
>                 URL: https://issues.apache.org/jira/browse/SPARK-16551
>             Project: Spark
>          Issue Type: Documentation
>            Reporter: Vladimir Feinberg
>            Priority: Minor
>              Labels: bulk-closed
>
> Currently, the Spark programming guide demonstrates Accumulators
> (http://spark.apache.org/docs/latest/programming-guide.html#accumulators) by
> taking the sum of an RDD.
> This example leads new users to think that Accumulators serve the same role
> as UDAFs, which they do not. They are meant for small, out-of-band values
> that do not break pipelining. Documentation examples and notes should reflect
> this (and warn that they may cause driver bottlenecks).
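
For illustration only, here is a minimal sketch of the kind of out-of-band use the description asks for, assuming PySpark. The helper names parse_int and sum_valid are hypothetical and are not taken from the programming guide. The sum itself is computed by an ordinary RDD action; the accumulator only carries a small side count of rejected lines back to the driver.

```python
def parse_int(line, bad_records):
    """Parse one line; count failures out of band instead of failing the job."""
    try:
        return [int(line)]
    except ValueError:
        # Side channel only: the pipeline's result does not depend on this.
        # bad_records is anything with an add() method, such as a
        # pyspark Accumulator.
        bad_records.add(1)
        return []


def sum_valid(sc, lines):
    """Sum the parseable lines and report how many lines were rejected.

    sc is assumed to be a live pyspark.SparkContext supplied by the caller.
    """
    bad_records = sc.accumulator(0)  # small integer counter, read on the driver
    total = (
        sc.parallelize(lines)
        .flatMap(lambda line: parse_int(line, bad_records))
        .sum()  # the aggregation is a normal action, not the accumulator
    )
    # The value is read only after the action has run. Updates made inside a
    # transformation may be applied more than once if a task is re-executed,
    # so the count is best treated as diagnostic.
    return total, bad_records.value
```

With a live SparkContext named sc, a call such as sum_valid(sc, ["1", "2", "oops"]) would return the sum of the valid lines together with the number of rejected ones. Keeping the accumulator to a single small counter is deliberate, since every task ships its update back to the driver.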



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
