[
https://issues.apache.org/jira/browse/SPARK-16551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15706764#comment-15706764
]
Ruiming Zhou commented on SPARK-16551:
--------------------------------------
I can look at this issue.
> Accumulator Examples should demonstrate different use case from UDAFs
> ---------------------------------------------------------------------
>
> Key: SPARK-16551
> URL: https://issues.apache.org/jira/browse/SPARK-16551
> Project: Spark
> Issue Type: Documentation
> Reporter: Vladimir Feinberg
> Priority: Minor
>
> Currently, the Spark programming guide demonstrates Accumulators
> (http://spark.apache.org/docs/latest/programming-guide.html#accumulators) by
> taking the sum of an RDD.
> This example makes new users think that Accumulators serve the role that
> UDAFs do, which they don't. They're meant to be out-of-band, small values
> that don't break pipe-lining. Documentation examples and notes should reflect
> this (and warn that they may cause driver bottlenecks).
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]