GitHub user cloud-fan opened a pull request:
https://github.com/apache/spark/pull/13029
[SPARK-15252][CORE][WIP] add accumulator wrapper to have more control of
accumulator
## What changes were proposed in this pull request?
Currently the `AccumulatorV2` is a kind of messy as it mixes the logic of
how to accumulator inputs and how to control the metadata. This PR adds an
`AccumulatorWrapper` to take care of the metadata management and make our
accumulator framework more flexible.
One drawback is, this makes the API a little harder to use. e.g. now users
have to use `wrapper.accumulator.add` in RDD closure instead of `acc1.add`.
This PR is a prototype.
## How was this patch tested?
existing tests.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/cloud-fan/spark acc-wrapper
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/13029.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #13029
----
commit 3d1c73604b3efbbbf087904ebfd7b6889d8dde85
Author: Wenchen Fan <[email protected]>
Date: 2016-05-10T14:54:53Z
add accumulator wrapper
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]