[GitHub] spark pull request #11105: [SPARK-12469][CORE] Data Property accumulators fo...

holdenk Mon, 17 Oct 2016 10:14:46 -0700

Github user holdenk commented on a diff in the pull request:

    https://github.com/apache/spark/pull/11105#discussion_r83690354
  
    --- Diff: core/src/main/scala/org/apache/spark/util/AccumulatorV2.scala ---
    @@ -136,15 +179,76 @@ abstract class AccumulatorV2[IN, OUT] extends 
Serializable {
       def reset(): Unit
     
       /**
    +   * Takes the inputs and accumulates. e.g. it can be a simple `+=` for 
counter accumulator.
    +   * Developers should extend addImpl to customize the adding 
functionality.
    +   */
    +  final def add(v: IN): Unit = {
    +    if (metadata != null && metadata.dataProperty) {
    +      dataPropertyAdd(v)
    +    } else {
    +      addImpl(v)
    +    }
    +  }
    +
    +  private def dataPropertyAdd(v: IN): Unit = {
    +    // Add first for localValue & AccumulatorInfo
    --- End diff --
    
    If we want the user to be able to access the current accumulated value from 
their process worker side then we need to perform a "normal" add as well as the 
data property add. Also if we don't have a merge step (e.g. only one 
accumulator) this might make a difference (although I _think_ this won't be the 
case anymore with the refactor that happened I haven't tested it).



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark pull request #11105: [SPARK-12469][CORE] Data Property accumulators fo...

Reply via email to