[GitHub] spark pull request: [SPARK-4980] [MLlib] Add decay factors to stre...

rotationsymmetry Thu, 27 Aug 2015 14:21:03 -0700

Github user rotationsymmetry commented on the pull request:

    https://github.com/apache/spark/pull/8022#issuecomment-135557506
  
    @feynmanliang Thank you very much for your review. 
    
    I have incorporated your comments in commit a4ed2b0.
    
    * Add ScalaDoc for public API.
    * Add ScalaDoc to decribe the forgetful algorithm in 
StreamingLinearAlgorithm.
    * Remove F-polymorphism in StreamingDecaySetter[T].
    * decayFactor and timeUnit in StreamingDecaySetter[T] are now private.
    * Remove division by zero in trainOn of StreamingLinearAlgorithm; provide 
comments to explains why.
    * Improve testing cases of StreamingLogisticRegressionSuite to have rel 
tol=0.1.
    * resolve merge conflict.
    
    As for your comment of "having getLambda instead of getDiscount in 
StreamingDecay", I feel that the discount factor better conveys the 
mathematical idea of the algorithm. Lambda, on the other hand, is only a 
temporary value in the calculation. For example, in the [spark 
doc](http://spark.apache.org/docs/latest/mllib-clustering.html#streaming-k-means),
 the discount factor is employed to describe the algorithm. I have included 
similar description in the ScalaDoc for `StreamingLinearAlgorithm`.
    
    Thanks again for your review. If you have any further comments, please let 
me know.




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark pull request: [SPARK-4980] [MLlib] Add decay factors to stre...

Reply via email to