DB Tsai created SPARK-4907:
------------------------------

             Summary: Inconsistent loss and gradient in LeastSquaresGradient 
compared with R
                 Key: SPARK-4907
                 URL: https://issues.apache.org/jira/browse/SPARK-4907
             Project: Spark
          Issue Type: Bug
          Components: MLlib
            Reporter: DB Tsai


In most of the academic paper and algorithm implementations, people use L = 
1/2n ||A weights-y||^2 instead of L = 1/n ||A weights-y||^2 for least-squared 
loss. See Eq. (1) in http://web.stanford.edu/~hastie/Papers/glmnet.pdf

Since MLlib uses different convention, this will result different residuals and 
all the stats properties will be different from GLMNET package in R. The model 
coefficients will be still the same under this change. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to