[GitHub] spark issue #19020: [SPARK-3181] [ML] Implement huber loss for LinearRegress...

jkbradley Tue, 26 Sep 2017 16:04:40 -0700

Github user jkbradley commented on the issue:

    https://github.com/apache/spark/pull/19020
  
    > We have two candidate name: epsilon or m
    
    I see; that seems fine then, though I worry that we already use "epsilon" in
    MLlib (tests) to mean "a very small positive number."  Can we document it more
    clearly, including a note that it matches sklearn and is "M" from the paper?
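
    For illustration only, a minimal sketch of the role this parameter plays,
    assuming the Huber function as written in the scikit-learn HuberRegressor
    documentation (quadratic for |z| < epsilon, linear otherwise). The function
    name `huber` is hypothetical and is not Spark's API; 1.35 is sklearn's
    default. The point is that epsilon here is a threshold on the scaled
    residual, not a tiny numerical tolerance:

```python
def huber(z, epsilon=1.35):
    """Huber function H_epsilon(z), following the scikit-learn docs.

    `epsilon` is the threshold ("M" in the paper) on the scaled residual z.
    It is NOT a small numerical tolerance; 1.35 is sklearn's default.
    `huber` is a hypothetical name used only for this sketch.
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    a = abs(z)
    if a < epsilon:
        # Small residuals: ordinary squared loss.
        return z * z
    # Large residuals: linear growth, continuous with z*z at |z| == epsilon.
    return 2.0 * epsilon * a - epsilon * epsilon
```

    With epsilon = 1.0, a scaled residual of 2.0 costs 3.0 rather than the 4.0
    that squared loss would assign, which is where the robustness comes from.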
    
    > provide the estimated scaling factor (sigma from the paper)
    
    I'd say:
    * Either we provide it as 1 for regular linear regression (since that is
      technically correct)
    * Or we take this as an indication that @sethah's comment about separating
      the classes is better.
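
    To make the scaling factor concrete, here is a hedged sketch of the data
    term sum_i (sigma + H_epsilon(r_i / sigma) * sigma) from the scikit-learn
    HuberRegressor objective, with the regularization term omitted. The name
    `huber_objective` is hypothetical, and this is not necessarily how the PR
    computes it. With sigma = 1 and every residual below epsilon, each term is
    1 + r_i^2, i.e. squared error up to a constant, which is one way to read
    the "provide it as 1" option:

```python
def huber_objective(residuals, sigma, epsilon=1.35):
    """Data term of the sklearn-style Huber objective (no regularization).

    residuals: iterable of raw residuals r_i = prediction_i - label_i
    sigma:     scaling factor ("sigma" in the paper), must be positive
    epsilon:   Huber threshold ("M" in the paper)
    `huber_objective` is a hypothetical name used only for this sketch.
    """
    if sigma <= 0:
        raise ValueError("sigma must be positive")
    total = 0.0
    for r in residuals:
        z = abs(r / sigma)  # scaled residual
        if z < epsilon:
            h = z * z  # quadratic region
        else:
            h = 2.0 * epsilon * z - epsilon * epsilon  # linear region
        total += sigma + h * sigma
    return total
```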
    
    Re: @sethah's comment about separating classes, I'll comment in the JIRA
    since that's a bigger discussion.


