[GitHub] spark pull request: SPARK-2830 [MLlib]: re-organize mllib document...

mengxr Tue, 12 Aug 2014 16:22:33 -0700

Github user mengxr commented on a diff in the pull request:

    https://github.com/apache/spark/pull/1908#discussion_r16149641
  
    --- Diff: docs/mllib-linear-methods.md ---
    @@ -106,27 +105,25 @@ Here `$\mathrm{sign}(\wv)$` is the vector consisting 
of the signs (`$\pm1$`) of
     of `$\wv$`.
     
     L2-regularized problems are generally easier to solve than L1-regularized 
due to smoothness.
    -However, L1 regularization can help promote sparsity in weights, leading 
to simpler models, which is
    -also used for feature selection.  It is not recommended to train models 
without any regularization,
    +However, L1 regularization can help promote sparsity in weights leading to 
smaller and more interpretable models, the latter of which can be useful for 
feature selection.
    +It is not recommended to train models without any regularization,
     especially when the number of training examples is small.
     
     ## Binary classification
     
    -[Binary 
classification](http://en.wikipedia.org/wiki/Binary_classification) is to 
divide items into
    +[Binary 
classification](http://en.wikipedia.org/wiki/Binary_classification) aims to 
divide items into
     two categories: positive and negative.  MLlib supports two linear methods 
for binary classification:
    -linear support vector machine (SVM) and logistic regression.  The training 
data set is represented
    +linear support vector machines (SVMs) and logistic regression.  The 
training data set is represented
    --- End diff --
    
    That's an interesting point. For linear regression, people created new 
names for different types of regularization, some of which even shadowed the 
original name. It would be nice to add a sentence to clarify that we are 
counting different regularization types.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark pull request: SPARK-2830 [MLlib]: re-organize mllib document...

Reply via email to