[GitHub] spark issue #15214: [SPARK-17017][Follow-up][ML] Refactor of ChiSqSelector a...

mpjlu Sun, 25 Sep 2016 07:59:52 -0700

Github user mpjlu commented on the issue:

    https://github.com/apache/spark/pull/15214
  
    hi @srowen .
    My understand of yanbo's comments here is,
    if user use chSqSelector like this:
    model1 = new ChiSqSelector().setFPR(0.05).setKBest(100).fit(data)
    model2 = new ChiSqSelector().setKBest(100).setFPR(0.05).fit(data)
    model1 will be different with model2. so the model is dependent on the 
order of users setting params.
    Actually, user should not use ChiSqSelector like this. One just need to set 
one SelectorType/Parameter is ok.  But if one don't know ChiSqSelector, he may 
do like this. So yanbo think this is a problem. 
    
    In this PR, setFPR(0.05) is split to two functions: 
setSelectorType("fpr").setAlpha(0.05). This maybe clear to the user.  
    By the principle of software development: one function do one thing, I am 
ok with this change.  
    But from user experience,  I like the spark-17017 method.




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark issue #15214: [SPARK-17017][Follow-up][ML] Refactor of ChiSqSelector a...

Reply via email to