Nicholas Chammas created SPARK-19216:
----------------------------------------

             Summary: LogisticRegressionModel is missing getThreshold()
                 Key: SPARK-19216
                 URL: https://issues.apache.org/jira/browse/SPARK-19216
             Project: Spark
          Issue Type: Improvement
          Components: ML, PySpark
    Affects Versions: 2.1.0
            Reporter: Nicholas Chammas
            Priority: Minor


Say I just loaded a logistic regression model from storage. How do I check that 
model's threshold in PySpark? From what I can see, the only way to do that is 
to dip into the Java object:

{code}
model._java_obj.getThreshold())
{code}

It seems like PySpark's version of {{LogisticRegressionModel}} should include 
this method.

Another issue is that it's not clear whether the threshold is for the raw 
prediction or the probability. Maybe it's obvious to machine learning 
practitioners, but I couldn't tell from reading the docs or skimming the code 
what the threshold was for exactly.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to