[
https://issues.apache.org/jira/browse/SPARK-19216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15822094#comment-15822094
]
Nicholas Chammas commented on SPARK-19216:
------------------------------------------
cc [~josephkb] - Is this a valid gap in Python's API, or I did just
misunderstand things?
> LogisticRegressionModel is missing getThreshold()
> -------------------------------------------------
>
> Key: SPARK-19216
> URL: https://issues.apache.org/jira/browse/SPARK-19216
> Project: Spark
> Issue Type: Improvement
> Components: ML, PySpark
> Affects Versions: 2.1.0
> Reporter: Nicholas Chammas
> Priority: Minor
>
> Say I just loaded a logistic regression model from storage. How do I check
> that model's threshold in PySpark? From what I can see, the only way to do
> that is to dip into the Java object:
> {code}
> model._java_obj.getThreshold())
> {code}
> It seems like PySpark's version of {{LogisticRegressionModel}} should include
> this method.
> Another issue is that it's not clear whether the threshold is for the raw
> prediction or the probability. Maybe it's obvious to machine learning
> practitioners, but I couldn't tell from reading the docs or skimming the code
> what the threshold was for exactly.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]