GitHub user mengxr commented on the pull request:
https://github.com/apache/spark/pull/372#issuecomment-40333887
@srowen I think we will have a generic `PredictiveModel[INPUT,OUTPUT]` base
class for both `RegressionModel` and `ClassificationModel`. I agree that we
should have classification models returning a distribution over labels. Thanks
for bringing this up! In the current logistic regression and SVM
implementations, the default thresholds are set to 0.5 and 0.0 respectively, so
the models return exact labels instead of scores, which I don't really like.
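
To make the idea concrete, here is a minimal sketch of what such a hierarchy might look like. Only the names `PredictiveModel[INPUT,OUTPUT]`, `RegressionModel`, and `ClassificationModel` come from the comment above; the method names, the `Map[Int, Double]` representation of a label distribution, and the `ToyLogisticModel` class are assumptions made for illustration, not an existing or planned MLlib API.

```scala
// Hypothetical sketch only: these traits illustrate the proposal and are
// not MLlib's actual API.
trait PredictiveModel[INPUT, OUTPUT] {
  def predict(input: INPUT): OUTPUT
}

// A regression model predicts a real value.
trait RegressionModel[INPUT] extends PredictiveModel[INPUT, Double]

// A classification model predicts a distribution over labels
// (label -> probability) rather than a hard label.
trait ClassificationModel[INPUT] extends PredictiveModel[INPUT, Map[Int, Double]] {
  // Convenience method: pick the most probable label.
  def predictLabel(input: INPUT): Int = predict(input).maxBy(_._2)._1
}

// Toy binary logistic model (assumed for illustration).
final class ToyLogisticModel(weights: Array[Double], intercept: Double)
    extends ClassificationModel[Array[Double]] {

  private def sigmoid(z: Double): Double = 1.0 / (1.0 + math.exp(-z))

  override def predict(features: Array[Double]): Map[Int, Double] = {
    require(features.length == weights.length, "dimension mismatch")
    val margin = weights.zip(features).map { case (w, x) => w * x }.sum + intercept
    val p = sigmoid(margin)
    Map(0 -> (1.0 - p), 1 -> p)
  }

  // Thresholding becomes an explicit, caller-chosen step instead of a
  // default baked into predict.
  def predictLabel(features: Array[Double], threshold: Double): Int =
    if (predict(features)(1) > threshold) 1 else 0
}
```

With this shape, a caller who wants scores uses `predict`, and a caller who wants hard labels chooses the threshold explicitly, for example `new ToyLogisticModel(Array(2.0), 0.0).predictLabel(Array(1.0), 0.9)`.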