GitHub user BenFradet opened a pull request:
https://github.com/apache/spark/pull/10355
[SPARK-7425] [ML] [WIP] spark.ml Predictor should support other numeric
types for label
Currently, the Predictor abstraction requires the input labelCol to be of
type DoubleType, but it should accept other numeric types as well. Supporting
them involves updating the PredictorParams.validateAndTransformSchema method.
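As a rough illustration of the kind of check described above, the sketch below accepts any label column whose type is a subclass of NumericType rather than only DoubleType. It is not the code from this pull request: the object name LabelTypeCheck and the helper requireNumericLabel are hypothetical, and the sketch assumes only the public types in org.apache.spark.sql.types.

```scala
import org.apache.spark.sql.types.{DataType, NumericType, StructType}

// Hypothetical helper for illustration only; not the PR's implementation.
object LabelTypeCheck {

  // Fails with IllegalArgumentException unless `labelCol` exists in `schema`
  // and its data type is numeric (IntegerType, LongType, FloatType,
  // DoubleType, DecimalType, and so on).
  def requireNumericLabel(schema: StructType, labelCol: String): Unit = {
    // StructType.apply(name) itself throws if the column is missing.
    val actual: DataType = schema(labelCol).dataType
    require(
      actual.isInstanceOf[NumericType],
      s"Column $labelCol must be of a numeric type but was actually ${actual.simpleString}.")
  }
}
```

A caller could run such a check on the schema and then cast the label column to DoubleType before handing the data to code that still expects doubles; whether the PR takes that approach is not stated in this message.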
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/BenFradet/spark SPARK-7425
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/10355.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #10355
----
commit 2b276a29cc6183e6b42f828631c23e74bcd4144f
Author: BenFradet <[email protected]>
Date: 2015-12-13T15:41:57Z
check label data type for numeric type instead of double
commit 7ef4ad4fce33f48e744fd0c9dbf30549eee76097
Author: BenFradet <[email protected]>
Date: 2015-12-13T15:47:42Z
added some cases to extractLabeledPoints, looking for a better way to
handle this
commit e791ff6390284d6b0baae869de52f37bc0e38862
Author: BenFradet <[email protected]>
Date: 2015-12-13T16:40:30Z
Added a method to set the metadata on a dataframe
commit 83ffecba566f6a085fffa7dcf3194fa9f64edfc3
Author: BenFradet <[email protected]>
Date: 2015-12-13T16:41:43Z
unit tests for the decision tree classifier
commit 97dde27306984e3d78ca05d1cc99e47b18e48a8f
Author: BenFradet <[email protected]>
Date: 2015-12-17T08:33:56Z
used the sqlcontext provided with MLlibTestSparkContext
commit c68ace1d7afe815ed252bacd0cebc576ef6e06b0
Author: BenFradet <[email protected]>
Date: 2015-12-17T09:06:46Z
simpler version of extractLabeledPoints
----