GitHub user imatiach-msft opened a pull request: https://github.com/apache/spark/pull/17086
[SPARK-18693][ML][MLLIB] ML Evaluators should use weight column - added weight column for multiclass classification evaluator ## What changes were proposed in this pull request? The evaluators BinaryClassificationEvaluator, RegressionEvaluator, and MulticlassClassificationEvaluator and the corresponding metrics classes BinaryClassificationMetrics, RegressionMetrics and MulticlassMetrics should use sample weight data. I've closed the PR: https://github.com/apache/spark/pull/16557 as recommended in favor of creating three pull requests, one for each of the evaluators (binary/regression/multiclass) to make it easier to review/update. ## How was this patch tested? I added tests to the metrics class. You can merge this pull request into a Git repository by running: $ git pull https://github.com/imatiach-msft/spark ilmat/multiclass-evaluate Alternatively you can review and apply these changes as the patch at: https://github.com/apache/spark/pull/17086.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #17086 ---- commit cf6a5aba61716dcb11ef3ca7b1f3b803bf99ef33 Author: Ilya Matiach <il...@microsoft.com> Date: 2017-02-27T18:28:08Z Added weight column for multiclass classification evaluator ---- --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org