[ https://issues.apache.org/jira/browse/SPARK-3278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355119#comment-14355119 ]
Vladimir Vladimirov commented on SPARK-3278: -------------------------------------------- Martin. This would be really nice. Is it possible to run isotonic regression on the data I'll provide? (~150-200M records). The answers I'm looking for - how long it would take to train the model on this data set, how much resources it would take on a cluster and confirm that it won't blow spark. I'll export values in format <float1>, <float2> per line - similar to how it is described in the doc http://people.apache.org/~pwendell/spark-1.3.0-rc1-docs/mllib-isotonic-regression.html ? Where float1 - is between 0 and 1. And float2 - is 0 or 1 > Isotonic regression > ------------------- > > Key: SPARK-3278 > URL: https://issues.apache.org/jira/browse/SPARK-3278 > Project: Spark > Issue Type: New Feature > Components: MLlib > Reporter: Xiangrui Meng > Assignee: Martin Zapletal > Fix For: 1.3.0 > > > Add isotonic regression for score calibration. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org