Re: RandomForest evaluator for grid search

2015-07-13 Thread Feynman Liang
Joseph may be already working on it; I would continue the discussion on JIRA and first ask if anyone is working on it before starting your own PR. On Mon, Jul 13, 2015 at 1:47 PM, Olivier Girardot < o.girar...@lateral-thoughts.com> wrote: > Thanks ! that's great ! > any way to help on that ? > >

Re: RandomForest evaluator for grid search

2015-07-13 Thread Feynman Liang
That is currently tracked by SPARK-3727 . On Mon, Jul 13, 2015 at 1:16 PM, Olivier Girardot < o.girar...@lateral-thoughts.com> wrote: > thx for the info. > > I'd be interested in getting the full predict_proba like in scikit learn ( > http://sciki

Re: RandomForest evaluator for grid search

2015-07-13 Thread Olivier Girardot
thx for the info. I'd be interested in getting the full predict_proba like in scikit learn ( http://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestClassifier.html#sklearn.ensemble.RandomForestClassifier.predict_proba) for the random forest model. There doesn't seem to be a

Re: RandomForest evaluator for grid search

2015-07-13 Thread Feynman Liang
There is MulticlassMetrics in MLlib; unfortunately a pipelined version hasn't yet been made for spark-ml. SPARK-7690 is tracking work on this if you are interested in following the development. On Mon, Jul 13, 2015 at 2:16 AM, Olivier Girardot < o

RandomForest evaluator for grid search

2015-07-13 Thread Olivier Girardot
Hi everyone, Using spark-ml there seems to be only BinaryClassificationEvaluator and RegressionEvaluator, is there any way or plan to provide a ROC-based or PR-based or F-Measure based for multi-class, I would be interested especially in evaluating and doing a grid search for a RandomForest model.