Mllib explain feature for tree ensembles

2015-10-28 Thread Eugen Cepoi
Hey,

Is there some kind of "explain" feature implemented in MLlib for the
algorithms based on tree ensembles?
I mean a method that takes a single feature vector and returns or prints
which features contributed to the decision, or how much each feature
contributed "negatively" and "positively" to it.

This can be very useful to debug a model on some specific samples and for
feature engineering.
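
To make the request concrete, here is a minimal sketch of one common way to
get such a per-sample explanation: walk the decision path and credit each
split feature with the change in the node's mean prediction. This is not an
MLlib API. The `Node` class, `explain_tree`, `explain_forest`, and the toy
tree are all hypothetical and only illustrate the idea.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Node:
    """Hypothetical tree node; not an MLlib class."""
    value: float                    # mean training target at this node
    feature: Optional[int] = None   # split feature index, None for a leaf
    threshold: float = 0.0
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def explain_tree(root: Node, x: List[float], num_features: int) -> Tuple[float, List[float]]:
    """Return (bias, contributions); bias + sum(contributions) is the prediction."""
    contributions = [0.0] * num_features
    node = root
    while node.feature is not None:
        child = node.left if x[node.feature] <= node.threshold else node.right
        # Credit the split feature with the change in the node mean.
        contributions[node.feature] += child.value - node.value
        node = child
    return root.value, contributions


def explain_forest(trees: List[Node], x: List[float], num_features: int) -> Tuple[float, List[float]]:
    """Average the per-tree bias and contributions over an ensemble."""
    bias = 0.0
    total = [0.0] * num_features
    for tree in trees:
        b, c = explain_tree(tree, x, num_features)
        bias += b
        total = [t + v for t, v in zip(total, c)]
    n = len(trees)
    return bias / n, [t / n for t in total]


# Toy tree (made-up values): split on feature 0, then on feature 1.
toy_tree = Node(
    value=0.5, feature=0, threshold=1.0,
    left=Node(value=0.2),
    right=Node(value=0.8, feature=1, threshold=3.0,
               left=Node(value=0.6), right=Node(value=1.0)),
)

bias, contribs = explain_forest([toy_tree], [2.0, 5.0], num_features=2)
prediction = bias + sum(contribs)
```

A positive entry in `contribs` means the feature pushed the prediction up for
this sample, a negative entry means it pushed it down.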

Thanks,
Eugen


Re: Mllib explain feature for tree ensembles

2015-10-28 Thread Yanbo Liang
Spark ML/MLlib provides featureImportances to estimate the importance of
each feature.
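
For context, a rough sketch of how a gain-based importance vector of this kind
can be computed. The normalization used here (gain weighted by sample count,
normalized per tree, averaged over trees, normalized again) is an assumption
about the general scheme, not a copy of Spark's code, and the split tuples
and function names are hypothetical. Note that the result is a global,
non-negative score per feature; it does not say how a feature moved the
prediction for one particular sample.

```python
from typing import List, Tuple

# One internal node of a tree: (feature_index, gain, num_samples).
Split = Tuple[int, float, int]


def tree_importances(splits: List[Split], num_features: int) -> List[float]:
    """Sum sample-weighted gain per feature, then normalize to sum to 1."""
    raw = [0.0] * num_features
    for feature, gain, num_samples in splits:
        raw[feature] += gain * num_samples
    total = sum(raw)
    return [v / total for v in raw] if total > 0 else raw


def forest_importances(trees: List[List[Split]], num_features: int) -> List[float]:
    """Add up the per-tree importances and normalize the result to sum to 1."""
    summed = [0.0] * num_features
    for splits in trees:
        for j, v in enumerate(tree_importances(splits, num_features)):
            summed[j] += v
    total = sum(summed)
    return [v / total for v in summed] if total > 0 else summed


# Two toy trees with made-up gains and sample counts.
tree_a = [(0, 0.4, 100), (1, 0.1, 60)]
tree_b = [(1, 0.3, 100)]
importances = forest_importances([tree_a, tree_b], num_features=2)
```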



Re: Mllib explain feature for tree ensembles

2015-10-28 Thread Eugen Cepoi
I guess I will have to upgrade to Spark 1.5, thanks!
