Re: Handling Missing Values in MLLIB Decision Tree

2016-03-22 Thread Joseph Bradley
It does not currently handle surrogate splits.  You will need to preprocess
your data to remove or fill in missing values.  I'd recommend using the
DataFrame API for that since it comes with a number of na methods.
Joseph

On Thu, Mar 17, 2016 at 9:51 PM, Abir Chakraborty 
wrote:

> Hello,
>
>
>
> Can MLLIB Decision Tree (DT) handle missing values by having surrogate
> split (as it is currently being done in “rpart” library in R)?
>
>
>
> Thanks,
>
> Abir
> --
>
> *Principal Data Scientist, Data Science Group, Innovation Labs*
>
> *[24]**7 **Inc. - *The Intuitive Consumer Experience Company™ *|* *We
> make life simple for consumers to connect with companies to get things done*
>
> Mobile: +91-9880755850 *|* e-mail: abi...@247-inc.com
>   Prestige Tech Platina, Kadubeesanahalli, Marathahalli Outer Ring Road
> *|* Bangalore 560087 *|* India *|* www.247-inc.com
>
>
>


Handling Missing Values in MLLIB Decision Tree

2016-03-18 Thread Abir Chakraborty
Hello,

Can MLLIB Decision Tree (DT) handle missing values by having surrogate split 
(as it is currently being done in "rpart" library in R)?

Thanks,
Abir

Principal Data Scientist, Data Science Group, Innovation Labs
[24]7 Inc. - The Intuitive Consumer Experience Company(tm) | We make life 
simple for consumers to connect with companies to get things done
Mobile: +91-9880755850 | e-mail: abi...@247-inc.com
Prestige Tech Platina, Kadubeesanahalli, Marathahalli Outer Ring Road | 
Bangalore 560087 | India | www.247-inc.com