Re: Handling Missing Values in MLLIB Decision Tree

2016-03-22 Thread Joseph Bradley
It does not currently handle surrogate splits. You will need to preprocess your data to remove or fill in missing values. I'd recommend using the DataFrame API for that since it comes with a number of na methods. Joseph On Thu, Mar 17, 2016 at 9:51 PM, Abir Chakraborty

Handling Missing Values in MLLIB Decision Tree

2016-03-18 Thread Abir Chakraborty
Hello, Can MLLIB Decision Tree (DT) handle missing values by having surrogate split (as it is currently being done in "rpart" library in R)? Thanks, Abir Principal Data Scientist, Data Science Group, Innovation Labs [24]7 Inc. - The Intuitive Consumer Experience