You may also find VectorSlicer and SQLTransformer useful in your case. Just
out of curiosity, how would you typically handles categorical features,
except for OneHotEncoder.

Regards,
Yuhao

2016-07-01 4:00 GMT-07:00 Yanbo Liang <yblia...@gmail.com>:

> You can combine the columns which are need to be normalized into a vector
> by VectorAssembler and do normalization on it.
> Do another assembling for columns should not be normalized. At last, you
> can assemble the two vector into one vector as the feature column and feed
> it into model training.
>
> Thanks
> Yanbo
>
> 2016-06-25 21:16 GMT-07:00 段石石 <burness1...@gmail.com>:
>
>> Hi all:
>>
>>
>>     I use the MinMaxScaler for data normalization, but I found the the
>> api is only for Vector, we must vectorized the features firtst. However,
>> the feature usually include two parts: one is need to be Normalization,
>> another should not be normalized such as categorical. I want to add a api
>> with the DataFrame which aim to normalize the columns which we want to
>> normalize. And then we can make it to be vector and sent to the ML model
>> api to train. I think that will be very useful for the developer with
>> machine learning.
>>
>>
>>
>> Best Regards
>>
>> Thanks
>>
>
>

Reply via email to