Hi Felix.
You should be fine the way you do it.
You might want to rescale the continuous values though, possibly to lie 
within 0 to 1, using MinMaxScaler.

Cheers,
Andy

On 06/18/2015 05:05 AM, Felix Dreher wrote:
> Dear all,
>
> I have a general question about running logistic regression with a
> combination of numeric and binary predictors (independent variables).
> I'm sorry if it was already asked before, but I couldn't find an answer
> in the mail archives.
>
> The data set I'm working with looks as follows:
> The numeric predictors are gene expression values (continuous values
> ranging from 0 to 20,000) and the binary predictors are gene mutation
> values (0: gene is not mutated, 1: gene is mutated). The dependent
> variable is drug response (here: cell line is sensitive or resistant to
> a drug).
>
> Currently I'm ignoring the heterogeneity of the predictors and run the
> function as follows:
>
> lr = LogisticRegression(penalty='l1', C=1).fit(train_xs, train_ys)
>
> I was wondering if I need to adjust the model in order to reflect the
> combination of numeric and binary predictors?
>
> Any hints are highly appreciated...
>
> Best regards,
> Felix
>
>
>
>
> ------------------------------------------------------------------------------
> _______________________________________________
> Scikit-learn-general mailing list
> Scikit-learn-general@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general


------------------------------------------------------------------------------
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to