Hey,
as far as I know, feature selection using a chi-squared statistic
can only be done on categorical features, not on possibly continuous
ones.
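To illustrate why the test wants categorical data: the chi-squared statistic is computed from a contingency table of counts (feature category vs. label class), so each distinct feature value is its own row. The following is only a minimal sketch in plain Scala, not what Spark runs internally; `chiSquared` is a hypothetical helper name and the counts are made up.

```scala
// Pearson chi-squared statistic for a contingency table of observed counts.
// Rows: categories of one feature; columns: label classes.
// Assumes every row total and column total is positive.
def chiSquared(observed: Array[Array[Double]]): Double = {
  val rowTotals = observed.map(_.sum)
  val colTotals = observed.transpose.map(_.sum)
  val total = rowTotals.sum
  val terms = for {
    i <- observed.indices
    j <- observed(i).indices
  } yield {
    // Count expected in cell (i, j) if feature and label were independent.
    val expected = rowTotals(i) * colTotals(j) / total
    val diff = observed(i)(j) - expected
    diff * diff / expected
  }
  terms.sum
}

// Made-up example: a feature with two categories against a binary label.
val table = Array(Array(10.0, 20.0), Array(30.0, 40.0))
val stat = chiSquared(table) // 50/63, roughly 0.794
```

With a continuous feature nearly every value is distinct, so the table ends up with many rows holding one or two observations each, and the statistic says little about the feature.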
Furthermore, since your logistic model doesn't use any regularization,
the regression itself should be fine here: there is no penalty that
would shrink coefficients to exactly zero. So I'd check the
ChiSqSelector and possibly replace it with another feature selection
method.
There is, however, always the chance that your response does not depend
on those covariates, in which case you'd estimate a zero coefficient.
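One way to check both suspects is to pull the selector and the regression out of the fitted pipeline and look at what they hold. This is only a sketch under some assumptions: the stage order is the one in the quoted code (selector, assembler, regression), the label is binary (so `coefficients` is defined), and `FitSummary` and `summarize` are hypothetical names introduced here.

```scala
import org.apache.spark.ml.PipelineModel
import org.apache.spark.ml.classification.LogisticRegressionModel
import org.apache.spark.ml.feature.ChiSqSelectorModel

// Hypothetical container for the values worth looking at.
final case class FitSummary(
  selectedIndices: Array[Int],  // indices into VECTOR_1 kept by the selector
  regParam: Double,             // 0.0 means no regularization
  elasticNetParam: Double,      // only matters when regParam > 0
  coefficients: Array[Double]   // in the order of the assembled "Union" vector
)

// Assumes stages are (ChiSqSelector, VectorAssembler, LogisticRegression).
def summarize(fitted: PipelineModel): FitSummary = {
  val selector = fitted.stages(0).asInstanceOf[ChiSqSelectorModel]
  val lrModel = fitted.stages(2).asInstanceOf[LogisticRegressionModel]
  FitSummary(
    selectedIndices = selector.selectedFeatures,
    regParam = lrModel.getRegParam,
    elasticNetParam = lrModel.getElasticNetParam,
    coefficients = lrModel.coefficients.toArray
  )
}
```

Usage would be something like `summarize(pipeline.fit(trainingData))`, where `trainingData` stands for whatever DataFrame the pipeline was fitted on. Comparing `selectedIndices` with the positions of the zero coefficients should show whether the zeros sit on features the selector picked.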
Cheers,
Simon
On 24.10.17 at 04:56, Alexis Peña wrote:
Hi Guys,
We are fitting a Logistic model using the following code.
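import org.apache.spark.ml.Pipeline
import org.apache.spark.ml.classification.LogisticRegression
import org.apache.spark.ml.feature.{ChiSqSelector, VectorAssembler}

// Keep the 10 features of VECTOR_1 most associated with TARGET.
val Chisqselector = new ChiSqSelector().setNumTopFeatures(10)
  .setFeaturesCol("VECTOR_1").setLabelCol("TARGET").setOutputCol("selectedFeatures")
// Combine the selected features with the remaining input columns.
val assembler = new VectorAssembler()
  .setInputCols(Array("FEATURES", "selectedFeatures", "PROM_MESES_DIST",
    "RECENCIA", "TEMP_MIN", "TEMP_MAX", "PRECIPITACIONES"))
  .setOutputCol("Union")
// Logistic regression with default parameters.
val lr = new LogisticRegression().setLabelCol("TARGET").setFeaturesCol("Union")
val pipeline = new Pipeline().setStages(Array(Chisqselector, assembler, lr))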
Do you know why the coefficients for the following features are
estimated as zero? Is this caused by the ChiSqSelector or by the
logistic model?
Thanks in advance!!
CODIGO      PARAMETRO        COEFICIENTES_MUESTREO_BALANCEADO
PROPIAS     CV_UM            0,276866756
PROPIAS     CV_U3M           -0,241851427
PROPIAS     CV_U6M           -0,568312819
PROPIAS     CV_U12M          0,134706601
PROPIAS     M_UM             5,47E-06
PROPIAS     M_U3M            -7,10E-06
PROPIAS     M_U6M            1,73E-05
PROPIAS     M_U12M           -5,41E-06
PROPIAS     CP_UM            -0,050750105
PROPIAS     CP_U3M           0,125483162
PROPIAS     CP_U6M           -0,353906788
PROPIAS     CP_U12M          0,159538155
PROPIAS     TUM              -0,020217902
PROPIAS     TU3M             0,002101906
PROPIAS     TU6M             -0,005481915
PROPIAS     TU12M            0,003443081
CRUZADAS    2303             0
CRUZADAS    3901             0
CRUZADAS    3905             0
CRUZADAS    3907             0
CRUZADAS    3909             0
CRUZADAS    4102             0
CRUZADAS    4307             0
CRUZADAS    4501             0
CRUZADAS    4907             0,247624087
CRUZADAS    5304             -0,161424508
LP          PROM_MESES_DIST  -0,680356554
PROPIAS     RECENCIA         -0,00289069
EXTERNAS    TEMP_MIN         0,006488683
EXTERNAS    TEMP_MAX         -0,013497441
EXTERNAS    PRECIPITACIONES  -0,007607086
INTERCEPTO                   2,401593191