Frank McQuillan created MADLIB-952:
--------------------------------------
Summary: Type error in elastic net
Key: MADLIB-952
URL: https://issues.apache.org/jira/browse/MADLIB-952
Project: Apache MADlib
Issue Type: Bug
Components: Module: Regularized Regression
Reporter: Frank McQuillan
First, check which columns are of type text or character varying using the following query:
select
column_name, data_type
from
information_schema.columns
where table_schema='analytics'
and table_name='discharge_train'
and data_type in ('character varying','text')
It returned the following result:
"cohort_assignment";"text"
"providergender";"character varying"
"locationname";"character varying"
"patientkey";"character varying"
This is the code I then used for elastic net:
DROP TABLE IF EXISTS analytics.elastic_net_logistic;
SELECT madlib.elastic_net_train( 'analytics.discharge_train',
'analytics.elastic_net_train_result',
'readmission < 0',
'*',
'binomial',
0.6,
0.02,
TRUE,
NULL,
'fista',
'',
'totalemergencyvisits_in_current_visit ,
totalgimvisits_in_current_visit ,
totalcardiologyvisits_in_current_visit ,
total_time_in_medical_short_inpatient ,
totalacupulmonaryvisits_in_current_visit ,
current_complain_ ,
total_current_dea_ciii,
curr_rx_gen_lactated_ringers ,patientkey ,
totalobservationvisits_in_current_visit,
total_unitsvisited_before_admission ,
totalpcuneurovisits_in_current_visit ,
totalicuvisits_in_current_visit ,curr_rx_gen_oxycodone_w_ ,
total_time_in_medical_short_in_er ,
totalmfmvisits_in_current_visit ,
totalmedsurgvisits_in_current_visit ,
hospitalaccountrecord, inpatientadmitdatetime ,totalongvisits_in_current_visit,
no_null_days_btw_admissions,cohort_assignment,locationname,providergender,patientkey',
10000,
1e-6
);
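The excluded-columns argument in the call above is one comma-separated string with embedded newlines and uneven spacing, so it can help to normalize it before reading it by eye. The following is a minimal sketch only: `parse_excluded` and `find_duplicates` are hypothetical helper names, and this is not how MADlib itself parses the argument. The string is copied from the call above.

```python
def parse_excluded(excluded: str) -> list[str]:
    """Split a comma-separated column list, trimming spaces and newlines."""
    return [name.strip() for name in excluded.split(",") if name.strip()]


def find_duplicates(names: list[str]) -> list[str]:
    """Return the names that occur more than once, in first-seen order."""
    seen: set[str] = set()
    dupes: list[str] = []
    for name in names:
        if name in seen and name not in dupes:
            dupes.append(name)
        seen.add(name)
    return dupes


# The excluded-columns string as passed to elastic_net_train in this report.
EXCLUDED = """totalemergencyvisits_in_current_visit ,
totalgimvisits_in_current_visit ,
totalcardiologyvisits_in_current_visit ,
total_time_in_medical_short_inpatient ,
totalacupulmonaryvisits_in_current_visit ,
current_complain_ ,
total_current_dea_ciii,
curr_rx_gen_lactated_ringers ,patientkey ,
totalobservationvisits_in_current_visit,
total_unitsvisited_before_admission ,
totalpcuneurovisits_in_current_visit ,
totalicuvisits_in_current_visit ,curr_rx_gen_oxycodone_w_ ,
total_time_in_medical_short_in_er ,
totalmfmvisits_in_current_visit ,
totalmedsurgvisits_in_current_visit ,
hospitalaccountrecord, inpatientadmitdatetime ,totalongvisits_in_current_visit,
no_null_days_btw_admissions,cohort_assignment,locationname,providergender,patientkey"""

names = parse_excluded(EXCLUDED)
print("entries:", len(names))
print("duplicates:", find_duplicates(names))
```

Run against the string above, this shows that `patientkey` is listed twice. Whether the duplicate entry or the embedded whitespace has any bearing on the error is not established by this report.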
If you check the excluded list of variables, you will see that at the very end
I have excluded the character and text variables.
It still gives me this error:
Elastic Net error: All columns to be included in the
independent variables should be of the same numeric
type.
Context: Traceback (most recent call last):
PL/Python function "elastic_net_train", line 23, in <module>
return elastic_net.elastic_net_train(**globals())
PL/Python function "elastic_net_train", line 308, in elastic_net_train
PL/Python function "elastic_net_train", line 427, in analyze_input_str
PL/Python function "elastic_net_train"
Please note that I also tried dropping all the character and text columns from
the table before passing it to elastic net, and it still gives the same error.
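The error message asks for columns "of the same numeric type", so one thing worth checking is whether the columns that remain after exclusion share a single data type. This is a guess and not a confirmed cause. The sketch below groups the non-excluded columns by type. `remaining_types` is a hypothetical helper, and the two `example_*` columns are made-up placeholders; only the four text columns come from this report. In practice the column/type pairs could come from the `information_schema.columns` query without the `data_type` filter.

```python
from collections import defaultdict


def remaining_types(columns: dict[str, str], excluded: set[str]) -> dict[str, list[str]]:
    """Group the columns that are NOT excluded by their data type."""
    by_type: dict[str, list[str]] = defaultdict(list)
    for name, data_type in columns.items():
        if name not in excluded:
            by_type[data_type].append(name)
    return dict(by_type)


columns = {
    # Reported by the information_schema query in this issue.
    "cohort_assignment": "text",
    "providergender": "character varying",
    "locationname": "character varying",
    "patientkey": "character varying",
    # Hypothetical placeholders, NOT taken from the real table.
    "example_count": "integer",
    "example_ratio": "double precision",
}
excluded = {"cohort_assignment", "providergender", "locationname", "patientkey"}

groups = remaining_types(columns, excluded)
for data_type, names in groups.items():
    print(data_type, "->", names)

# More than one group means the remaining columns do not share one type.
print("single type:", len(groups) == 1)
```

If more than one type shows up for the real table, casting the remaining columns to a common type in a view might be a way to test this hypothesis.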
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)