Rashmi Raghu created MADLIB-975:
-----------------------------------
Summary: Random Forest training error message
Key: MADLIB-975
URL: https://issues.apache.org/jira/browse/MADLIB-975
Project: Apache MADlib
Issue Type: Bug
Reporter: Rashmi Raghu
The error message raised during Random Forest (RF) training is not interpretable. See the example query below.
Sample data for the query was sent offline.
SELECT madlib.forest_train(
'dev.training_data_take_1', -- training_table_name,
'dev.models_random_forest', -- output_table_name,
'id', -- id_col_name,
'event', -- dependent_variable,
'*', -- list_of_features,
'id,regionname,wellid,day', -- list_of_features_to_exclude,
NULL, -- grouping_cols,
100 -- num_trees,
-- num_random_features,
-- importance,
-- num_permutations,
-- max_tree_depth,
-- min_split,
-- min_bucket,
-- num_splits,
-- surrogate_params,
-- verbose,
-- sample_ratio
);
ERROR: AttributeError: 'NoneType' object has no attribute 'sort'
(plpython.c:4648)
CONTEXT: Traceback (most recent call last):
PL/Python function "forest_train", line 42, in <module>
sample_ratio
PL/Python function "forest_train", line 337, in forest_train
PL/Python function "forest_train"
********** Error **********
ERROR: AttributeError: 'NoneType' object has no attribute 'sort'
(plpython.c:4648)
SQL state: XX000
Context: Traceback (most recent call last):
PL/Python function "forest_train", line 42, in <module>
sample_ratio
PL/Python function "forest_train", line 337, in forest_train
PL/Python function "forest_train"
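For reference, the AttributeError above is what Python raises when .sort() is called on None instead of on a list. The sketch below is not MADlib's actual code: get_feature_names and checked_feature_names are hypothetical helpers, and the idea that a feature list came back as None is only an assumption about the cause. It illustrates how a guard placed before the sort could replace the raw traceback with an interpretable message.

```python
def get_feature_names(all_columns, excluded):
    """Hypothetical helper: returns None (not []) when no column is left."""
    remaining = [c for c in all_columns if c not in excluded]
    return remaining or None


def checked_feature_names(all_columns, excluded):
    """Hypothetical guard: fail with a readable message before sorting."""
    features = get_feature_names(all_columns, excluded)
    if features is None:
        raise ValueError(
            "forest_train: no usable feature columns remain after exclusions"
        )
    features.sort()
    return features


def demo():
    """Return the raw error text and the guarded error text for comparison."""
    columns = ["id", "day"]
    excluded = ["id", "day"]
    try:
        # Unguarded call: .sort() on None raises AttributeError.
        get_feature_names(columns, excluded).sort()
    except AttributeError as exc:
        raw = str(exc)
    try:
        # Guarded call: the same condition is reported in plain words.
        checked_feature_names(columns, excluded)
    except ValueError as exc:
        friendly = str(exc)
    return raw, friendly
```

Calling demo() returns both messages side by side. The guard does not change behavior for valid input; it only reports the failing condition in terms of the user's arguments.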