Frank McQuillan created MADLIB-994:
--------------------------------------

             Summary: RF - improve docs for describing memory usagge
                 Key: MADLIB-994
                 URL: https://issues.apache.org/jira/browse/MADLIB-994
             Project: Apache MADlib
          Issue Type: Documentation
          Components: Module: Random Forest
            Reporter: Frank McQuillan


Some users are hitting VMEM limits, so docs need to give more guidance on what 
params affect memory.

e.g., 
Not that many rows, but a large number of features (500 - 800).


ERROR:  plpy.SPIError: Out of memory  (seg46 slice4 awsaiuirl1179:40006 
pid=449659) (plpython.c:4648)
DETAIL:  VM Protect failed to allocate 1028374648 bytes, 690 MB available
CONTEXT:  Traceback (most recent call last):
  PL/Python function "forest_train", line 39, in <module>
    sample_ratio
  PL/Python function "forest_train", line 565, in forest_train
  PL/Python function "forest_train", line 2248, in _tree_train_grps_using_bins
  PL/Python function "forest_train", line 1324, in _one_step_for_grps
PL/Python function "forest_train"
 
********** Error **********
 
ERROR: plpy.SPIError: Out of memory  (seg46 slice4 awsaiuirl1179:40006 
pid=449659) (plpython.c:4648)
SQL state: XX000
Detail: VM Protect failed to allocate 1028374648 bytes, 690 MB available
Context: Traceback (most recent call last):
  PL/Python function "forest_train", line 39, in <module>
    sample_ratio
  PL/Python function "forest_train", line 565, in forest_train
  PL/Python function "forest_train", line 2248, in _tree_train_grps_using_bins
  PL/Python function "forest_train", line 1324, in _one_step_for_grps
PL/Python function "forest_train"



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to