Makoto Yui created HIVEMALL-245:
-----------------------------------

             Summary: Refactor RandomForest for Sparse Data handling
                 Key: HIVEMALL-245
                 URL: https://issues.apache.org/jira/browse/HIVEMALL-245
             Project: Hivemall
          Issue Type: Improvement
    Affects Versions: 0.5.2
            Reporter: Makoto Yui
            Assignee: Makoto Yui
             Fix For: 0.6.0


* Fix attribute to use RoaringBitmap instead of AttributeType[]
 * Support pruning of redundant decision tree nodes
 * Support the default value for missing values
 * Fix split handling of sparse numeric values
 ** problem: split does not occur when column values have a single value
 *** if(x<=1.0) \{ .. } else \{ ... } never split where sparse x is already 1.0
 ** if there are only a single value for a column, then treat it as nominal 
value (?)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to