Makoto Yui created HIVEMALL-245:
-----------------------------------
Summary: Refactor RandomForest for Sparse Data handling
Key: HIVEMALL-245
URL: https://issues.apache.org/jira/browse/HIVEMALL-245
Project: Hivemall
Issue Type: Improvement
Affects Versions: 0.5.2
Reporter: Makoto Yui
Assignee: Makoto Yui
Fix For: 0.6.0
* Fix attribute to use RoaringBitmap instead of AttributeType[]
* Support pruning of redundant decision tree nodes
* Support the default value for missing values
* Fix split handling of sparse numeric values
** Problem: no split occurs when all observed values of a column are identical
*** if (x <= 1.0) { ... } else { ... } never splits when every sparse x is already 1.0
** If a column has only a single value, treat it as a nominal value (?)
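
The sparse-split problem above can be illustrated with a minimal sketch. This is not Hivemall's actual code: the class SparseSplitSketch, its methods, and the midpoint rule for picking candidate thresholds are all assumptions made for illustration. It shows that a sparse column whose stored entries are all 1.0 yields no candidate threshold, while counting missing rows as a default value (assumed to be 0.0 here) makes a threshold appear.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeSet;

// Hypothetical illustration only; names and the midpoint rule are assumptions,
// not Hivemall's RandomForest implementation.
final class SparseSplitSketch {

    /** Candidate thresholds: midpoints between consecutive distinct values. */
    static List<Double> candidateThresholds(TreeSet<Double> distinct) {
        List<Double> thresholds = new ArrayList<>();
        Double prev = null;
        for (double v : distinct) {
            if (prev != null) {
                thresholds.add((prev + v) / 2.0);
            }
            prev = v;
        }
        return thresholds;
    }

    /** Distinct values stored in a sparse column (row index -> value); missing rows are ignored. */
    static TreeSet<Double> observedOnly(Map<Integer, Double> sparseColumn) {
        return new TreeSet<>(sparseColumn.values());
    }

    /** Same as observedOnly, but rows without an entry contribute defaultValue. */
    static TreeSet<Double> withDefault(Map<Integer, Double> sparseColumn, int numRows,
            double defaultValue) {
        TreeSet<Double> distinct = new TreeSet<>(sparseColumn.values());
        if (sparseColumn.size() < numRows) {
            distinct.add(defaultValue);
        }
        return distinct;
    }

    public static void main(String[] args) {
        // Four rows; only rows 0 and 3 store a value, and both are 1.0.
        Map<Integer, Double> x = new HashMap<>();
        x.put(0, 1.0);
        x.put(3, 1.0);

        // Only one distinct observed value, so "x <= t" has nothing to separate.
        System.out.println(candidateThresholds(observedOnly(x)));      // prints []

        // With missing rows counted as 0.0, a threshold between 0.0 and 1.0 exists.
        System.out.println(candidateThresholds(withDefault(x, 4, 0.0))); // prints [0.5]
    }
}
```

Under these assumptions, the nominal-value alternative raised above would amount to testing whether x is present (equal to 1.0) or not, which separates the same rows as the 0.5 threshold.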