Joseph K. Bradley created SPARK-2796:
----------------------------------------
Summary: DecisionTree bug with ordered categorical features
Key: SPARK-2796
URL: https://issues.apache.org/jira/browse/SPARK-2796
Project: Spark
Issue Type: Bug
Components: MLlib
Affects Versions: 1.0.0
Reporter: Joseph K. Bradley
In DecisionTree, the method
sequentialBinSearchForOrderedCategoricalFeatureInClassification() indexed bins
from 0 to (math.pow(2, featureCategories.toInt - 1) - 1). This upper bound is
the bound for unordered categorical features, not ordered ones. The upper
bound should be the arity (i.e., max value) of the feature.
--
This message was sent by Atlassian JIRA
(v6.2#6252)