caoli created SPARK-2160:
----------------------------
Summary: error of Decision tree algorithm in Spark MLlib
Key: SPARK-2160
URL: https://issues.apache.org/jira/browse/SPARK-2160
Project: Spark
Issue Type: Bug
Components: MLlib
Affects Versions: 1.0.0
Reporter: caoli
Fix For: 1.1.0
The computation of rightNodeAgg in the MLlib decision tree algorithm is wrong.
In the function extractLeftRightNodeAggregates(), the binData index used to
compute rightNodeAgg is incorrect. See DecisionTree.scala, line 980:
rightNodeAgg(featureIndex)(2 * (numBins - 2 - splitIndex)) =
binData(shift + (2 * (numBins - 2 - splitIndex))) +
rightNodeAgg(featureIndex)(2 * (numBins - 1 - splitIndex))
The index binData(shift + (2 * (numBins - 2 - splitIndex))) is computed
incorrectly, so the resulting rightNodeAgg includes repeated data from the bins.
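
To make the indexing problem concrete, here is a minimal standalone sketch. It is not the MLlib code: it handles a single feature with one statistic per bin (so the factor of 2 and the shift are dropped), and the names RightAggSketch, rightAgg and binOffset are hypothetical. It also assumes that split s puts bins 0..s on the left and bins s+1..numBins-1 on the right. With binOffset = 0 it follows the index pattern quoted above; with binOffset = 1 it reads the bin one position higher.

```scala
object RightAggSketch {
  // Right-side aggregates for one feature, one statistic per bin (simplified layout).
  // Assumption: split s sends bins 0..s left and bins s+1..numBins-1 right.
  // binOffset is a hypothetical knob: 0 mimics the reported index, 1 shifts it by one bin.
  // Requires at least two bins.
  def rightAgg(bins: Array[Double], binOffset: Int): Array[Double] = {
    val numBins = bins.length
    val right = new Array[Double](numBins - 1)
    // The highest split has only the last bin on its right.
    right(numBins - 2) = bins(numBins - 1)
    var splitIndex = 1
    while (splitIndex < numBins - 1) {
      val s = numBins - 2 - splitIndex
      // Accumulate from the next-higher split down to split s.
      right(s) = bins(s + binOffset) + right(s + 1)
      splitIndex += 1
    }
    right
  }

  def main(args: Array[String]): Unit = {
    val bins = Array(1.0, 2.0, 4.0, 8.0)
    println(rightAgg(bins, 0).mkString(", ")) // index pattern from the report
    println(rightAgg(bins, 1).mkString(", ")) // index shifted by one bin
  }
}
```

For bins 1, 2, 4, 8 the reported pattern yields 11, 10, 8, while the shifted index yields 14, 12, 8. Only the second matches the sums of bins strictly to the right of each split, which suggests (under the assumptions above) that bin s is counted on both sides of split s and bin s+1 is dropped.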