Saurabh Agrawal created SPARK-21476:
---------------------------------------
Summary: RandomForest classification model not using broadcast in transform
Key: SPARK-21476
URL: https://issues.apache.org/jira/browse/SPARK-21476
Project: Spark
Issue Type: Bug
Components: ML
Affects Versions: 2.2.0
Reporter: Saurabh Agrawal

I notice significant task deserialization latency while running prediction with pipelines that use RandomForestClassificationModel. While digging into the source, I found that the transform method of RandomForestClassificationModel is inherited from its parent, ProbabilisticClassificationModel, and that the only concrete definition RandomForestClassificationModel provides that is actually used in transform is predictRaw. predictRaw does not use broadcasting, so the model is presumably serialized into the task closure and deserialized again by every task.
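To illustrate the concern, here is a toy sketch in plain Python. It does not use Spark: pickle stands in for Spark's serializer, and the function names, the "broadcast_0" handle, and the forest layout are all hypothetical. It assumes that a model captured in a task closure is deserialized by every task, while a broadcast value is deserialized once per executor and then served from a local cache.

```python
import pickle


def make_toy_forest(num_trees, nodes_per_tree):
    """Hypothetical stand-in for a trained forest: one list of thresholds per tree."""
    return [[float(n) for n in range(nodes_per_tree)] for _ in range(num_trees)]


def bytes_deserialized_closure(model, num_tasks):
    """Model captured in the task closure: every task deserializes the whole model."""
    task_blob = pickle.dumps(model)
    total = 0
    for _ in range(num_tasks):
        pickle.loads(task_blob)  # repeated for each task
        total += len(task_blob)
    return total


def bytes_deserialized_broadcast(model, num_tasks, num_executors):
    """Model broadcast: tasks deserialize a small handle; each executor
    deserializes the model once and caches it."""
    model_blob = pickle.dumps(model)
    handle_blob = pickle.dumps("broadcast_0")  # hypothetical broadcast id
    executor_cache = {}
    total = 0
    for task_id in range(num_tasks):
        executor = task_id % num_executors  # round-robin task placement
        pickle.loads(handle_blob)
        total += len(handle_blob)
        if executor not in executor_cache:
            executor_cache[executor] = pickle.loads(model_blob)
            total += len(model_blob)
    return total


forest = make_toy_forest(num_trees=50, nodes_per_tree=200)
closure_bytes = bytes_deserialized_closure(forest, num_tasks=200)
broadcast_bytes = bytes_deserialized_broadcast(forest, num_tasks=200, num_executors=4)
print(f"closure:   {closure_bytes} bytes deserialized")
print(f"broadcast: {broadcast_bytes} bytes deserialized")
```

In this toy model the closure cost grows with the number of tasks, while the broadcast cost grows mainly with the number of executors. Real numbers in Spark depend on the serializer, the size of the trees, and the cluster layout.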