Joseph K. Bradley created SPARK-14322: -----------------------------------------
Summary: Use treeReduce instead of reduce in OnlineLDAOptimizer Key: SPARK-14322 URL: https://issues.apache.org/jira/browse/SPARK-14322 Project: Spark Issue Type: Improvement Components: ML, MLlib Reporter: Joseph K. Bradley OnlineLDAOptimizer uses {{RDD.reduce}} in two places where it could use treeReduce. This can cause scalability issues. This should be an easy fix. See this line: [https://github.com/apache/spark/blob/f12f11e578169b47e3f8b18b299948c0670ba585/mllib/src/main/scala/org/apache/spark/mllib/clustering/LDAOptimizer.scala#L452] and a few lines below it. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org