Xiangrui Meng created SPARK-1655:
------------------------------------
Summary: In naive Bayes, store conditional probabilities
distributively.
Key: SPARK-1655
URL: https://issues.apache.org/jira/browse/SPARK-1655
Project: Spark
Issue Type: Improvement
Components: MLlib
Reporter: Xiangrui Meng
In the current implementation, we collect all conditional probabilities to the
driver node. When there are many labels and many features, this puts heavy load
on the driver. For scalability, we should provide a way to store conditional
probabilities distributively.
--
This message was sent by Atlassian JIRA
(v6.2#6252)