Hi,

Could you please send the link for the example you are talking about? 
minPartitions and numFeatures do not exist in the current API 
for NaiveBayes as far as I know. So, I don't know how to answer your second 
question.

Regarding your first question, guessing blindly, it should be related to 
numPartitions, which is the number of partitions your dataset consists of.
It is usually best to set this number to the number of cores your machine has. 
You can also try double the number of cores or half.

Best,
Burak

----- Original Message -----
From: "SK" <skrishna...@gmail.com>
To: u...@spark.incubator.apache.org
Sent: Wednesday, August 6, 2014 3:45:09 PM
Subject: Naive Bayes parameters


1) How is the minPartitions parameter in NaiveBayes example used? What is
the default value?

2) Why is the  numFeatures specified as a parameter? Can this not be
obtained from the data? This parameter is not specified for the other MLlib
algorithms.  

thanks



--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Naive-Bayes-parameters-tp11592.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org

Reply via email to