Re: [scikit-learn] Naive Bayes - Multinomial Naive Bayes tf-idf

Andy Fri, 04 Nov 2016 07:45:32 -0700


On 11/04/2016 05:45 AM, Marcin Mirończuk wrote:

Hi,
In our experiments, we use Multinomial Naive Bayes (MNB). The traditional MNB assumes TF (term frequency) weights for the words. The documentation at http://scikit-learn.org/stable/modules/naive_bayes.html describes Multinomial Naive Bayes as a model "... where the data are typically represented as word vector counts, although tf-idf vectors are also known to work well in practice". The "word vector counts" are TF, which is well known. Our problem is with the "tf-idf vectors". In the tf-idf case, was the approach of J. D. M. Rennie et al., "Tackling the Poor Assumptions of Naive Bayes Text Classifiers", implemented? The documentation does not cite this solution.

No, I think that paper implements something slightly different. The documentation says that you can apply the TfidfVectorizer instead of CountVectorizer and it can still work.
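
A minimal sketch of what that swap looks like, assuming a recent scikit-learn. The toy corpus, labels, and variable names below are invented for illustration and are not from the original question. The only difference between the two pipelines is the vectorizer; MultinomialNB itself is unchanged and is simply fit on fractional tf-idf values instead of integer counts.

```python
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Toy corpus and labels, made up for this example only.
docs = [
    "cheap pills buy now",
    "buy cheap watches now",
    "meeting agenda for monday",
    "project meeting notes attached",
]
labels = ["spam", "spam", "ham", "ham"]

# Traditional setup: raw term counts (TF) fed to MultinomialNB.
count_nb = make_pipeline(CountVectorizer(), MultinomialNB())

# Same classifier, but the features are tf-idf weights instead of counts.
tfidf_nb = make_pipeline(TfidfVectorizer(), MultinomialNB())

count_nb.fit(docs, labels)
tfidf_nb.fit(docs, labels)

query = ["buy cheap pills"]
count_pred = count_nb.predict(query)
tfidf_pred = tfidf_nb.predict(query)
print("counts:", count_pred[0], "| tf-idf:", tfidf_pred[0])
```

Note that this is plain MultinomialNB applied to tf-idf features, not the transformed model from the Rennie et al. paper. Which of the two feature sets works better is an empirical question for a given dataset.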

_______________________________________________
scikit-learn mailing list
[email protected]
https://mail.python.org/mailman/listinfo/scikit-learn
