Hey! I am currently using sklearn.feature_extraction.text.Vectorizer (<http://scikit-learn.org/stable/modules/generated/sklearn.feature_extraction.text.Vectorizer.html>) for feature extraction on my text documents.
I am now curious and don't quite understand how the TF-IDF calculation is done. Is it done separately for each document, or based on all documents? It can't be done per class of documents, because information about the labels is not available.
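To make the question concrete, here is a minimal sketch of how I would try to check this empirically. It assumes the `TfidfVectorizer` class with default settings behaves like the `Vectorizer` I am using (that is my assumption, not something I have confirmed), and the two tiny corpora and the helper `weights_for_first_doc` are made up for illustration. The idea: put the same document into two different corpora and see whether its weights change.

```python
from sklearn.feature_extraction.text import TfidfVectorizer

# Two toy corpora (hypothetical data). The first document is identical in both.
docs_a = ["apple banana", "apple cherry"]   # "apple" occurs in both documents
docs_b = ["apple banana", "banana cherry"]  # "banana" occurs in both documents


def weights_for_first_doc(corpus):
    """Fit on the whole corpus (no labels passed) and return the
    non-zero term weights of the first document as a dict."""
    vec = TfidfVectorizer()
    X = vec.fit_transform(corpus)
    row = X.toarray()[0]
    return {term: row[idx] for term, idx in vec.vocabulary_.items() if row[idx] > 0}


w_a = weights_for_first_doc(docs_a)
w_b = weights_for_first_doc(docs_b)

print("first document inside corpus A:", w_a)
print("first document inside corpus B:", w_b)
```

If the weights of "apple banana" differ between the two runs, the document-frequency part must come from all documents passed to `fit`, while the term counts themselves come from each document on its own.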
Hope you can give me some explanations regarding this. Thanks! Philipp
_______________________________________________ Scikit-learn-general mailing list Scikit-learn-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/scikit-learn-general