Suppose I used TfidfVectorizer to create features, trained a classifier,
did cross-validation, etc. Let's say I am happy with the result and I want
to use my classifier on new data. When I convert my new (unlabeled) data
to a feature vector, don't I need the IDF weights from the original
TfidfVectorizer to calculate the tf-idf of the words in each new
(unlabeled) data point? If so, is there an easy way to do this?
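
To make the question concrete, here is a minimal sketch of the workflow I
have in mind. It assumes that the fitted vectorizer stores the vocabulary
and IDF weights (exposed as `idf_`) and that calling `transform()` on new
text reuses them instead of re-estimating them. The toy documents, labels,
and the choice of LogisticRegression are made up for illustration only.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression

# Toy labeled training data (hypothetical, for illustration only).
train_docs = [
    "the cat sat on the mat",
    "dogs chase cats",
    "stocks fell sharply today",
    "markets rallied on earnings",
]
train_labels = [0, 0, 1, 1]

# fit_transform learns the vocabulary and the IDF weights from the
# training documents and returns their tf-idf matrix.
vectorizer = TfidfVectorizer()
X_train = vectorizer.fit_transform(train_docs)

clf = LogisticRegression()
clf.fit(X_train, train_labels)

# For new, unlabeled data, call transform (not fit_transform) on the SAME
# vectorizer object, so the stored vocabulary and IDF weights are reused.
new_docs = ["the cat chased the dog", "earnings lifted stocks"]
X_new = vectorizer.transform(new_docs)

predictions = clf.predict(X_new)
```

If that is right, then the only thing to keep around besides the
classifier is the fitted vectorizer object itself. Words in the new data
that were not in the training vocabulary would simply be ignored.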

Thanks
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
