Hi, I seem that even if there is a slight change in the corpus, I have to run TfidfVectorizer on the whole corpus again. This can be time-consuming especially for large corpora.
Is there a way to generate the tf-idf matrix incrementally so that if there is a slight change in the corpus, it will just take a little time instead of a lot of time to generate the tf-idf matrix? Thanks. -- Regards, Peng _______________________________________________ scikit-learn mailing list scikit-learn@python.org https://mail.python.org/mailman/listinfo/scikit-learn