Hi,

It seems that even a slight change to the corpus requires running
TfidfVectorizer on the whole corpus again. This can be
time-consuming, especially for large corpora.

Is there a way to build the tf-idf matrix incrementally, so that a
slight change in the corpus costs only a small amount of recomputation
instead of a full rebuild of the tf-idf matrix? Thanks.
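
To make the question concrete, here is a rough sketch of the kind of
incremental scheme I have in mind. It is only an illustration under
assumptions, not an existing scikit-learn feature: it swaps
TfidfVectorizer for a stateless HashingVectorizer (so there is no
vocabulary to refit, at the price of possible hash collisions), caches
one row of raw counts per document, and recomputes only the idf
weights with TfidfTransformer. The helper names update_document and
tfidf_from_rows are made up for this example.

```python
import scipy.sparse as sp
from sklearn.feature_extraction.text import HashingVectorizer, TfidfTransformer

# Stateless vectorizer: nothing to refit when documents change.
# alternate_sign=False and norm=None keep raw, non-negative term counts.
hasher = HashingVectorizer(n_features=2**18, alternate_sign=False, norm=None)

corpus = ["the cat sat", "the dog barked", "a cat and a dog"]

# Cache one sparse row of counts per document.
rows = [hasher.transform([doc]) for doc in corpus]


def update_document(rows, index, new_text):
    """Hypothetical helper: re-hash only the document that changed."""
    rows[index] = hasher.transform([new_text])


def tfidf_from_rows(rows):
    """Hypothetical helper: rebuild tf-idf from cached counts.

    No text is tokenized here; only the idf weights and the row
    normalization are recomputed.
    """
    counts = sp.vstack(rows, format="csr")
    return TfidfTransformer().fit_transform(counts)


update_document(rows, 1, "the dog slept")
tfidf = tfidf_from_rows(rows)
```

The idf step still touches the whole count matrix, since every idf
weight depends on all documents, but the tokenization of unchanged
documents is skipped. Whether that is the expensive part for a given
corpus is an assumption that would need measuring.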

-- 
Regards,
Peng
_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn
