Hi Peng, I believe the set of English stop words used across all token vectorizers can be found in https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/feature_extraction/_stop_words.py.
Cheers, Jon On Mon, Jan 27, 2020 at 3:33 PM Peng Yu <pengyu...@gmail.com> wrote: > Hi, > > I don't see what stopwords are used by CountVectorizer with > stop_wordsstring = ‘english’. > > > https://scikit-learn.org/stable/modules/generated/sklearn.feature_extraction.text.CountVectorizer.html > > Is there a way to figure it out? Thanks. > > -- > Regards, > Peng > _______________________________________________ > scikit-learn mailing list > scikit-learn@python.org > https://mail.python.org/mailman/listinfo/scikit-learn >
_______________________________________________ scikit-learn mailing list scikit-learn@python.org https://mail.python.org/mailman/listinfo/scikit-learn