2014-09-09 3:36 GMT+02:00 Apu Mishra :
> Lars Buitinck writes:
>
>> The way to combine HV and
>> Tfidf is
>>
>> hashing = HashingVectorizer(non_negative=True, norm=None)
>> tfidf = TfidfTransformer()
>> hashing_tfidf = Pipeline([("hashing", hashing), ("tidf", tfidf)])
>>
>
> I notice your use of t
Lars Buitinck writes:
> The way to combine HV and
> Tfidf is
>
> hashing = HashingVectorizer(non_negative=True, norm=None)
> tfidf = TfidfTransformer()
> hashing_tfidf = Pipeline([("hashing", hashing), ("tidf", tfidf)])
>
I notice your use of the non_negative option in HashingVectorizer(), whe
2013/10/2 Minkoo :
> I have a question on using HashingVectorizer with TFxIDF. Currently, I'm
> trying to build a model to predict classes for large set of documents.
>
> On the other hand TfIdfVectorizer does not support processing documents in
> batch. It needs to load the entire feature vector i
You can use a Pipeline to combine a TfidfTransformer with the HashingVectorizer.
--
Olivier
--
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for
Hi scikit learn.
I have a question on using HashingVectorizer with TFxIDF. Currently, I'm
trying to build a model to predict classes for large set of documents.
I'm using HashingVectorizer as my data is large. Because I can feed some
batch of documents to HashingVectorizer, it's perfect fit for m