2012/10/22 Ark <[email protected]>:
>
>> I don't see the number of non-zeros: could you please do:
>>
>> >>> print vectorizer.transform([my_text_document])
>>
>> as I asked previously? The run time should be linear with the number
>> of non zeros.
> --------------------------------------------
>
> ipdb> print self.vectorizer.transform([doc])
>   (0, 687)      0.0303117660218
>   (0, 1145)     0.0636126446646

Hum strange, I would have expected the coo_matrix.__repr__ output that
gives the number of non zeros.

Maybe instead:

>>> print self.vectorizer.transform([doc]).data.shape

-- 
Olivier
http://twitter.com/ogrisel - http://github.com/ogrisel

------------------------------------------------------------------------------
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_sfd2d_oct
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to