2012/10/22 Ark <[email protected]>:
>
>> I don't see the number of non-zeros: could you please do:
>>
>> >>> print vectorizer.transform([my_text_document])
>>
>> as I asked previously? The run time should be linear with the number
>> of non zeros.
> --------------------------------------------
>
> ipdb> print self.vectorizer.transform([doc])
>   (0, 687)      0.0303117660218
>   (0, 1145)     0.0636126446646

Hm, strange: I would have expected the coo_matrix.__repr__ output, which
gives the number of non-zeros. Maybe try this instead:

>>> print self.vectorizer.transform([doc]).data.shape

-- 
Olivier
http://twitter.com/ogrisel - http://github.com/ogrisel

_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
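
For anyone reading this thread later: `print` on a SciPy sparse matrix goes through `__str__`, which lists the stored entries one per line, while `repr()` gives the one-line summary that includes the stored-element count. A minimal sketch of the different ways to get that count follows. It assumes only NumPy and SciPy; the matrix `row` is a hand-built, hypothetical stand-in for `vectorizer.transform([doc])`, holding the two entries visible in the quoted output, and the column count of 2000 is an arbitrary assumption.

```python
import numpy as np
from scipy import sparse

# Hypothetical stand-in for vectorizer.transform([doc]): a single sparse row.
# The two values and column indices are copied from the quoted ipdb output;
# the width of 2000 columns is an assumption made only for this sketch.
values = np.array([0.0303117660218, 0.0636126446646])
row_idx = np.array([0, 0])
col_idx = np.array([687, 1145])
row = sparse.csr_matrix((values, (row_idx, col_idx)), shape=(1, 2000))

entries_listing = str(row)   # what print(row) displays: the stored entries
summary = repr(row)          # one-line summary with the stored-element count

n_stored = row.nnz           # number of stored entries in the matrix
data_shape = row.data.shape  # same count, as a one-element tuple

print(summary)
print(n_stored, data_shape)
```

Either `row.nnz` or `row.data.shape` answers the question asked in the thread, and `repr(row)` shows the same number alongside the matrix shape.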
