2012/9/7 Lars Buitinck <[email protected]>: > 2012/9/7 Mathieu Blondel <[email protected]>: >> On my box, RidgeClassifier finishes in 136 seconds but kNN dies with >> MemoryError. > > That's incredibly slow compared to all the other classifiers. Also > without --all_categories, it's slower than everything else. > > I've already found out where things go wrong, though I still have no > idea why this didn't occur before.
Maybe the default feature extraction has changed and made the matrix much denser that it used to be for this example? Although recent changes to the vectorizer would tend to decrease the number of features (min_df=2) hence make the problem smaller to solve. -- Olivier http://twitter.com/ogrisel - http://github.com/ogrisel ------------------------------------------------------------------------------ Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ _______________________________________________ Scikit-learn-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
