Ark <ark_antos@...> writes:

> > How large (in bytes and in which format)? What are n_samples,
> > n_features and n_classes?
>
> Input data is in the form of paragraphs from English literature
> So, raw data -> CountVectorizer -> test, train set -> sgd.fit -> predict is the flow.
> n_samples=10000, n_features=100,000, n_classes=max 100 [still collecting data]

_______________________________________________
Scikit-learn-general mailing list
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
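For reference, the flow described above (raw data -> CountVectorizer -> test, train set -> sgd.fit -> predict) might look roughly like the sketch below. This is only an illustration under assumptions: the toy `paragraphs` and `labels` are made up, `SGDClassifier` is assumed to be what "sgd" refers to, the default hyperparameters are not taken from the thread, and `train_test_split` is imported from `sklearn.model_selection`, which is where current scikit-learn releases keep it.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import SGDClassifier
from sklearn.model_selection import train_test_split

# Hypothetical toy corpus standing in for paragraphs of English literature.
paragraphs = [
    "The whale surfaced beside the ship as the sea grew calm.",
    "The captain watched the sea and cursed the white whale.",
    "A harpoon flew from the ship toward the whale in the sea.",
    "The sailors rowed the boat across the dark sea at dawn.",
    "She danced at the ball and spoke with the young gentleman.",
    "The gentleman admired the lady and her fine estate.",
    "A letter arrived at the estate announcing the ball.",
    "The lady refused the proposal of the proud gentleman.",
]
labels = ["sea", "sea", "sea", "sea", "manners", "manners", "manners", "manners"]

# raw data -> CountVectorizer: sparse bag-of-words counts,
# shape (n_samples, n_features).
vectorizer = CountVectorizer()
X = vectorizer.fit_transform(paragraphs)

# -> test, train set (stratified so both classes appear in each split).
X_train, X_test, y_train, y_test = train_test_split(
    X, labels, test_size=0.25, random_state=0, stratify=labels
)

# -> sgd.fit -> predict
clf = SGDClassifier(random_state=0)
clf.fit(X_train, y_train)
predicted = clf.predict(X_test)

print("n_samples=%d, n_features=%d" % X.shape)
print("predicted:", list(predicted))
print("expected: ", list(y_test))
```

The sketch vectorizes before splitting because that is the order given in the thread. Fitting the vectorizer on the training portion only and calling `transform` on the test portion is a common alternative.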
