Thank you for the detailed explanation. I think the approach with the feedback mechanism seems appropriate at this point.
> If you plan to seriously increase the number of documents in your > corpus you could also try a Rocchio classifier [1] or a k-NN > classifier. For large text documents collections it's probably more > interesting to implement them can be implemented on top of search > engine such as solr or elastic search with similarity queries. ------------------------------------------------------------------------------ LIMITED TIME SALE - Full Year of Microsoft Training For Just $49.99! 1,500+ hours of tutorials including VisualStudio 2012, Windows 8, SharePoint 2013, SQL 2012, MVC 4, more. BEST VALUE: New Multi-Library Power Pack includes Mobile, Cloud, Java, and UX Design. Lowest price ever! Ends 9/20/13. http://pubads.g.doubleclick.net/gampad/clk?id=58041151&iu=/4140/ostg.clktrk _______________________________________________ Scikit-learn-general mailing list Scikit-learn-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/scikit-learn-general