Re: [Scikit-learn-general] Unlabelled and mislabelled data

Ark Tue, 17 Sep 2013 15:38:38 -0700

Thank you for the detailed explanation. I think the approach with
the feedback mechanism seems appropriate at this point.


> If you plan to seriously increase the number of documents in your
> corpus you could also try a Rocchio classifier [1] or a k-NN
> classifier. For large text documents collections it's probably more
> interesting to implement them can be implemented on top of search
> engine such as solr or elastic search with similarity queries.





------------------------------------------------------------------------------
LIMITED TIME SALE - Full Year of Microsoft Training For Just $49.99!
1,500+ hours of tutorials including VisualStudio 2012, Windows 8, SharePoint
2013, SQL 2012, MVC 4, more. BEST VALUE: New Multi-Library Power Pack includes
Mobile, Cloud, Java, and UX Design. Lowest price ever! Ends 9/20/13. 
http://pubads.g.doubleclick.net/gampad/clk?id=58041151&iu=/4140/ostg.clktrk
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Re: [Scikit-learn-general] Unlabelled and mislabelled data

Reply via email to