Re: [CODE4LIB] Automatic Content Classification recommendations?

2011-11-28 Thread diego ferreyra
TemaTres Keyword Extactor is tool to automatic categorization of texts based on supplied controlled vocabularies. Is a php tool to extract terms from a text and use it to obtain keywords from a specific controlled vocabulary. Use the terminological web services provided by TemaTres. does not

Re: [CODE4LIB] Automatic Content Classification recommendations?

2011-11-28 Thread Jason Stirnaman
ConceptSearch http://www.conceptsearching.com/web/ is a commercial search engine and classification tool. Maybe similar to TemaTres, it doesn't use machine-learning but extracts concepts out of your documents that can be mapped to vocabulary terms. The vocabulary is then exposed to the end-user

[CODE4LIB] Automatic Content Classification recommendations?

2011-11-27 Thread Peter Neish
Hi there, Just wondering if anyone has any recommendations for systems that will do automatic content classification through machine learning? We want to classify newspaper articles using terms from our existing thesaurus and have a fairly big set of articles already tagged that could be used as

Re: [CODE4LIB] Automatic Content Classification recommendations?

2011-11-27 Thread Thomas Krichel
Peter Neish writes Just wondering if anyone has any recommendations for systems that will do automatic content classification through machine learning? I use LibSVM in AuthorClaim (http://authorclaim.org) and svm_light in NEP (http://nep.repec.org). I found both very helpful. I