I have a database of documents. In other words, each tuple contains a document that needs to be classified. Does Mahout API provide such capability that I connect to DB, get the document, classify and write the label back to database?
I am aware I can connect to DB separately, loop through tuples, convert each tuple to a document, then use Mahout API to classify, and write back to the database, at the end. Is this the way to go? To be more specific, does BayesFileFormatter in Mahout API come with readerToDatabase method? or is there a way to use readerToDocument method along with a database tuple instead of Files.newReader()? What is the best practice to connect and read/write from/to DB from Mahout classifier? -- View this message in context: http://lucene.472066.n3.nabble.com/Classifying-documents-in-database-tp3505846p3505846.html Sent from the Mahout User List mailing list archive at Nabble.com.
