[ https://issues.apache.org/jira/browse/MAHOUT-1421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13967840#comment-13967840 ]
jay vyas commented on MAHOUT-1421: ---------------------------------- Hi sebastian. I the other docs JIRAs, like MAHOUT-1441 , are sort of blocking me. for example: - Its not clear from the code what the right way to put adapters around Existing Recommenders. - There are still some remaining documentation holes in the clusterers (i.e. MAHOUT-1441). So I think its good to keep this JIRA open, but first we will need to let the docs catch up, what do you think? > Adapter package for all mahout tools > ------------------------------------ > > Key: MAHOUT-1421 > URL: https://issues.apache.org/jira/browse/MAHOUT-1421 > Project: Mahout > Issue Type: Improvement > Reporter: jay vyas > Fix For: 1.0 > > > Hi mahout. I'd like to create an umbrella JIRA for allowing more runtime > flexibility for reading different types of input formats for all mahout > tasks. > Specifically, I'd like to start with the FreeTextRecommenderAdapeter, which > typically requires: > 1) Hashing text entries into numbers > 2) Saving the large transformed file on disk > 3) Feeding it into classifieer > Instead, we could build adapters into the classifier itself, so that the user > 1) Specifies input file to recommender > 2) Specifies transformation class which converts each record of input to 3 > column recommender format > 3) Runs internal mahout recommender directly against the data > And thus the user could easily run mahout against existing data without > having to munge it to much. > This package might be called something like "org.apache.mahout.adapters", and > would over time provide flexible adapters to the core mahout algorithm > implementations, so that folks wouldnt have to worry so much about > vectors/csv transformers/etc... > Any thoughts on this? If positive feedback I can submit an initial patch to > get things started. -- This message was sent by Atlassian JIRA (v6.2#6252)