[ https://issues.apache.org/jira/browse/MAHOUT-294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12888439#action_12888439 ]
Jeff Eastman commented on MAHOUT-294:
-------------------------------------
The bug reported above has been fixed (topicSmoothing was doing the right thing
in the driver), and I've been looking at reparenting the clustering drivers to
AbstractJob. The primary benefits I can see are the consolidation of the input
and output command-line options and the parseArguments() method, which is
useful for most, but not all, map/reduce job configuration steps. AbstractJob
requires an instance run(String[]) method (the Tool interface actually requires
it), and this will mean some refactoring of the clustering drivers, since they
all have a static runJob() method that accepts Java arguments and is called by
main(). Also, using addOption() will require backing out recent changes in
DefaultOptionsCreator, or at least refactoring that too.
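
To make the shape of that refactoring concrete, here is a rough sketch of a
driver that gains an instance run(String[]) method while keeping its static
runJob() entry point. It rests on several assumptions: SimpleTool is only a
stand-in for Hadoop's Tool interface (which declares int run(String[] args)
throws Exception) so that the example compiles without Hadoop, and
ExampleDriver, its "--input"/"--output" options, its simplified
parseArguments() and the runJob() signature are all hypothetical, not taken
from AbstractJob or any existing Mahout driver.

```java
import java.util.HashMap;
import java.util.Map;

public class ExampleDriver {

  /** Stand-in for Hadoop's Tool interface, so the sketch compiles without Hadoop. */
  interface SimpleTool {
    int run(String[] args) throws Exception;
  }

  /** Hypothetical clustering driver reworked to expose an instance run() method. */
  static class Driver implements SimpleTool {

    /** Simplified, hypothetical argument parser: reads "--name value" pairs. */
    static Map<String, String> parseArguments(String[] args) {
      Map<String, String> parsed = new HashMap<>();
      for (int i = 0; i + 1 < args.length; i += 2) {
        parsed.put(args[i], args[i + 1]);
      }
      // Signal failure with null when a required option is absent.
      if (!parsed.containsKey("--input") || !parsed.containsKey("--output")) {
        return null;
      }
      return parsed;
    }

    /** The static entry point stays, so existing Java callers keep working. */
    static String runJob(String input, String output) {
      // A real driver would configure and submit the map/reduce job here.
      return "clustering " + input + " -> " + output;
    }

    /** Command-line path: parse the options, then delegate to runJob(). */
    @Override
    public int run(String[] args) {
      Map<String, String> parsed = parseArguments(args);
      if (parsed == null) {
        return -1;
      }
      System.out.println(runJob(parsed.get("--input"), parsed.get("--output")));
      return 0;
    }
  }

  public static void main(String[] args) throws Exception {
    int exitCode = new Driver().run(args);
    System.out.println("exit code: " + exitCode);
  }
}
```

With this layout main() only has to hand its arguments to run(), and code that
already calls runJob() with Java arguments is unaffected.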

It is interesting that the Tool interface comment uses both Configuration
(Hadoop 0.20.2) and JobConf (deprecated).

Anyway, I'm not sure this warrants all the effort. I'm game, but I need a
little more encouragement and a clearer idea of the benefits.

> Uniform API behavior for Jobs
> -----------------------------
>
> Key: MAHOUT-294
> URL: https://issues.apache.org/jira/browse/MAHOUT-294
> Project: Mahout
> Issue Type: Improvement
> Components: Classification, Clustering, Collaborative Filtering,
> Frequent Itemset/Association Rule Mining, Genetic Algorithms, Math, Utils
> Affects Versions: 0.4
> Reporter: Robin Anil
> Fix For: 0.4
>
>
> * Move AbstractJob to common and convert all the Driver classes to extend
> that.
> One suggestion is:
>     AlgorithmParams params = ParamsBuilder.build().withParam("-i", input).withParam("-o", output)....
>     MyAlgorithm.runJob(params) throws ParameterMissingException;
> * Give uniform command-line parameters for various algorithms.
> e.g., the distance measure is currently -d, -dm, or -m in different places in
> clustering
> * Add a temp directory as a parameter
> http://www.lucidimagination.com/search/document/28a979aa62c02a1/who_owns_mahout_bucket_on_s3#ddb5855e8bdace45
> This issue will keep track of all discussion/patches related to the design
> and cleanup of Mahout API
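
One possible reading of the parameter-builder suggestion quoted above is
sketched below. The class names follow the quoted fragment (with the algorithm
class spelled MyAlgorithm here), but every class body is an assumption made for
illustration: the issue does not define these types, and the require() helper
and the String return value of runJob() are hypothetical. The elided part of
the original call chain is left alone; in this sketch build() simply returns
the AlgorithmParams object that withParam() then fills in.

```java
import java.util.HashMap;
import java.util.Map;

public class ParamsSketch {

  /** Thrown when a job asks for a parameter that was never supplied. */
  static class ParameterMissingException extends Exception {
    ParameterMissingException(String name) {
      super("Missing required parameter: " + name);
    }
  }

  /** Hypothetical parameter holder, keyed by command-line flag such as "-i". */
  static class AlgorithmParams {
    private final Map<String, String> values = new HashMap<>();

    /** Stores one parameter and returns this object so calls can be chained. */
    AlgorithmParams withParam(String name, String value) {
      values.put(name, value);
      return this;
    }

    /** Hypothetical helper: returns the value or fails if it is absent. */
    String require(String name) throws ParameterMissingException {
      String value = values.get(name);
      if (value == null) {
        throw new ParameterMissingException(name);
      }
      return value;
    }
  }

  /** Entry point for the chained style shown in the issue description. */
  static class ParamsBuilder {
    static AlgorithmParams build() {
      return new AlgorithmParams();
    }
  }

  /** Hypothetical algorithm driver that validates its parameters up front. */
  static class MyAlgorithm {
    static String runJob(AlgorithmParams params) throws ParameterMissingException {
      String input = params.require("-i");
      String output = params.require("-o");
      // A real driver would launch the job here; the sketch just reports the paths.
      return input + " -> " + output;
    }
  }

  public static void main(String[] args) throws Exception {
    AlgorithmParams params =
        ParamsBuilder.build().withParam("-i", "input-dir").withParam("-o", "output-dir");
    System.out.println(MyAlgorithm.runJob(params));
  }
}
```

Because every driver would read its options through the same holder, a flag
such as the distance measure could be given one name in one place.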