[ 
https://issues.apache.org/jira/browse/MAHOUT-348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Owen resolved MAHOUT-348.
------------------------------

      Assignee: Sean Owen
    Resolution: Duplicate

I agree with this though would consider it something subsumed by MAHOUT-167, 
MAHOUT-294 as they are about using "AbstractJob" which implements Tool.

> Trainer jobs should implement Hadoop's Tool
> -------------------------------------------
>
>                 Key: MAHOUT-348
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-348
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Classification
>    Affects Versions: 0.3
>            Reporter: Ferdy
>            Assignee: Sean Owen
>
> It would be nice if the Trainer jobs (and Mahout jobs in general, those not 
> already doing so) would implement Tool. From the Hadoop's javadocs:
> "Tool, is the standard for any Map-Reduce tool/application. The 
> tool/application should delegate the handling of standard command-line 
> options to ToolRunner.run(Tool, String[]) and only handle its custom 
> arguments."
> The problem we are running into currently is the fact that as of Mahout 0.3 
> there is no way to submit a CBayesDriver job with custom Configuration. 
> Therefore it is not possible to set the classpath right for it's Mappers and 
> Reducers, if one is to run the CBayesDriver with the generic "-libjars" 
> option. Of course, this particular problem could be solved by just putting 
> the required jars in the Hadoop lib dir, however this not always possible. 
> For a custom Hadoop deployment (shared among many users and different types 
> of jobs), every job should be able to specify it's own library dependencies.
> Note: I'm currently aware of issue MAHOUT-167, which has limited overlap with 
> this issue: MAHOUT-167 states that the new API should be used (particulary 
> for Clustering jobs). This issue addresses the needs for implementing a 
> Hadoop Job interface at all, preferably Tool.
> Also, there's issue MAHOUT-294, an effort to track all changes surrounding 
> the Job API.
> Let me hear your thoughts, and I'll whip up a patch when needed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to