[ http://issues.apache.org/jira/browse/NUTCH-209?page=all ]
     
Doug Cutting resolved NUTCH-209:
--------------------------------

    Resolution: Fixed

I just committed this.

Michael, the 'bin/hadoop jar' command is not (yet) used by Nutch.  Please file 
a Hadoop bug to add the feature you're asking for.

> include nutch jar in mapred jobs
> --------------------------------
>
>          Key: NUTCH-209
>          URL: http://issues.apache.org/jira/browse/NUTCH-209
>      Project: Nutch
>         Type: Improvement
>     Versions: 0.8-dev
>     Reporter: Doug Cutting
>     Priority: Minor
>      Fix For: 0.8-dev

>
> I just added a simple way in Hadoop to specify the job jar file.  When 
> constructing a JobConf one can specify a class whose containing jar is set to 
> be the job's jar.  To take advantage of this in Nutch, we could add a util 
> class:
> public class NutchJob extends JobConf {
>   public NutchJob(Configuration conf) {
>     super(conf, NutchJob.class);
>   }
> }
> Then change all of the places where we construct a JobConf to instead 
> construct a NutchJob.
> Finally, we should add an ant target called 'job' that constructs a job jar, 
> containing all of the classes and the plugins, and make this the default 
> target.  This way all Nutch code can be distributed with each job as it is 
> submitted, and daemons would only need to be restarted when Hadoop code is 
> updated.
> Does this sound reasonable?

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira

Reply via email to