[ http://issues.apache.org/jira/browse/NUTCH-209?page=all ]
Sami Siren closed NUTCH-209.
----------------------------

> include nutch jar in mapred jobs
> --------------------------------
>
> Key: NUTCH-209
> URL: http://issues.apache.org/jira/browse/NUTCH-209
> Project: Nutch
> Issue Type: Improvement
> Affects Versions: 0.8
> Reporter: Doug Cutting
> Priority: Minor
> Fix For: 0.8
>
>
> I just added a simple way in Hadoop to specify the job jar file. When
> constructing a JobConf one can specify a class whose containing jar is set to
> be the job's jar. To take advantage of this in Nutch, we could add a util
> class:
>
> public class NutchJob extends JobConf {
>   public NutchJob(Configuration conf) {
>     super(conf, NutchJob.class);
>   }
> }
>
> Then change all of the places where we construct a JobConf to instead
> construct a NutchJob.
>
> Finally, we should add an ant target called 'job' that constructs a job jar,
> containing all of the classes and the plugins, and make this the default
> target. This way all Nutch code can be distributed with each job as it is
> submitted, and daemons would only need to be restarted when Hadoop code is
> updated.
>
> Does this sound reasonable?
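For readers unfamiliar with the "containing jar" idea in the description, here is a minimal JDK-only sketch of how a class can be mapped back to the jar it was loaded from. It is not Hadoop's implementation; the names `ContainingJar` and `findContainingJar` are hypothetical and exist only for illustration, and the sketch does not decode URL-escaped characters in paths.

```java
import java.net.URL;

// Hypothetical illustration only: not part of Hadoop or Nutch.
public class ContainingJar {

    /**
     * Returns the path of the jar that holds the given class, or null
     * when the class was not loaded from a jar (for example, from a
     * plain classes directory).
     */
    static String findContainingJar(Class<?> cls) {
        // Resource name of the compiled class, e.g. "org/example/Foo.class".
        String classFile = cls.getName().replace('.', '/') + ".class";

        // Bootstrap classes report a null loader; fall back to the system lookup.
        ClassLoader loader = cls.getClassLoader();
        URL url = (loader != null)
                ? loader.getResource(classFile)
                : ClassLoader.getSystemResource(classFile);

        if (url == null || !"jar".equals(url.getProtocol())) {
            return null;
        }

        // A jar URL path looks like "file:/path/to/some.jar!/org/example/Foo.class".
        String path = url.getPath();
        if (path.startsWith("file:")) {
            path = path.substring("file:".length());
        }
        int bang = path.indexOf('!');
        return (bang >= 0) ? path.substring(0, bang) : path;
    }

    public static void main(String[] args) {
        // Prints a jar path when run from a jar, otherwise "null".
        System.out.println(findContainingJar(ContainingJar.class));
    }
}
```

Passing a marker class such as the proposed `NutchJob` to a lookup of this kind is what would let the job jar be chosen without hard-coding a file name.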
