[EMAIL PROTECTED] wrote:
I have not been able to solve the mystery of Generator.java modifications taking effect only on the master from which "generate" is run, and not taking effect on any of the slaves, despite slaves having the same Generator.class, the same nutch jar and the same nutch .job file as the ones on the master.
I _highly_ recommend the use of a "clean" hadoop cluster (i.e. installed from a Hadoop binary release) and running nutch jobs through bin/hadoop jar nutch*.job org.apache.nutch....
The default Nutch setup, where classes and resources exist in multiple copies on the classpath, is really suitable only for development and not for a regular operation. I mean, you _can_ run Nutch using the default setup but you need to be very very careful to synchronize all resources across the cluster, and in most cases you need to restart the cluster on every update.
This is not the case if you run Nutch job file on a clean Hadoop cluster. -- Best regards, Andrzej Bialecki <>< ___. ___ ___ ___ _ _ __________________________________ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com
