Hi Zennet,

> 
> The URLFilter plugin is already working from previous development but
> my changes to the code don't take effect.
> 
> Here are the steps I've taken:
> 1. Modified the existing implementation of URLFilter interface

Okay.

> 2. Built the project with ant

Good.

> 3. Copied build/plugin/* to NUTCH_HOME/plugins

You don't need to do this if you're running the crawl tool. The crawl tool
will by default load plugins out of $NUTCH_HOME/build/plugins

> 4. Ran the generate-fetch-index cycle

Okay
> 
> I modified filter() to write some debug statements to a file and
> return null for every url (for debugging purposes). I know my code was
> not executed because no urls should have been indexed and there were
> debug statements in the file. I suspect that step 3 is what I am doing
> incorrectly or there is some other file I need to modify.

Did you enable the plugin in the nutch-default.xml file within the conf
directory? Make sure that you enable the plugin there. Can you post a txt
capture of your crawl log?

Thanks,
 Chris


> 
> I appreciate any help.
> 
> Thanks,
> zennet

Reply via email to