Hi Zennet,

> The URLFilter plugin is already working from previous development but
> my changes to the code don't take effect.
>
> Here are the steps I've taken:
> 1. Modified the existing implementation of URLFilter interface

Okay.

> 2. Built the project with ant

Good.

> 3. Copied build/plugin/* to NUTCH_HOME/plugins

You don't need to do this if you're running the crawl tool. By default, the crawl tool loads plugins from $NUTCH_HOME/build/plugins.

> 4. Ran the generate-fetch-index cycle

Okay.

>
> I modified filter() to write some debug statements to a file and
> return null for every url (for debugging purposes). I know my code was
> not executed because no urls should have been indexed and there were
> no debug statements in the file. I suspect that step 3 is what I am doing
> incorrectly or there is some other file I need to modify.

Did you enable the plugin in the nutch-default.xml file in the conf directory? Make sure the plugin is enabled there. Can you also post a text capture of your crawl log?

Thanks,
Chris

>
> I appreciate any help.
>
> Thanks,
> zennet
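
For reference, enabling a plugin comes down to the plugin.includes property in the Nutch configuration. The fragment below is only a sketch under assumptions: the name urlfilter-custom is a hypothetical placeholder for whatever your plugin is actually called, and the other entries are illustrative, not your real list. Site-specific overrides normally go in conf/nutch-site.xml, which takes precedence over nutch-default.xml.

```xml
<!-- Sketch only: "urlfilter-custom" is a hypothetical plugin name;
     substitute the name your own plugin uses. The remaining entries
     are illustrative and should be replaced by your existing list. -->
<property>
  <name>plugin.includes</name>
  <value>protocol-http|urlfilter-(regex|custom)|parse-(text|html)|index-basic</value>
</property>
```

The value is a regular expression, so a plugin that it does not match is not loaded, and its filter() is never called.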
