In Nutch 0.7, I wanted to change Nutch's behavior such that when I inject a file it will add the page, even if it is already present.
I did this because I can prepare a list of changed files that I have on my intranet and want Nutch to reindex them right away. I made a change (suggested by Howie Wang) to org.apache.nutch.db.WebDBInjector by changing the addPage method. I replaced the line: dbWriter.addPageIfNotPresent(page); with: dbWriter.addPageWithScore(page); Question: I'm moving to Nutch 0.8 and I'd like similar behavior, but I don't know where to put them as a lot of code has changed (and there's no longer a WebDBInjector.java file). How can I accomplish this? If there is a more appropriate way to do this please let me know that also. Thanks, Ben
