In Nutch 0.7, I wanted to change Nutch's behavior such that when I inject a
file it will add the page, even if it is already present.

I did this because I can prepare a list of changed files that I have on my
intranet and want Nutch to reindex them right away.

I made a change (suggested by Howie Wang) to
org.apache.nutch.db.WebDBInjector by changing the addPage method.  I
replaced the line:

 dbWriter.addPageIfNotPresent(page);

with:

 dbWriter.addPageWithScore(page);

Question: I'm moving to Nutch 0.8 and I'd like similar behavior, but I don't
know where to put them as a lot of code has changed (and there's no longer a
WebDBInjector.java file).

How can I accomplish this?  If there is a more appropriate way to do this
please let me know that also.

Thanks,

Ben
-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys -- and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to