Hi again,

On 6/8/06, Mehdi Hemani <[EMAIL PROTECTED]> wrote:
> 1. I want to filter out webpages based on a list of words. I have tried filtering webpages based on URL, but how can I do it based on words?

For this question, see the following link:

http://wiki.apache.org/nutch/CommandLineOptions

As far as I know, the prune tool listed there should be available in Nutch 0.8 as well (at least I can see that the class is included in the source code, so you should be able to call it).

Regards,
Lukas
