How can one remove documents from a specific domain from an existing Nutch db?
Addding a filter to regex-urlfilter.txt seems to prevent them from
being added to the linkDb, but documents already in there are not
affected at all, and I could not see how else to do this.
It can't possibly be that I have to completely recreate the crawl folder, is it?

Reply via email to