How can one remove documents from a specific domain from an existing Nutch db? Addding a filter to regex-urlfilter.txt seems to prevent them from being added to the linkDb, but documents already in there are not affected at all, and I could not see how else to do this. It can't possibly be that I have to completely recreate the crawl folder, is it?
- How to remove domain from Nutch DB Dietrich
- Re: How to remove domain from Nutch DB Markus Jelsma

