Hi Marty
This email exchange may help http://www.mail-archive.com/user%40nutch.apache.org/msg13753.html
Steven Hayles Systems Analyst IT Services, University of Leicester, Propsect House, 94 Regent Rd, Leicester, LE1 7DA, UK t: +44 (0)116 229 7950 e: [email protected] Follow us on Twitter http://twitter.com/uniofleicester or visit our Facebook page https://facebook.com/UniofLeicester On Fri, 25 Nov 2016, Marty-Scott Sainty (NWIS - Software Development) wrote:
Hi, Is there a setting to get Nutch to remove 404 pages from Solr? I'm currently testing the behaviour with different status codes and Nutch doesn't remove pages with 404 status codes. Any help would be much appreciated. Cheers, Marty

