Hello.

I am using MCF 2.0.2 for crawling the web and ingesting data into Solr.

MCF has ingested into Solr documents that returned an HTTP error, say
401, 403, or 404, or that contain text such as "this page has expired and
has been removed".

The question is:
Is there a way to tell MCF to ingest
- only documents that do not contain certain content, such as "Not Found", or
- only documents whose HTTP status is not 401, 403, 404, 500, ...
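
To make the rule I have in mind concrete, here is a minimal sketch in Python.
It is not MCF code and does not use any MCF API; the names should_ingest,
REJECTED_STATUSES and REJECTED_PHRASES are hypothetical and only illustrate
the behaviour I am looking for, with both checks combined.

```python
# Hypothetical filter, not part of MCF: decides whether a fetched page
# should be passed on to Solr.
REJECTED_STATUSES = {401, 403, 404, 500}
REJECTED_PHRASES = ("Not Found", "this page has expired and has been removed")


def should_ingest(status_code: int, body: str) -> bool:
    """Return True only if the page passes both the status and content checks."""
    # Reject pages whose HTTP status is in the unwanted set.
    if status_code in REJECTED_STATUSES:
        return False
    # Reject pages whose body contains any unwanted phrase (case-insensitive).
    lowered = body.lower()
    return not any(phrase.lower() in lowered for phrase in REJECTED_PHRASES)
```

For example, a page served with status 200 whose body says "this page has
expired and has been removed" would be skipped, as would any page served
with status 404.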

Thank you very much.

Arcadius.
