how to force set fetch-status without actually fetching

Sourajit Basak Mon, 08 Apr 2013 04:15:52 -0700

We have a use case where we are generating multiple parse outputs per url.
In short the url hosts a custom xml file which is being parsed to generate
several records.


However, in reality the discovered or generated urls are not actually
fetched. According to  NUTCH-514, anything which isn't fetched will be
skipped during index.

We need to override this behavior. Any ideas how it can be accomplished ?

how to force set fetch-status without actually fetching

Reply via email to