We have a use case where we generate multiple parse outputs per URL.
In short, the URL hosts a custom XML file, which is parsed to generate
several records.

However, the discovered or generated URLs are never actually fetched.
According to NUTCH-514, anything that isn't fetched is skipped during
indexing.

We need to override this behavior. Any ideas on how this can be accomplished?
