Hello,

Is there a way to make the crawler follow a certain type of regular
expression for instance? Something like "if I come to this page using
regular expression #2, then I should use regular expression #2 for links
that are going out from this page". This is like running N instances of
Nutch in parallel with each instance having its own regex-urlfilter.

I am trying to achieve really basic category based focusing.

Best,

Emre

Reply via email to