Hello,
If a want to crawl a set A of pages, and a set B of pages,
but using a config(A) for A, and a config(B) for B, which is
the suggested 'best strategy' ?
In my mind:
$NUTCH_HOME/runtime -> Keep it as a 'vanilla' reference
$NUTCH_HOME/runtime_A -> A 'clone' of 'vanilla' with a custom
setup for set A
$NUTCH_HOME/runtime_B -> B 'clone' of 'vanilla' with a custom
setup for set B
Any way to do better ? Is my setup reccomended or not ?
A simple multiple installation seems overkilling (multiple
$NUTCH_HOME), and I
think it can be preferred only is config(A) and config(B) have to use
different
nutch versions.
Alessio