Simple, use scripts to operate on different segments (and/or crawldb's and 
configuratons). I have setups with multiple NUTCH_HOME's, each with an 
isolated crawl.

On Tuesday 12 July 2011 13:59:07 jeffersonzhou wrote:
> Hi,
> 
> 
> 
> I want to do my own parser and separate all the interesting URLs into a new
> segment other than Nutch’s default segments. Can I do so? How?
> 
> 
> 
> Thanks.

-- 
Markus Jelsma - CTO - Openindex
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350

Reply via email to