Hi,

I want to implement an alternative crawler that doesn't crawl over html files 
but uses a proprietary index to identify targets to download. 
The apparent reliance of Nutch 2.1 on hbase is a bit daunting as i don't want 
to use too many new technologies.

Is Nutch 1.5.1 a better fit for me or should i look for something entirely 
different?

Reply via email to