Hi, I want to implement an alternative crawler that doesn't crawl over html files but uses a proprietary index to identify targets to download. The apparent reliance of Nutch 2.1 on hbase is a bit daunting as i don't want to use too many new technologies.
Is Nutch 1.5.1 a better fit for me or should i look for something entirely different?

