We have discussed it but not implemented it. A previous step before implementing interfaces to use HBase for current Nutch databases was to may the Nutch architecture itself more flexible. This is what I have been terming Nutch 2 and what I have been currently working on.

Dennis

Marcus Herou wrote:
Hi.

Anyone tried to implement HBase as storage for:

* CrawlDB
* LinkDB
* Fetched and parsed url data

It would certainly be cool I think to be able to search in all these three
db's. Currently it is a little bit hard to use the data crawled without
actually indexing it.

Kindly

//Marcus



Reply via email to