Nutch how to create database or other storage to store scraped data other than the url?

2019-03-23 Thread hxdariux
I'm new to nutch and am trying to develop nutch plugins to parse html contents of the crawled urls and to scrape for certain data (for my case I'm gathering bitcoin addresses id). However, I learned that the nutch lifecycle produces batches of urls, so my question is, is it possible and how to

Nutch how to create database or other storage to store scraped data other than the url?

2019-03-23 Thread hxdariux
I'm new to nutch and am trying to develop nutch plugins to parse html contents of the crawled urls and to scrape for certain data (for my case I'm gathering bitcoin addresses id). However, I learned that the nutch lifecycle produces batches of urls, so my question is, is it possible and how to