I'm new to nutch and am trying to develop nutch plugins to parse html
contents of the crawled urls and to scrape for certain data (for my case I'm
gathering bitcoin addresses id). However, I learned that the nutch lifecycle
produces batches of urls, so my question is, is it possible and how to
I'm new to nutch and am trying to develop nutch plugins to parse html
contents of the crawled urls and to scrape for certain data (for my case I'm
gathering bitcoin addresses id). However, I learned that the nutch lifecycle
produces batches of urls, so my question is, is it possible and how to
2 matches
Mail list logo