I am quite new to nutch. I have crawled a site successfully using nutch 1.2
and extracted segment dump by *readseg* command but issue is that dump
contains lot of information other than url and outlinks also if i want to
analyse it, manual approach needs to be adopted. It would be really great
if there is any utiltiy, plugin which export link with out links in machine
readable format like csv or sql. Please suggest

Reply via email to