Hi, We have a tool (CommonCrawlDataDumper) to convert Nutch data into Common Crawl WARC format.
Can we import WARC file back to Nutch data (segments) and is there any tool to do that? Thanks, Tien
Hi, We have a tool (CommonCrawlDataDumper) to convert Nutch data into Common Crawl WARC format.
Can we import WARC file back to Nutch data (segments) and is there any tool to do that? Thanks, Tien