Hello,

I am trying to run a crawl manually, restricted to a fixed list of URLs.
I issued the following commands:

bin/nutch inject /home/crawl/crawldb /home/urls

bin/nutch generate /home/crawl/crawldb /home/crawl/segments

bin/nutch fetch /home/crawl/segments/123456789

bin/nutch updatedb /home/crawl/crawldb /home/crawl/segments/123456789 -noAdditions

However, the last command skips the segment 123456789, saying it is an
invalid segment.

The -noAdditions flag is exactly what I need, but updatedb will not
update the crawldb. What might I have done wrong?

Best Regards,
-C.B.
