Hi

I wanted to start a crawl like it is done in the nutch 0.8.x tutorial.
Unfortunately I get the following error:

[EMAIL PROTECTED] nutch-0.8.1]$ bin/nutch crawl urls/nutch -dir crawl.test 
-depth 10
crawl started in: crawl.test
rootUrlDir = urls/nutch
threads = 10
depth = 10
Injector: starting
Injector: crawlDb: crawl.test/crawldb
Injector: urlDir: urls/nutch
Injector: Converting injected urls to crawl db entries.
Exception in thread "main" java.io.IOException: Input directory
/scratch/nutch-0.8.1/urls/nutch in local is invalid.
        at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:274)
        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327)
        at org.apache.nutch.crawl.Injector.inject(Injector.java:138)
        at org.apache.nutch.crawl.Crawl.main(Crawl.java:105)

Any ideas what is causing that?

regards
martin

Reply via email to