is urls/nutch a file or directory? On 6/6/07, Martin Kammerlander <[EMAIL PROTECTED]> wrote:
HiI wanted to start a crawl like it is done in the nutch 0.8.x tutorial. Unfortunately I get the following error: [EMAIL PROTECTED] nutch-0.8.1]$ bin/nutch crawl urls/nutch -dir crawl.test -depth 10 crawl started in: crawl.test rootUrlDir = urls/nutch threads = 10 depth = 10 Injector: starting Injector: crawlDb: crawl.test/crawldb Injector: urlDir: urls/nutch Injector: Converting injected urls to crawl db entries. Exception in thread "main" java.io.IOException: Input directory /scratch/nutch-0.8.1/urls/nutch in local is invalid. at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:274) at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327) at org.apache.nutch.crawl.Injector.inject(Injector.java:138) at org.apache.nutch.crawl.Crawl.main(Crawl.java:105) Any ideas what is causing that? regards martin
-- "Conscious decisions by conscious minds are what make reality real"
