in 0.8 you submit a _directory_ containing urls.txt not the file itself.

so remove /urls.txt part from your commandline and it should go fine.

--
Sami Siren

BDalton wrote:

I get this error,

bin/nutch crawl url.txt -dir newcrawled -depth 2 >& crawl.log

Exception in thread "main" java.io.IOException: Input directory
d:/nutch3/urls/urls.txt in local is invalid.
        at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:274)
        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327)
        at org.apache.nutch.crawl.Injector.inject(Injector.java:138)
        at org.apache.nutch.crawl.Crawl.main(Crawl.java:105)


Reply via email to