Hi, I ran into the same problem two days ago. I posted it to the nutch-dev mailing list under the following subject: "nutch-0.8-dev *mapred.input.subdir* problem ?". Stefan and Paul responded, but I am not sure their suggestions will solve my problem (to be honest, I have not had time to test them fully).
From looking into the code, it seems to me that the problem may be related to how the mapred.input.subdir property is set. However, I was not able to find anything about the mapred.input.subdir property on the web. Now I know that I am not the only one with this problem, so either both of us are doing something wrong or there is a real problem in the latest Nutch trunk package.

Is anybody else facing this problem?

Lukas

On 12/22/05, carmmello <[EMAIL PROTECTED]> wrote:
> I have downloaded the latest Nutch nightly version, from 2005-12-18, and
> tried to run it, as usual (using the crawl method). As a result, I got
> the following error messages:
>
> "051222 175202 parsing file:/usr/nutch-nightly/conf/nutch-site.xml
> java.io.IOException: No input directories specified in: NutchConf: nutch-default.xml , mapred-default.xml , /tmp/nutch/mapred/local/localRunner/job_xom6lb.xml , nutch-site.xml
>         at org.apache.nutch.mapred.InputFormatBase.listFiles(InputFormatBase.java:85)
>         at org.apache.nutch.mapred.InputFormatBase.getSplits(InputFormatBase.java:95)
>         at org.apache.nutch.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:63)
> 051222 175203 map 0%
> Exception in thread "main" java.io.IOException: Job failed!
>         at org.apache.nutch.mapred.JobClient.runJob(JobClient.java:308)
>         at org.apache.nutch.crawl.Injector.inject(Injector.java:102)
>         at org.apache.nutch.crawl.Crawl.main(Crawl.java:101)
> [EMAIL PROTECTED] nutch-nightly]# "
>
> Also, reviewing some posts, I came across the following statement:
>
> "The next version will be map reduce based in any case. So 0.7 is
> already the 'old' one and people will not continue to develop it (may
> just some maintenance releases). Map reduce doesn't mean that you need
> more than one computer or need the ndfs. It is just a technology to
> process large data sets." (Stefan Groschupf)
>
> I know that the official new release of Nutch (0.8, I think) is not
> released yet, but I think a new tutorial is needed on how to set up
> Nutch to run properly, as the tutorial on the Nutch site, it seems,
> cannot cope with the new nightly distributions and the new ones that
> will be released.
>
> Thanks
