Hi, I was struggling with the same problem two days ago. I posted it to the nutch-dev mailing list under the following subject: "nutch-0.8-dev *mapred.input.subdir* problem ?". Stefan and Paul responded, but I am not sure their suggestions will solve my problem (to be honest, I have not had time to fully test them).
To me it seems that the problem may be related to how the mapred.input.subdir property is set (I was looking into the code), but I was not able to find anything about this property on the web. Now I know that I am not the only one who has this problem, so either both you and I are doing something wrong or there is a real problem in the latest nutch trunk package. Is anybody else facing this problem?

Lukas

On 12/22/05, carmmello <[EMAIL PROTECTED]> wrote:
> I have downloaded the last Nutch nightly version, from 2005-12-18, and
> tried to run it, as usual (using the crawl method). As a result, I got
> the following error messages:
>
> "051222 175202 parsing file:/usr/nutch-nightly/conf/nutch-site.xml
> java.io.IOException: No input directories specified in: NutchConf:
> nutch-default.xml , mapred-default.xml ,
> /tmp/nutch/mapred/local/localRunner/job_xom6lb.xml , nutch-site.xml
> at org.apache.nutch.mapred.InputFormatBase.listFiles(InputFormatBase.java:85)
> at org.apache.nutch.mapred.InputFormatBase.getSplits(InputFormatBase.java:95)
> at org.apache.nutch.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:63)
> 051222 175203 map 0%
> Exception in thread "main" java.io.IOException: Job failed!
> at org.apache.nutch.mapred.JobClient.runJob(JobClient.java:308)
> at org.apache.nutch.crawl.Injector.inject(Injector.java:102)
> at org.apache.nutch.crawl.Crawl.main(Crawl.java:101)
> [EMAIL PROTECTED] nutch-nightly]# "
>
> Also, reviewing some posts I came across the following statement:
>
> "The next version will be map reduce based in any case. So 0.7 is
> already the 'old' one and people will not continue to develop it (may
> just some maintenance releases). Map reduce doesn't mean that you need
> more than one computer or need the ndfs. It is just a technology to
> process large data sets."
> (Stefan Groschupf)
>
> I know that the official new release of Nutch (0.8, I think) is not
> released yet, but I think a new tutorial is needed on how to set up
> Nutch to run properly, as the tutorial on the Nutch site, it seems,
> cannot cope with the new nightly distributions and the new ones that
> will be released.
>
> Thanks
