Hi,

I have been struggling the same problem two days ago. I posted problem
to nutch-dev maillist under the following sublect: "nutch-0.8-dev
*mapred.input.subdir* problem ?".
Stefan and Paul responded by I am not sure this will solve my problem
(truly I didn have a time to fully test their suggestions).

To me it seems that the problem can be related to setting the
mapred.input.subdir property (I was looking into code). But I was not
able to find anything about mapred.input.subdir property on web.

Now I know that I am not alone who have this problem so either both I
and you are doing something wrong or there is a real problem in the
lates nutch trunk package.

Anybody else is facing this problem?
Lukas

On 12/22/05, carmmello <[EMAIL PROTECTED]> wrote:
> I have donwloaded the last Nutch nightly version, from 2005-12-18, and
> tried to run it,  as usual (using the crawl method).  As a result, I got
> the following error messages:
>
> "051222 175202 parsing file:/usr/nutch-nightly/conf/nutch-site.xml
> java.io.IOException: No input directories specified in: NutchConf:
> nutch-default.xml , mapred-
> default.xml , /tmp/nutch/mapred/local/localRunner/job_xom6lb.xml ,
> nutch-site.xml
>         at org.apache.nutch.mapred.InputFormatBase.listFiles
> (InputFormatBase.java:85)
>         at org.apache.nutch.mapred.InputFormatBase.getSplits
> (InputFormatBase.java:95)
>         at org.apache.nutch.mapred.LocalJobRunner$Job.run
> (LocalJobRunner.java:63)
> 051222 175203  map 0%
> Exception in thread "main" java.io.IOException: Job failed!
>         at org.apache.nutch.mapred.JobClient.runJob(JobClient.java:308)
>         at org.apache.nutch.crawl.Injector.inject(Injector.java:102)
>         at org.apache.nutch.crawl.Crawl.main(Crawl.java:101)
> [EMAIL PROTECTED] nutch-nightly]# "
>
> Also, reviewing some posts I came across the following statement:
>
> "The next version will be map reduce based in any case. So 0.7 is
> already the 'old' one and people will not continue to develop it (may
> just some maintenance releases).Map reduce doesn't mean that you need
> more than one computer or need the ndfs. It is just a technology to
> process large data sets." (Stefan Groschupf)
>
> I know that the official new release of Nutch (0.8, I think) is not
> released yet, but, I think,  a new tutorial is needed on how to set up
> Nutch to run properly, as the  tutorial on the Nutch site,it seems,  can
> not cope with the new nigthly distributions and the new ones that will
> be released.
>
> Thanks
>
>
>
>
>
>
>

Reply via email to