Hi,

I was struggling with the same problem two days ago. I posted it to the
nutch-dev mailing list under the following subject: "nutch-0.8-dev
*mapred.input.subdir* problem ?".
Stefan and Paul responded, but I am not sure their suggestions will solve
my problem (to be honest, I haven't had time to fully test them).

To me it seems that the problem may be related to how the
mapred.input.subdir property is set (I was looking into the code). But I
was not able to find anything about the mapred.input.subdir property on
the web.
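
One quick way to see whether any of the configuration files named in the
error actually sets this property is to scan them for it. Below is a
minimal sketch, assuming the conf files use the usual
<property><name>...</name><value>...</value></property> layout; the
function name find_property is my own and not part of Nutch.

```python
import xml.etree.ElementTree as ET


def find_property(conf_paths, prop_name):
    """Return (path, value) pairs for every file that sets prop_name.

    Hypothetical helper, not part of Nutch. It assumes each file holds
    <property> elements with <name> and <value> children.
    """
    hits = []
    for path in conf_paths:
        root = ET.parse(path).getroot()
        # Search <property> elements at any depth, so the name of the
        # root element does not matter.
        for prop in root.iter("property"):
            name = (prop.findtext("name", default="") or "").strip()
            if name == prop_name:
                value = (prop.findtext("value", default="") or "").strip()
                hits.append((path, value))
    return hits
```

For example, find_property(["conf/nutch-default.xml",
"conf/nutch-site.xml"], "mapred.input.subdir") returns an empty list if
neither file defines the property (the paths here are only illustrative).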

Now I know that I am not the only one who has this problem, so either
both you and I are doing something wrong, or there is a real problem in
the latest Nutch trunk package.

Is anybody else facing this problem?
Lukas

On 12/22/05, carmmello <[EMAIL PROTECTED]> wrote:
> I have downloaded the latest Nutch nightly version, from 2005-12-18, and
> tried to run it as usual (using the crawl method). As a result, I got
> the following error messages:
>
> "051222 175202 parsing file:/usr/nutch-nightly/conf/nutch-site.xml
> java.io.IOException: No input directories specified in: NutchConf: nutch-default.xml , mapred-default.xml , /tmp/nutch/mapred/local/localRunner/job_xom6lb.xml , nutch-site.xml
>         at org.apache.nutch.mapred.InputFormatBase.listFiles(InputFormatBase.java:85)
>         at org.apache.nutch.mapred.InputFormatBase.getSplits(InputFormatBase.java:95)
>         at org.apache.nutch.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:63)
> 051222 175203  map 0%
> Exception in thread "main" java.io.IOException: Job failed!
>         at org.apache.nutch.mapred.JobClient.runJob(JobClient.java:308)
>         at org.apache.nutch.crawl.Injector.inject(Injector.java:102)
>         at org.apache.nutch.crawl.Crawl.main(Crawl.java:101)
> [EMAIL PROTECTED] nutch-nightly]# "
>
> Also, reviewing some posts I came across the following statement:
>
> "The next version will be map reduce based in any case. So 0.7 is
> already the 'old' one and people will not continue to develop it (may
> just some maintenance releases). Map reduce doesn't mean that you need
> more than one computer or need the ndfs. It is just a technology to
> process large data sets." (Stefan Groschupf)
>
> I know that the official new release of Nutch (0.8, I think) is not
> out yet, but I think a new tutorial is needed on how to set up Nutch
> to run properly, as the tutorial on the Nutch site, it seems, cannot
> cope with the new nightly distributions or the releases that will
> follow.
>
> Thanks

