Re: How to do nutch inject?

Susam Pal Tue, 19 Feb 2008 20:52:08 -0800

You should also find a logs/hadoop.log file. Does it give any clue?

Maybe, instead of trying to inject dmoz, you can try injecting a small
set of 4 to 10 URLs written in a file, then check the hadoop.log file
to find out what is going wrong.
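
Something along these lines might do as a quick test. It is only a
sketch: the directory name urls, the file name seed.txt, the test
crawldb path crawl/crawldb-test and the seed URLs themselves are
placeholders I made up, so substitute your own. It assumes you run it
from the top of your Nutch installation; the inject step is skipped if
bin/nutch is not found there.

```shell
# Create a small seed directory holding a handful of URLs.
# (Directory, file name and URLs are placeholders, not from the tutorial.)
mkdir -p urls
cat > urls/seed.txt <<'EOF'
http://www.apache.org/
http://lucene.apache.org/
http://hadoop.apache.org/
http://wiki.apache.org/
EOF

# Inject the small set into a separate test crawldb, then look at the
# end of the Hadoop log, where the underlying cause of a failed job
# is usually reported.
if [ -x bin/nutch ]; then
  bin/nutch inject crawl/crawldb-test urls
  tail -n 50 logs/hadoop.log
fi
```

If this small inject works, the problem is more likely in the dmoz
URL directory than in the setup itself.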


Regards,
Susam Pal

On 2/20/08, Nick Duan <[EMAIL PROTECTED]> wrote:
> This is a newbie question.  Please forgive me if this is already
> answered somewhere.
>
> I am trying to follow the nutch 0.8 version tutorial to run nutch
> crawler over the web.  I tried to bootstrap the crawldb by injecting the
> urls obtained from dmoz using the command:
>
> bin/nutch inject crawl/crawldb dmoz
>
> The following exception occurred:
>
> Injector: starting
> Injector: crawlDb: crawl/crawldb
> Injector: urlDir: ../devel/dmoz
> Injector: Converting injected urls to crawl db entries.
> Injector: java.io.IOException: Job failed!
>         at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:604)
>         at org.apache.nutch.crawl.Injector.inject(Injector.java:162)
>         at org.apache.nutch.crawl.Injector.run(Injector.java:192)
>         at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
>         at org.apache.nutch.crawl.Injector.main(Injector.java:182)
>
> The exception didn't offer much detail.  Any help is highly
> appreciated.
>
> ND
>
