I have checked and confirmed that the hosts I'm trying to fetch are actually accessible (both by pinging them and by loading the sites themselves). However, I still get the same error.
Any other alternatives?

On 5/14/07, Dennis Kubes <[EMAIL PROTECTED]> wrote:
For some reason the nutch process can't resolve the hosts. This could be due to incorrect setup of dns on the machine or a firewall or proxy in place. See if you can ping one of the urls (hosts) that you are trying to fetch.

Dennis Kubes

Reza Harditya wrote:
> Hi,
>
> I'm a new nutch user. Currently I'm using Nutch 0.8.1. When I wanted to
> start crawling according to the tutorial, I always get the following error:
>
> Injector: starting
> Injector: crawlDb: crawl2/crawldb
> Injector: urlDir: urls
> Injector: Converting injected urls to crawl db entries.
> Exception in thread "main" java.io.IOException: Job failed!
>     at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:357)
>     at org.apache.nutch.crawl.Injector.inject(Injector.java:138)
>     at org.apache.nutch.crawl.Crawl.main(Crawl.java:105)
> ------------------------------------------------------------------------------------------------------------
>
> From the log, I found a more detailed description which is:
>
> 2007-05-14 09:32:57,977 INFO crawl.Injector - Injector: starting
> 2007-05-14 09:32:57,978 INFO crawl.Injector - Injector: crawlDb: crawl2/crawldb
> 2007-05-14 09:32:57,978 INFO crawl.Injector - Injector: urlDir: urls
> 2007-05-14 09:32:57,978 INFO crawl.Injector - Injector: Converting injected urls to crawl db entries.
> 2007-05-14 09:32:58,908 WARN mapred.LocalJobRunner - job_lzlk81
> java.lang.RuntimeException: java.net.UnknownHostException: dhcppc0: dhcppc0
>     at org.apache.hadoop.io.SequenceFile$Writer.<init>(SequenceFile.java:76)
>     at org.apache.hadoop.io.SequenceFile$Writer.<init>(SequenceFile.java:89)
>     at org.apache.hadoop.mapred.MapTask.run(MapTask.java:77)
>     at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:91)
> Caused by: java.net.UnknownHostException: dhcppc0: dhcppc0
>     at java.net.InetAddress.getLocalHost(InetAddress.java:1308)
>     at org.apache.hadoop.io.SequenceFile$Writer.<init>(SequenceFile.java:73)
>     ... 3 more
>
> At first I suspect that the error was caused by tomcat not running properly,
> but after doing some checking I am confirmed that tomcat is indeed running.
>
> Could somebody let me know what I might be doing wrong here?
>
> Cheers,
>
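
The quoted trace fails inside java.net.InetAddress.getLocalHost with "UnknownHostException: dhcppc0". That suggests the name that cannot be resolved may be the crawling machine's own hostname, not one of the hosts being fetched. Below is a minimal sketch for checking this outside Nutch. The class name LocalHostCheck and the helper resolves are hypothetical and are not part of Nutch or Hadoop. Only the standard java.net.InetAddress calls are used.

```java
import java.net.InetAddress;
import java.net.UnknownHostException;

// Hypothetical stand-alone check; not part of Nutch or Hadoop.
public class LocalHostCheck {

    // Returns true if the given host name (or IP literal) can be resolved
    // by the JVM on this machine, false if resolution fails.
    static boolean resolves(String host) {
        try {
            InetAddress.getByName(host);
            return true;
        } catch (UnknownHostException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        try {
            // Same JDK call that appears at the bottom of the quoted trace.
            InetAddress local = InetAddress.getLocalHost();
            System.out.println("Local host resolves: " + local);
        } catch (UnknownHostException e) {
            // The machine's own name could not be mapped to an address.
            System.out.println("Local host name does not resolve: " + e.getMessage());
        }
    }
}
```

If this reports that the local name does not resolve, mapping that hostname to an address in the machine's hosts file (for example /etc/hosts on Linux) is a common remedy. This is an assumption about the setup and has not been confirmed in this thread.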
