Forgot to mention I was doing some URL injection bin/nutch inject crawldb urls
Cheers
On 7/1/06, Zaheed Haque <[EMAIL PROTECTED]> wrote:
Hi:
Everything was working good with hadoop 3.2, but now after upgrading
to hadoop-0.4 I am getting the following error
2006-07-01 11:12:44,989 INFO conf.Configuration
(Configuration.java:loadResource(397)) - parsing
jar:file:/usr/local/java/nutch-0.8-dev/lib/hadoop-0.4.0.jar!/hadoop-default.xml
2006-07-01 11:12:45,006 INFO conf.Configuration
(Configuration.java:loadResource(397)) - parsing
file:/usr/local/java/nutch-0.8-dev/conf/nutch-default.xml
2006-07-01 11:12:45,040 INFO conf.Configuration
(Configuration.java:loadResource(397)) - parsing
jar:file:/usr/local/java/nutch-0.8-dev/lib/hadoop-0.4.0.jar!/mapred-default.xml
2006-07-01 11:12:45,058 INFO conf.Configuration
(Configuration.java:loadResource(397)) - parsing
jar:file:/usr/local/java/nutch-0.8-dev/lib/hadoop-0.4.0.jar!/mapred-default.xml
2006-07-01 11:12:45,120 INFO conf.Configuration
(Configuration.java:loadResource(397)) - parsing
file:/usr/local/java/nutch-0.8-dev/conf/hadoop-site.xml
20
2006-07-01 11:12:46,379 ERROR mapred.JobClient
(JobClient.java:submitJob(273)) - Input directory
/usr/local/java/nutch-0.8-dev/crawldb/current in local is invalid.
Exception in thread "main" java.io.IOException: Input directory
/usr/local/java/nutch-0.8-dev/crawldb/current in local is invalid.
at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:274)
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327)
at org.apache.nutch.crawl.Injector.inject(Injector.java:146)
at org.apache.nutch.crawl.Injector.main(Injector.java:164)
I am wondering if this is a known fact or do I need to do something
with my configuration?
Thanks
Zaheed