After creating the directories crawldb and current by hand I could perform an injection. Is this a bug should I file a JIRA issue?
Zaheed On 7/1/06, Zaheed Haque <[EMAIL PROTECTED]> wrote:
Forgot to mention I was doing some URL injection bin/nutch inject crawldb urls Cheers On 7/1/06, Zaheed Haque <[EMAIL PROTECTED]> wrote: > Hi: > > Everything was working good with hadoop 3.2, but now after upgrading > to hadoop-0.4 I am getting the following error > > 2006-07-01 11:12:44,989 INFO conf.Configuration > (Configuration.java:loadResource(397)) - parsing > jar:file:/usr/local/java/nutch-0.8-dev/lib/hadoop-0.4.0.jar!/hadoop-default.xml > 2006-07-01 11:12:45,006 INFO conf.Configuration > (Configuration.java:loadResource(397)) - parsing > file:/usr/local/java/nutch-0.8-dev/conf/nutch-default.xml > 2006-07-01 11:12:45,040 INFO conf.Configuration > (Configuration.java:loadResource(397)) - parsing > jar:file:/usr/local/java/nutch-0.8-dev/lib/hadoop-0.4.0.jar!/mapred-default.xml > 2006-07-01 11:12:45,058 INFO conf.Configuration > (Configuration.java:loadResource(397)) - parsing > jar:file:/usr/local/java/nutch-0.8-dev/lib/hadoop-0.4.0.jar!/mapred-default.xml > 2006-07-01 11:12:45,120 INFO conf.Configuration > (Configuration.java:loadResource(397)) - parsing > file:/usr/local/java/nutch-0.8-dev/conf/hadoop-site.xml > 20 > 2006-07-01 11:12:46,379 ERROR mapred.JobClient > (JobClient.java:submitJob(273)) - Input directory > /usr/local/java/nutch-0.8-dev/crawldb/current in local is invalid. > Exception in thread "main" java.io.IOException: Input directory > /usr/local/java/nutch-0.8-dev/crawldb/current in local is invalid. > at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:274) > at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327) > at org.apache.nutch.crawl.Injector.inject(Injector.java:146) > at org.apache.nutch.crawl.Injector.main(Injector.java:164) > > I am wondering if this is a known fact or do I need to do something > with my configuration? > > Thanks > Zaheed >
