Thanks for the reply.
Now I want to inject the URLs into Nutch, but I get this error:


 [local ]$ ./bin/nutch inject urls
InjectorJob: starting at 2013-06-05 11:58:37
InjectorJob: Injecting urlDir: urls
InjectorJob: Using class org.apache.gora.hbase.store.HBaseStore as the Gora storage class.
InjectorJob: java.lang.RuntimeException: job failed: name=inject urls, jobid=job_local_0001
at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:54)
at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:233)
at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:251)
at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:273)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:282)

urls is a directory, and it contains a seed file with a single site:
 [local ]$ cat urls/seed.txt 
http://nutch.apache.org/


Thanks
hechuan


At 2013-06-05 11:43:38,"Lewis John Mcgibbney" <[email protected]> wrote:
>Hi,
>On Tue, Jun 4, 2013 at 7:42 PM, RS <[email protected]> wrote:
>
>>
>> InjectorJob: total number of urls injected after normalization and
>> filtering: 0
>>
>
>Nothing is injected here.
>Please review your URL filters and try again.
>Lewis
