Hi Lewis,

Thanks for the reply. I am only using a single URL: http://nutch.apache.org/
This sits in urls/seed.txt under NUTCH_HOME (not the runtime folder, but the 
top-level folder created by unzipping the download).
As far as crawling is concerned, I am merely following the wiki tutorials. I 
tried inject as well, with the same result. Can you suggest another 
approach?
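
One way to rule out the seed file itself is to run every line of it through 
java.net.URL and see which lines, if any, are rejected with a 
MalformedURLException (a blank line or a missing "http://" is enough to 
trigger one). The sketch below is only an illustration: the class name 
SeedCheck and the method malformedLines are made up for this example, and it 
is an assumption that this check resembles what Nutch or Gora does at inject 
time.

```java
import java.net.MalformedURLException;
import java.net.URL;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Nutch: reports seed lines that
// java.net.URL cannot parse.
public class SeedCheck {

    // Returns the 1-based numbers of the lines rejected by java.net.URL.
    // Blank lines are reported too, since "" has no protocol.
    public static List<Integer> malformedLines(List<String> lines) {
        List<Integer> bad = new ArrayList<>();
        for (int i = 0; i < lines.size(); i++) {
            try {
                new URL(lines.get(i));
            } catch (MalformedURLException e) {
                bad.add(i + 1);
            }
        }
        return bad;
    }

    public static void main(String[] args) throws Exception {
        // args[0] is the path to the seed file, e.g. urls/seed.txt
        List<String> lines = Files.readAllLines(Paths.get(args[0]));
        List<Integer> bad = malformedLines(lines);
        if (bad.isEmpty()) {
            System.out.println("All " + lines.size() + " line(s) parsed as URLs.");
        } else {
            System.out.println("Lines rejected by java.net.URL: " + bad);
        }
    }
}
```

Running it as "java SeedCheck urls/seed.txt" would list the offending line 
numbers. If nothing is reported, the seed file is probably not the cause and 
the problem lies elsewhere.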

Thanks,

Mourad


On 23 Oct 2012, at 14:32, Lewis John Mcgibbney <[email protected]> 
wrote:

> Hi,
> 
> On Tue, Oct 23, 2012 at 11:53 AM, Mouradk <[email protected]> wrote:
>> I uploaded Nutch 2.1 and tried to get it started but no luck so far. I am 
>> running it in local mode with HBase 0.90.6.
> 
> HBase compatibility should be fine. In all honesty we *should*
> probably upgrade to one of the newer distributions but this has still
> to happen...
> 
>> Exception in thread "main" org.apache.gora.util.GoraException: 
>> java.lang.RuntimeException: java.net.MalformedURLException
> 
> Can you post your URL list for inject?
> 
>> I googled around but cannot find an answer. Is there something wrong with my 
>> conf or the URLs? I get the same error whether I try to inject or crawl 
>> with the following commands:
>> 
>> ./runtime/local/bin/nutch crawl urls -dir crawl -depth 3 -topN 5
> 
> Please don't use the crawl command with 2.1; it is not recommended.
> 
> Lewis
