Hi Lewis, Thanks for the reply. I am only using a single url: http://nutch.apache.org/ This sits in a urls/seed.txt in NUTCH_HOME (not runtime folder but the home folder generated after unzipping). As far as crawling is concerned, I am merely following the wiki tutorials. I tried inject as well and with the same results. Can you suggest another approach?
Thanks, Mourad On 23 Oct 2012, at 14:32, Lewis John Mcgibbney <[email protected]> wrote: > Hi, > > On Tue, Oct 23, 2012 at 11:53 AM, Mouradk <[email protected]> wrote: >> I uploaded Nutch 2.1 and tried to get it started but no luck so far. I am >> running it on local with Hbase 0.90.6. > > HBase compatibility should be fine. In all honesty we *should* > probably upgrade to one of the newer distributions but this has still > to happen... > >> Exception in thread "main" org.apache.gora.util.GoraException: >> java.lang.RuntimeException: java.net.MalformedURLException > > Can you post your URL list for inject? > >> I googled around but cannot find an answer, id there some thing with with my >> conf? or the urls ? I get the same error whether I try to inject or crawl >> with the following commands: >> >> ./runtime/local/bin/nutch crawl urls -dir crawl -depth 3 -topN 5 > > Please don't use the crawl command with 2.1 it is not recommended. > > Lewis

