Re: side by side versions of Nutch

2010-10-11 Thread Markus Jelsma
I don't know about Eclipse, but you can safely run different versions on the same machine. Depending on what changed between versions (perhaps an incompatible change in the CrawlDB or whatever), I'd not share the generated data and config. On Friday, October 08, 2010 07:06:22 pm MilleBii wrote:

Re: Can't find org.gora.sql.store.SqlStore

2010-10-11 Thread Markus Jelsma
Hi, I haven't gotten around to setting up Nutch in an IDE yet. The exception is thrown at runtime for jobs that somehow depend on SqlStore. Running classes such as the ParserChecker are not affected, as expected. The steps: git clone git://github.com/enis/gora.git cd gora ant cd .. svn export ht

Re: Can't find org.gora.sql.store.SqlStore

2010-10-11 Thread Mattmann, Chris A (388J)
Hi Markus, Where are you seeing this ClassNotFoundException? When you look at it in an IDE (e.g., Eclipse), or at runtime? Or building using Ant/Ivy? It seems like it built OK, so just trying to figure out how you are running Nutch. Cheers, Chris On 10/11/10 4:24 AM, "Markus Jelsma" wrote:

Re: Crawl speed control and HTTP Post

2010-10-11 Thread Markus Jelsma
Yes, pay attention to these settings in your configuration: fetcher.server.delay fetcher.threads.fetch fetcher.threads.per.host On Sunday, October 10, 2010 07:37:08 am zouzhile wrote: > Hi all, > I am new to Nutch, and have two questions that I couldn't find the answer > via the web pages and co

Re: Can't find org.gora.sql.store.SqlStore

2010-10-11 Thread Markus Jelsma
It seems so. And it reports that it has published some Gora jars in my ~/.ivy2/local dirs. Then I built Nutch and it reported no issues and deployed to the runtime dir. Do some settings need to be changed in Nutch's ivysettings? On Thursday, October 07, 2010 04:16:13 pm Mattmann, Chris A (388J) wrote: >

HTTP Scheme problem

2010-10-11 Thread matinte
Hi, I am trying to index some XML files at URLs like: http::/.xml. But I am having some problems with this issue. The first time I tried, the error I found was: java.io.IOException: No FileSystem for scheme: http Then, I added to the hadoop-default.xml file the tag for processing http URIs: fs.http