If you are using Nutch 2.x then you are actually accessing the SQL storage via Apache GORA. The SQL backend in GORA does not work and it is not advised to use it. If you want to use Nutch 2 then use a different backend like HBase or Cassandra or use Nutch 1.x
On 1 August 2013 14:32, Jayadeep Reddy <[email protected]> wrote: > No Julien Using Mysql > > > On Thu, Aug 1, 2013 at 7:00 PM, Julien Nioche < > [email protected] > > wrote: > > > What GORA backend are you using? > > > > > > On 1 August 2013 14:03, Jayadeep Reddy <[email protected]> > wrote: > > > > > I am using Nutch 2.1 every time I run crawl from dmoz directory my > > existing > > > crawled pages in the database are fetched again(Taking long time/). Is > > > there a way to crawl only new sites. > > > > > > Thank you > > > > > > -- > > > Jayadeep Reddy.S, > > > M.D & C.E.O > > > e Health Access Pvt.Ltd > > > www.ehealthaccess.com > > > Hyderabad-Chennai-Banglore > > > http://www.youtube.com/watch?v=0k5LX8mw6Sk > > > > > > > > > > > -- > > * > > *Open Source Solutions for Text Engineering > > > > http://digitalpebble.blogspot.com/ > > http://www.digitalpebble.com > > http://twitter.com/digitalpebble > > > > > > -- > Jayadeep Reddy.S, > M.D & C.E.O > e Health Access Pvt.Ltd > www.ehealthaccess.com > Hyderabad-Chennai-Banglore > http://www.youtube.com/watch?v=0k5LX8mw6Sk > -- * *Open Source Solutions for Text Engineering http://digitalpebble.blogspot.com/ http://www.digitalpebble.com http://twitter.com/digitalpebble

