Julien - whatever you are saying about Nutch 2.x and SQL - does it apply for the recent release 2.2.1 as well?
On Thu, Aug 1, 2013 at 9:38 AM, Julien Nioche <[email protected] > wrote: > If you are using Nutch 2.x then you are actually accessing the SQL storage > via Apache GORA. The SQL backend in GORA does not work and it is not > advised to use it. If you want to use Nutch 2 then use a different backend > like HBase or Cassandra or use Nutch 1.x > > On 1 August 2013 14:32, Jayadeep Reddy <[email protected]> wrote: > > > No Julien Using Mysql > > > > > > On Thu, Aug 1, 2013 at 7:00 PM, Julien Nioche < > > [email protected] > > > wrote: > > > > > What GORA backend are you using? > > > > > > > > > On 1 August 2013 14:03, Jayadeep Reddy <[email protected]> > > wrote: > > > > > > > I am using Nutch 2.1 every time I run crawl from dmoz directory my > > > existing > > > > crawled pages in the database are fetched again(Taking long time/). > Is > > > > there a way to crawl only new sites. > > > > > > > > Thank you > > > > > > > > -- > > > > Jayadeep Reddy.S, > > > > M.D & C.E.O > > > > e Health Access Pvt.Ltd > > > > www.ehealthaccess.com > > > > Hyderabad-Chennai-Banglore > > > > http://www.youtube.com/watch?v=0k5LX8mw6Sk > > > > > > > > > > > > > > > > -- > > > * > > > *Open Source Solutions for Text Engineering > > > > > > http://digitalpebble.blogspot.com/ > > > http://www.digitalpebble.com > > > http://twitter.com/digitalpebble > > > > > > > > > > > -- > > Jayadeep Reddy.S, > > M.D & C.E.O > > e Health Access Pvt.Ltd > > www.ehealthaccess.com > > Hyderabad-Chennai-Banglore > > http://www.youtube.com/watch?v=0k5LX8mw6Sk > > > > > > -- > * > *Open Source Solutions for Text Engineering > > http://digitalpebble.blogspot.com/ > http://www.digitalpebble.com > http://twitter.com/digitalpebble >

