Thanks Tejas! That was exactly what I was looking for.
On Friday, August 2, 2013, Tejas Patil <[email protected]> wrote: > Nutch 2.1 officially had support for MySQL as datastore. There were lot of > issues reported with MySQL and so in the newer version ie. 2.2.X, the MySQL > support is removed. I would recommend using HBase as its the most stable > backend amongst all supported ones. > > > On Thu, Aug 1, 2013 at 7:01 AM, Jayadeep Reddy > <[email protected]>wrote: > >> Thank you Julien, >> Will get hbase and try to crawl. >> >> >> On Thu, Aug 1, 2013 at 7:10 PM, A Laxmi <[email protected]> wrote: >> >> > Julien - whatever you are saying about Nutch 2.x and SQL - does it apply >> > for the recent release 2.2.1 as well? >> > >> > >> > On Thu, Aug 1, 2013 at 9:38 AM, Julien Nioche < >> > [email protected] >> > > wrote: >> > >> > > If you are using Nutch 2.x then you are actually accessing the SQL >> > storage >> > > via Apache GORA. The SQL backend in GORA does not work and it is not >> > > advised to use it. If you want to use Nutch 2 then use a different >> > backend >> > > like HBase or Cassandra or use Nutch 1.x >> > > >> > > On 1 August 2013 14:32, Jayadeep Reddy <[email protected]> >> > wrote: >> > > >> > > > No Julien Using Mysql >> > > > >> > > > >> > > > On Thu, Aug 1, 2013 at 7:00 PM, Julien Nioche < >> > > > [email protected] >> > > > > wrote: >> > > > >> > > > > What GORA backend are you using? >> > > > > >> > > > > >> > > > > On 1 August 2013 14:03, Jayadeep Reddy < [email protected] >> > >> > > > wrote: >> > > > > >> > > > > > I am using Nutch 2.1 every time I run crawl from dmoz directory >> my >> > > > > existing >> > > > > > crawled pages in the database are fetched again(Taking long >> time/). >> > > Is >> > > > > > there a way to crawl only new sites. >> > > > > > >> > > > > > Thank you >> > > > > > >> > > > > > -- >> > > > > > Jayadeep Reddy.S, >> > > > > > M.D & C.E.O >> > > > > > e Health Access Pvt.Ltd >> > > > > > www.ehealthaccess.com >> > > > > > Hyderabad-Chennai-Banglore >> > > > > > e Health Access Medical kiosk >> > > > > > >> > > > > >> > > > > >> > > > > >> > > > > -- >> > > > > * >> > > > > *Open Source Solutions for Text Engineering >> > > > > >> > > > > http://digitalpebble.blogspot.com/ >> > > > > http://www.digitalpebble.com >> > > > > http://twitter.com/digitalpebble >> > > > > >> > > > >> > > > >> > > > >> > > > -- >> > > > Jayadeep Reddy.S, >> > > > M.D & C.E.O >> > > > e Health Access Pvt.Ltd >> > > > www.ehealthaccess.com >> > > > Hyderabad-Chennai-Banglore >> > > > e Health Access Medical kiosk >> > > > >> > > >> > > >> > > >> > > -- >> > > * >> > > *Open Source Solutions for Text Engineering >> > > >> > > http://digitalpebble.blogspot.com/ >> > > http://www.digitalpebble.com >> > > http://twitter.com/digitalpebble >> > > >> > >> >> >> >> -- >> Jayadeep Reddy.S, >> M.D & C.E.O >> e Health Access Pvt.Ltd >

