There are lots of factors to consider, so one can't give a good general answer, but:
Nutch already uses HBase (trunk), so that's +1 for HBase. HBase makes it easy to scale and has built-in replication thanks to being built on top of HDFS. Otis ---- Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ ----- Original Message ---- > From: "McGibbney, Lewis John" <[email protected]> > To: "[email protected]" <[email protected]> > Sent: Fri, January 14, 2011 8:00:50 AM > Subject: Database data storage question > > Hello List, > > I am gathering information on the above topic as I intend to integrate a >database to store fetched data. I would like community input of any >experiences >using different database implementations before doing so. E.g. comparison >between HBase & MySQL etc. > > Thank you > > Lewis > > > Glasgow Caledonian University is a registered Scottish charity, number >SC021474 > > Winner: Times Higher Education's Widening Participation Initiative of the > Year >2009 and Herald Society's Education Initiative of the Year 2009 >http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,6219,en.html >l >

