On 2010-10-26 22:39, Scott Gonyea wrote: > I love relational databases, but their many features are (in my > opinion) wasted on what you find in Nutch. Row-locking and > transactional integrity is great for lots of applications, but becomes > a whole lot of overhead when it's of next-to-no-value to whatever > you're doing. > > RE: counting URLs: Have you looked at Solr's facets, etc? I use them > like they're going out of style--and it's very powerful. > > For my application, Solr *is* my database. Nutch crawls data, stores
.. then you may be interested in the upcoming Gora feature: http://issues.apache.org/jira/browse/GORA-9 . When this is committed you will be able to keep all your data in Solr. -- Best regards, Andrzej Bialecki <>< ___. ___ ___ ___ _ _ __________________________________ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com

