On 16 November 2011 20:27, ctjmorgan <[email protected]> wrote:

> Recently went through a similar situation. Check out the two projects below
> that I posted up to GitHub. Hope they help...
>
> https://github.com/ctjmorgan/nutch-mongdb-parser
> https://github.com/ctjmorgan/nutch-mongodb-indexer
>
> The first example allows you to prepare a set of URLs contained in MongoDB for
> Nutch to crawl.

Not sure "parser" is the right name for it; it sounds more like a variant of the injector (I haven't looked at the code, though).

> The second indexes the information from Nutch into MongoDB,
> similar to the way the SolrIndexer works.
>

There are plans for a pluggable indexing backend so that we can send the documents to [SOLR|ElasticSearch|...]. This would allow us to expose the MongoDB indexer as a plugin instead of piggybacking on the SOLR code.

Thanks for sharing these links, it's always interesting to know what people do with/around Nutch.

Julien

> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Usage-of-nutch-tp1894986p3513843.html
> Sent from the Nutch - User mailing list archive at Nabble.com.
>

--
Open Source Solutions for Text Engineering

http://digitalpebble.blogspot.com/
http://www.digitalpebble.com

