On 16 November 2011 20:27, ctjmorgan <[email protected]> wrote:

> Recently went through a similar situation.  Check out the two project below
> that I posted up to github.  Hope they help...
>
> https://github.com/ctjmorgan/nutch-mongdb-parser
> https://github.com/ctjmorgan/nutch-mongodb-indexer
>
> The first example allows you prepare a set of URLs contained in Mongdb for
> Nutch to crawl.


Not sure parser is the right name for it, sounds more like a variant of the
injector (haven't looked at code though)


> The second indexes the information from Nutch into Mongodb
> similiar in the same way the SolrIndexer works.
>

There are plans for a pluggable indexing backend so that we can send the
documents to [SOLR|ElasticSearch|...] This would allow to write expose the
MongoDB indexer as a plugin instead of piggybacking the SOLR code.

Thanks for sharing these links, it's always interesting to know what people
do with/around Nutch

Julien



>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Usage-of-nutch-tp1894986p3513843.html
> Sent from the Nutch - User mailing list archive at Nabble.com.
>



-- 
*
*Open Source Solutions for Text Engineering

http://digitalpebble.blogspot.com/
http://www.digitalpebble.com

Reply via email to