Recently went through a similar situation. Check out the two project below that I posted up to github. Hope they help...
https://github.com/ctjmorgan/nutch-mongdb-parser https://github.com/ctjmorgan/nutch-mongodb-indexer The first example allows you prepare a set of URLs contained in Mongdb for Nutch to crawl. The second indexes the information from Nutch into Mongodb similiar in the same way the SolrIndexer works. -- View this message in context: http://lucene.472066.n3.nabble.com/Usage-of-nutch-tp1894986p3513843.html Sent from the Nutch - User mailing list archive at Nabble.com.

