Hi everyone, Just to let you know that we've just published a new tutorial on how to use Nutch (and StormCrawler) to crawl and index documents into AWS CloudSearch.
This is related to the recent addition of NUTCH-1517 <https://issues.apache.org/jira/browse/NUTCH-1517> in the trunk codebase. The tutorial is aimed at beginners and gives step by step instructions on how to use Nutch, including in distributed mode. It should also be relevant for more advanced users as it provides an introduction to CloudSearch and a comparison with StormCrawler. The tutorial is on http://digitalpebble.blogspot.co.uk/2015/09/index-web-with-aws-cloudsearch.html Please retweet the announcement if you use Twitter [ https://twitter.com/digitalpebble/status/646614555192336384]. I hope you find it useful Julien -- Open Source Solutions for Text Engineering http://digitalpebble.blogspot.com/ http://www.digitalpebble.com http://twitter.com/digitalpebble

