Tutorial : Index the web with AWS CloudSearch

Julien Nioche Wed, 23 Sep 2015 02:27:14 -0700

Hi everyone,

Just to let you know that we've published a new tutorial on how to use
Nutch (and StormCrawler) to crawl and index documents into AWS CloudSearch.


This is related to the recent addition of NUTCH-1517
<https://issues.apache.org/jira/browse/NUTCH-1517> in the trunk codebase.
The tutorial is aimed at beginners and gives step-by-step instructions on
how to use Nutch, including in distributed mode. It should also be relevant
for more advanced users as it provides an introduction to CloudSearch and a
comparison with StormCrawler.

The tutorial is on
http://digitalpebble.blogspot.co.uk/2015/09/index-web-with-aws-cloudsearch.html

Please retweet the announcement if you use Twitter:
https://twitter.com/digitalpebble/status/646614555192336384

I hope you find it useful.

Julien

-- 

Open Source Solutions for Text Engineering

http://digitalpebble.blogspot.com/
http://www.digitalpebble.com
http://twitter.com/digitalpebble
