Hi Folks, A heads up that I sent in a proposal to Apache Big Data on the above topic. Very pleased that it was accepted and I hope to be in Vancouver to share experiences. The is based on my ongoing work on https://issues.apache.org/jira/browse/NUTCH-2005 I would like to share slides and get feedback closer to the event prior to me submitting the slides so I will update this thread nearer the time. Thanks for now. Lewis
-- *Lewis*