It seems this is what I have been proposing, I still have to check the code base https://github.com/USCDataScience/sparkler
Thanks, Vicky -- View this message in context: http://lucene.472066.n3.nabble.com/Nutch-and-workflow-for-scaling-tp4317955p4318171.html Sent from the Nutch - User mailing list archive at Nabble.com.

