Dear All, I am an new user of nutch 1.9. I have successfully deployed nutch on my local machine. And now I want to deploy it on my amazon emr cluster.
Now, as there is no crawl class available in nutch 1.9 . SO I have to figure out a way to run a crawl script on emr cluster. Can anyone please guide me in this regard, who have tested nutch 1.9 on amazon emr. Happy Holidays ! Thanks Regards Ad

