Hi Guys,

Can anyone please help me out with the following, which I got back from Roman on the BigTop team? If we strike while the iron is hot with this one, it would be great to see just what we can flag up regarding what BigTop is capable of :0)
I am happy to finally get round to updating our Hadoop tutorial, but if anyone could pass on any comments regarding Roman's comments I will work diligently to get them voiced and hopefully implemented.

Thanks

---------- Forwarded message ----------
From: Roman Shaposhnik <[email protected]>
Date: Tue, Nov 15, 2011 at 9:03 PM
Subject: Re: BigTop
To: lewis john mcgibbney <[email protected]>

Hi Lewis!

Perfect timing ;-) I've played with Nutch a little bit and now I need your help. Basically, I need some kind of a tutorial that would allow me to, let's say, crawl the intranet here at Cloudera using a fully distributed Hadoop cluster. I've been trying to follow this one along:

http://wiki.apache.org/nutch/NutchHadoopTutorial

But it seems a little outdated. So.. any help in setting things up will be greatly appreciated.

Also, if you have unit tests in Nutch running Nutch on top of MiniMR -- I'd love to see them.

On Tue, Nov 15, 2011 at 12:52 PM, lewis john mcgibbney <[email protected]> wrote:
> I understand that your workload is very heavy and would like to remind you
> that any work we can get done with Nutch/BigTop is great, but only when you
> have the time to have a look and when it suits you. I hope that many people
> were interested in the hard work you have obviously been putting into BigTop
> and I am excited to see it progress through the incubator.
>
> All the best for now, please tell me when I can begin working on getting
> Nutch & BigTop working.

Thanks a lot for the kind words! It means a lot!

Thanks,
Roman.

--
*Lewis*

