Re: Giraph as Whirr service, see WHIRR-530

2012-04-05 Thread Paolo Castagna
Thank you all for your comments. There seems to be some interest and certainly agreement on just "for testing"/"temporary" and the limits on cloud infrastructure in relation to things as Hadoop, ZooKeeper and Giraph. I also agree that, given Whirr can already spin Hadoop clusters, user can run Gi

Re: Giraph as Whirr service, see WHIRR-530

2012-04-05 Thread Dan McClary
Having used Whirr several times in EC2, it seems like a fine way to spin up a temporary 'developers' cluster. Zookeeper is the most likely source of difficulty on VMs with limited I/O (i.e., it's very chatty and doesn't tolerate the highly variable latency that smaller AMIs provide). The HBase co

Re: Giraph as Whirr service, see WHIRR-530

2012-04-04 Thread Brian Femiano
I've used it on clusters I started on EC2 launched by Whirr. Simply copy the fat jar to your client machine and it will distribute normally as a M/R dependency. It works very well. The only limitation I could potentially find (without much proof) was on VMs with limited IO the RPC message overhea

Re: Giraph as Whirr service, see WHIRR-530

2012-04-04 Thread Jakob Homan
This is interesting. Whirr can already spin up Hadoop MR clusters, which can then run the Giraph jobs. Once Giraph is bootstrapped onto YARN, this will make more sense as a Whirr service. On Wed, Apr 4, 2012 at 9:43 PM, Avery Ching wrote: > I don't use Whirr...I haven't heard it mentioned on th

Re: Giraph as Whirr service, see WHIRR-530

2012-04-04 Thread Avery Ching
I don't use Whirr...I haven't heard it mentioned on this forum yet. Anyone? Avery On 4/4/12 9:30 PM, Paolo Castagna wrote: Hi, seen this? WHIRR-530 - Add Giraph as a service https://issues.apache.org/jira/browse/WHIRR-530 This could be quite useful for users who want to give Giraph a s

Giraph as Whirr service, see WHIRR-530

2012-04-04 Thread Paolo Castagna
Hi, seen this? WHIRR-530 - Add Giraph as a service https://issues.apache.org/jira/browse/WHIRR-530 This could be quite useful for users who want to give Giraph a spin on cloud infrastructure, just for testing or to run a few small experiments. My experience with Whirr an small 10-20 nodes clu