Thank you all for your comments.
There seems to be some interest and certainly agreement on just
"for testing"/"temporary" and the limits on cloud infrastructure
in relation to things as Hadoop, ZooKeeper and Giraph.
I also agree that, given Whirr can already spin Hadoop clusters,
user can run Gi
Having used Whirr several times in EC2, it seems like a fine way to spin up
a temporary 'developers' cluster. Zookeeper is the most likely source of
difficulty on VMs with limited I/O (i.e., it's very chatty and doesn't
tolerate the highly variable latency that smaller AMIs provide). The HBase
co
I've used it on clusters I started on EC2 launched by Whirr. Simply copy
the fat
jar to your client machine and it will distribute normally as a M/R
dependency.
It works very well.
The only limitation I could potentially find (without much proof) was on VMs
with limited IO the RPC message overhea
This is interesting. Whirr can already spin up Hadoop MR clusters,
which can then run the Giraph jobs. Once Giraph is bootstrapped onto
YARN, this will make more sense as a Whirr service.
On Wed, Apr 4, 2012 at 9:43 PM, Avery Ching wrote:
> I don't use Whirr...I haven't heard it mentioned on th
I don't use Whirr...I haven't heard it mentioned on this forum yet. Anyone?
Avery
On 4/4/12 9:30 PM, Paolo Castagna wrote:
Hi,
seen this?
WHIRR-530 - Add Giraph as a service
https://issues.apache.org/jira/browse/WHIRR-530
This could be quite useful for users who want to give Giraph a s
Hi,
seen this?
WHIRR-530 - Add Giraph as a service
https://issues.apache.org/jira/browse/WHIRR-530
This could be quite useful for users who want to give Giraph a spin on cloud
infrastructure, just for testing or to run a few small experiments.
My experience with Whirr an small 10-20 nodes clu