Re: Which instance type on Amazon EC2?

Paul Ingles Tue, 29 Sep 2009 11:28:14 -0700

Hi,

I don't have any real benchmarks or testing to speak of specifically for the performance benefits of a larger instance size. However, we have played around a little, and for our work (a form of document clustering) the benefits of a larger instance were far outweighed by having more of the less powerful instances. During the early days of our experiments with Hadoop and EC2, this was by far the most surprising thing (although in retrospect I guess it's not so strange!).


Not sure it answers your question, but food for thought hopefully.

Thanks,
Paul

On 29 Sep 2009, at 18:33, Brian Bockelman wrote:

Hey Kevin,
From seeing presentations from the HEP field (totally unrelated to Hadoop), I've seen folks claim the large instance is more than 4x better than the small, and less than 2x slower than extra-large. I.e., it provided that application the best bang for its buck.
In other words, you're not completely crazy for believing this, and other people have reported seeing non-linear differences between the different instance types. I suspect the "best" will depend highly on what your app is doing.
Brian

On Sep 29, 2009, at 12:19 PM, Kevin Peterson wrote:
Has anyone done any extensive testing of what instance types on Amazon EC2 give you the most bang for the buck?
Given the normal Hadoop recommendations of beefy machines, I would expect the best performance from the extra-large, but our testing showed otherwise. We did some rough testing while we were just getting started with something like a 10-node cluster, and we found that the extra-large instance doesn't come close to twice the actual performance of the large instance (pricing at $0.80 and $0.40). My rationalization is that some of the resources are shared, and the extra-large instance corresponds to the actual hardware, while the large instance sometimes gets to take advantage of IO and network bandwidth beyond 50% when the other tenant isn't doing much.
I'm revisiting our config because we're deploying HBase soon, and I'm not sure whether I would be better off going to the extra-large instances so that I can co-locate the tasktrackers and the region servers on the same nodes, or if I should stick with large instances and put HBase on separate servers. Mostly I'm wondering if my results were a fluke.
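
The price/performance comparison above can be sketched as a rough calculation. This is only an illustration: the two prices are the ones quoted in the message (assumed here to be hourly), while the function name and the 1.5x speedup are hypothetical placeholders, not measured results from anyone in this thread.

```python
def cost_per_unit_work(price_per_hour, relative_throughput):
    """Dollars per unit of work, with throughput relative to a baseline instance."""
    return price_per_hour / relative_throughput


# Prices quoted in the message (assumed to be per instance-hour).
LARGE_PRICE = 0.40
XLARGE_PRICE = 0.80

# Hypothetical placeholder: substitute the speedup you actually measure
# for extra-large relative to large on your own workload.
ASSUMED_XLARGE_SPEEDUP = 1.5

large_cost = cost_per_unit_work(LARGE_PRICE, 1.0)
xlarge_cost = cost_per_unit_work(XLARGE_PRICE, ASSUMED_XLARGE_SPEEDUP)

# Speedup the extra-large needs just to match the large on cost per unit of work.
break_even_speedup = XLARGE_PRICE / LARGE_PRICE

print(f"large:       ${large_cost:.3f} per unit of work")
print(f"extra-large: ${xlarge_cost:.3f} per unit of work")
print(f"break-even speedup for extra-large: {break_even_speedup:.1f}x")
```

At these prices, any measured speedup below the break-even value means the large instance is cheaper per unit of work, before accounting for other factors such as co-locating services on one node.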
