Hi Olivier,

This would be an interesting experiment to run! There are benchmarks in Whirr that make this straightforward; see
https://cwiki.apache.org/confluence/display/WHIRR/Running+Benchmarks.

Cheers,
Tom

On Thu, Jan 13, 2011 at 3:34 PM, Olivier Grisel <olivier.gri...@ensta.org> wrote:
> Hi all,
>
> Has anyone tried to compare the speed / $ ratio for CPU-bound and IO-bound
> Hadoop MapReduce jobs?
>
> I have the impression that IO on EC2 is not very good: the duration of a
> "distcp" command from and to S3 to and from HDFS over local disk on
> the same amount of data can vary a lot from one run to another. Is that
> also the case on the Rackspace cloud?
>
> --
> Olivier
> http://twitter.com/ogrisel - http://github.com/ogrisel