Re: BlockManager issues

2014-09-22 Thread David Rowe
I've run into this with large shuffles - I assumed that there was contention between the shuffle output files and the JVM for memory. Whenever we start getting these fetch failures, it corresponds with high load on the machines the blocks are being fetched from, and in some cases complete unrespons

Re: EC2 clusters ready in launch time + 30 seconds

2014-10-02 Thread David Rowe
I think this is exactly what packer is for. See e.g. http://www.packer.io/intro/getting-started/build-image.html On a related note, the current AMI for hvm systems (e.g. m3.*, r3.*) has a bad package for httpd, which causes ganglia not to start. For some reason I can't get access to the raw AMI to

Re: EC2 clusters ready in launch time + 30 seconds

2014-10-06 Thread David Rowe
after next week. We are planning on taking our client work around hive/spark, plus taking over the bigtop automation work to modernize and get that fit for human consumption outside