Hi guys I figured this was worth sharing as its what I was working on whilst sat with Lewis and Kellen at ApacheCon.
I'm looking at creating a Juju deployment for Joshua which people can instantly attach to Hadoop to train models, but instead of using Hadoop on a standalone mode, I want to be able to simply deploy the same code in the cloud and scale up my training if required (I'm not a translation guy so I don't know how that would work in real life performance, but to the sys admin in me, it makes sense). Anyway, I figured I'd put together a sped up and cut up demo that shows the deployment in AWS: https://www.youtube.com/watch?v=dnOQEVSMB-4&feature=youtu.be This deploys Joshua 6.0.5 on its own compute node, and also a multi node hadoop cluster (which you can scale with 1 command), and associates the two. I need to finialise the hadoop client plumbing but should be done early next week. Anyway, if there is an appitite for this alongside whatever docker stuff people are working on, I'll happily commit the charms( the code that runs it) back to the Joshua git repo and we can maintain it in a more "official" manner. Tom -------------- Director Meteorite.bi - Saiku Analytics Founder Tel: +44(0)5603641316 (Thanks to the Saiku community we reached our Kickstart <http://kickstarter.com/projects/2117053714/saiku-reporting-interactive-report-designer/> goal, but you can always help by sponsoring the project <http://www.meteorite.bi/products/saiku/sponsorship>)
