Hello TJ, welcome to the Drill community! To be honest we only recently enabled drill to run distributed queries in EC2 and other cloud providers, by removing a dependency that required UDP multicast, a feature unsupported between nodes on EC2 and Google CE. As such, we have not done a lot of testing on EC2, but we have confirmed that following the install procedure for deploying Drill in a clustered environment works with a series of EC2 nodes, that have the appropriate ports opened between them. We are in the process of trying to prepare a release for an apache vote, but the current tip of master is stable and you can run a build to use the new multicast free version of Drill.
https://cwiki.apache.org/confluence/display/DRILL/Deploying+Apache+Drill+in+a+Clustered+Environment https://cwiki.apache.org/confluence/display/DRILL/Ports+Used+by+Drill Late last year a few members of our community did create scripts for deploying Drill on EMR specifically. As we were unable to run distributed queries in this environment, we have not prioritized maintaining the script for a number of months. If you are willing to get your hands dirty, you can see if the script still works with the current version and report back your results. I will also try to contact Timothy Chen, the committer who drove the effort, to see if he would be interested in helping to update the script if need be. http://tnachen.wordpress.com/2013/12/24/drill-on-aws-emr/ - Jason Altekruse On Thu, Dec 4, 2014 at 7:13 AM, Yi Yang <[email protected]> wrote: > Wow and here comes a node.js talent! > > Sent from my iPhone > > > On Dec 4, 2014, at 10:26 PM, TJ Holowaychuk <[email protected]> wrote: > > > > Hello, your documentation recommends contacting via an EMR configuration > – > > so here I am :). Excited to give it a test drive! > > > > -- > > TJ Holowaychuk > > Engineer > > Segment >
