You can easily add a function (say setup_pig) inside the function setup_cluster in this script <https://github.com/apache/spark/blob/master/ec2/spark_ec2.py#L649>
Thanks Best Regards On Thu, Feb 26, 2015 at 7:08 AM, Sameer Tilak <ssti...@live.com> wrote: > Hi, > > I was looking at the documentation for deploying Spark cluster on EC2. > http://spark.apache.org/docs/latest/ec2-scripts.html > > We are using Pig to build the data pipeline and then use MLLib for > analytics. I was wondering if someone has any experience to include > additional tools/services such as Pig/Hadoop in the above deployment > script? > >